




PROCEEDINGS 

OF THE 

ROYAL SOCIETY OF LONDON 


SERIES A. MATHEMATICAL AND PHYSICAL SCIENCES 


VOL 198 


LONDON 

Published for the Royal Society by the 
Cambridge University Press 
Bentley House, N.W.1


7 September 1949 



Printed in Great Britain at the University Press, Cambridge
(Brooke Crutchley, University Printer)
and published by the Cambridge University Press
Cambridge, and Bentley House, London

Agents for Canada and India: Macmillan



CONTENTS 


SERIES A VOLUME 198 
No. A 1062. 22 July 1949

PAGE

The molecular orbital theory of chemical valency. I. The determination of molecular orbitals. By Sir John Lennard-Jones, F.R.S.    1

The molecular orbital theory of chemical valency. II. Equivalent orbitals in molecules of known symmetry. By Sir John Lennard-Jones, F.R.S.    14

A note on polar air-mass modification. By R. Frost    27

Unified field theory in a curvature-free five-dimensional manifold. By J. G. Bennett, R. L. Brown and M. W. Thring    39

The heats of formation of free CN and free CH$_2$, and the relationship between D(CO), D(CN) and D(N$_2$). By L. H. Long    62

The behaviour of waves on tidal streams. By N. F. Barber    81

Kinetic theory of diffusion in gases and liquids. I. Diffusion and the Brownian motion. By L. M. Yang    94

Eddy diffusion of water vapour and heat near the ground. By F. Pasquill    116

No. A 1063. 16 August 1949

The Royal Greenwich Observatory. By Sir Harold Spencer Jones, F.R.S. (Plates 1 to 4)    141

A note on the Riesz method and the method of residues. By F. C. Auluck and D. S. Kothari    170

Hot-wire investigation of the wake behind cylinders at low Reynolds numbers. By L. S. G. Kovásznay    174

Plastic deformation of silver chloride. I. Internal stresses and the glide mechanism. By J. F. Nye. (Plates 6 to 8)    190

One-dimensional dislocations. I. Static theory. By F. C. Frank and J. H. van der Merwe    205

One-dimensional dislocations. II. Misfitting monolayers and oriented overgrowth. By F. C. Frank and J. H. van der Merwe    216

The rate of evaporation of droplets. II. The influence of changes of temperature and of the surrounding gas on the rate of evaporation of drops of di-n-butyl phthalate. By J. Birks and R. S. Bradley    226

The rate of evaporation of droplets. III. Vapour pressures and rates of evaporation of straight-chain paraffin hydrocarbons. By R. S. Bradley and A. D. Shellard    239

The autoxidation of tetralin. By C. H. Bamford and M. J. S. Dewar    252

The dissociation energy of the N-N bond in hydrazine. By M. Szwarc    267

The dissociation energy of the C-N bond in benzylamine. By M. Szwarc    285



No. A 1064. 22 August 1949

PAGE

The work of the National Institute for Medical Research. By Sir Charles Harington, F.R.S. (Plates 9 and 10)    293

Kinetics of the base-catalyzed halogenation of some ketones and esters. By R. P. Bell, F.R.S., E. Gelles and Eva Möller    308

The catalytic hydrogenation of methyl elaeostearate, and of mixtures of elaeostearic with other polyethenoid long-chain esters. By T. P. Hilditch, F.R.S. and S. P. Pathak    323

Initiation of solid explosives by impact and friction: the influence of grit. By F. P. Bowden, F.R.S. and O. A. Gurton    337

Birth and growth of explosion in liquids and solids initiated by impact and friction. By F. P. Bowden, F.R.S. and O. A. Gurton. (Plates 11 to 14)    350

Influence of entrapped gas on initiation of explosion in liquids and solids. By A. Yoffe. (Plate 15)    373

The effect of diffusion of the main reactants on flame speeds in gases. By J. Corner    388

Paramagnetic resonance in the copper Tutton salts. By B. Bleaney, R. P. Penrose and Betty I. Plumpton    406

The theory of plane plastic strain for anisotropic metals. By R. Hill    428

A theory of the film phenomena of liquid helium II. By H. N. V. Temperley    438


No. A 1065. 7 September 1949

Spontaneous emulsification of pure xylene in an aqueous solution through mere adsorption of a detergent in the interface. By A. Kaminski and J. W. McBain, F.R.S. (Plate 16)    447

The diffraction of blast. I. By M. J. Lighthill    454

Kinetic theory of diffusion in gases and liquids. II. General kinetic theory of liquid mixtures. By L. M. Yang    471

The behaviour of continuous stanchions. By J. F. Baker, M. R. Horne and J. W. Roderick    493

Studies in polymerization. V. The polymerization of vinyl acetate. By G. Dixon-Lewis    510

The ignition of solid explosive media by hot wires. By E. Jones    523

Heisenberg's S matrix for a system of many particles. By R. J. Eden    540

Penetration of magnetic field into superconductors. II. Measurements by the Casimir method. By E. Laurmann and D. Shoenberg    560

Electroviscosity. IV. Some extensions of the theory of flow of liquids in narrow channels. By G. A. Elton and F. G. Hirschler    581

Index    590



Corrigendum 

Proceedings of the Royal Society A, vol. 197, plates 5 and 6 

The illustrations for plates 5 and 6 should be transposed, but the legends should 
remain as printed, i.e. the illustrations described in the legend to plate 5 actually 
appear on plate 6 and vice versa. 

The Royal Society regrets the inconvenience caused to the reader. 




The molecular orbital theory of chemical valency
I. The determination of molecular orbitals

By Sir John Lennard-Jones, F.R.S.

(Received 20 August 1948, publication delayed at request of author)


In the molecular orbital theory of valency the electrons are assigned to the whole molecule
rather than to atoms or to other localized parts. While the method has advantages in dealing
with the properties of a molecule as a whole, such as its energy states, the extension of each
orbital over the molecular framework is a disadvantage when dealing with localized
properties such as directed bonds. This paper deals in a general way with the equations which
molecular orbitals must satisfy, allowing for the exchange of electrons between orbitals. It is
then shown that when molecules have properties of symmetry the equations can be
transformed so as to be satisfied by orbitals which have the property of equivalence. These can be
regarded under certain conditions as directed orbitals and the conditions for these are
discussed. To illustrate the method molecules of the type $XY_2$ are considered.


1. Introduction

Modern theories of chemical valency are based on the principles of wave mechanics
and the theories have met with success so long as it has been possible to apply these
principles satisfactorily to problems of molecular structure. The difficulties and
limitations of existing theories arise not so much from any intrinsic deficiency in the
principles involved but rather in the intractable nature of the calculations to which
they lead. It is thus a central problem in theoretical chemistry to try to simplify
wherever possible the methods of calculations and to reduce to their simplest form
the wave equations which have to be solved. The object of this and the following
paper is to obtain the equations which must be satisfied by the orbitals which
electrons occupy in molecules and to try to transform them by the use of the
symmetry properties of molecules to a more convenient form.

There are at present two main methods of calculation. One of them, called the
electron pair method, is essentially a generalization of the calculation used by
Heitler and London for the hydrogen molecule. The essential feature of this
treatment is that when the electrons of interacting atoms are paired in spin a molecular
state is obtained which is lower in energy than that of the isolated atoms. The
development of this method has resulted in a close understanding of the structure of a wide
range of molecules and has led to a quantitative treatment, though necessarily an
approximate one, of the interactions of atoms in molecules. One advantage of this
method is that it is based on various 'states' of a molecule such as can be
represented by conventional structural formulae. The results can be interpreted in terms
of these basic structures and provide the chemist with a picture of a molecular state as
a superposition of acceptable structural formulae. A further advantage of the method
is that it permits the use of directed atomic orbitals (Pauling 1931, 1946; Slater
1931) and introduces the concept of directed valencies. The criterion used in
preparing an atom for interaction with other atoms is that the wave function should
be as concentrated as possible in the direction of union. It is possible to combine
(or hybridize) atomic wave functions such that a number of equivalent orbitals
can be regarded as directed in particular directions. Thus the carbon atom can be 
prepared to have four equivalent orbitals in tetrahedral directions, or alternatively 
to have three equivalent orbitals in trigonal directions with a fourth orbital in a 
direction at right angles to the plane of the other three. 

These two characteristic features of the electron pair theory have led to its 
extensive use in the theory of molecular structure and to its general acceptance by
chemists as a satisfying picture of the processes at work in determining the properties 
of molecules. 

The other main method of calculating the properties of molecules is called the 
molecular orbital theory. It was first used for diatomic molecules (Lennard-Jones
1929) and later extended to ethylene and conjugated molecules (Hückel 1930, 1937).
It has been extensively used and developed by Mulliken (1932a, 1932b, 1933, 194^)
who has been particularly successful in interpreting the spectroscopic properties 
of molecules in terms of the individual orbitals to which the electrons in a molecule 
have to be assigned. Selection rules have been obtained in an elegant way from the 
symmetry properties of the electronic orbitals. 

In this method the electrons are supposed assigned to the molecule as a 
whole, consisting as it does of nuclei in a certain symmetrical arrangement relative 
to each other and other electrons in certain orbitals consistent with that symmetry. 
This method has marked advantages in dealing with the energy states of certain 
molecules, particularly conjugated molecules, for it determines the sequence of 
energy levels of the individual electrons in molecules just as in atoms and thus 
indicates the sequence of closed shells of electrons. In conformity with the Pauli 
exclusion principle the electrons can be assigned to the orbitals to give the state of 
lowest energy, or in some cases excited states, and so the energy content of the
molecule can be calculated relative to any convenient standard. 

Each of the two main methods of calculating the structure of molecules has its 
characteristic advantages and often both are used, the one to supplement the other. 
The calculations based on the methods are necessarily approximate at present, but
in spite of this the results obtained for molecular energies are in most cases in general 
agreement. 

The molecular orbital method has the merit of being more fundamental in its 
approach, for it treats electrons as belonging to the whole molecule rather than to 
atoms or to other localized parts of the molecule. But the extension of each orbital 
over the molecular framework is a disadvantage when it is necessary to deal with 
localized properties such as localized bonds. In this and the following paper the
foundations of the orbital theory are examined and equations obtained which the 
molecular orbitals must satisfy when interchanges between orbitals are allowed. 
It is shown that when molecules possess properties of symmetry these equations 
can be transformed in such a way that they are satisfied by orbitals which have the 
property of equivalence. Such equivalent orbitals are identical as regards their 
distribution and differ only in their orientation; they are, in fact, directed orbitals 
which can be regarded as forming localized bonds. The method by which these 
results are obtained is based on the group theory, which is a powerful aid in dealing
with molecules of known symmetry. The equations satisfied by equivalent orbitals
are simpler than those satisfied by molecular orbitals in that they involve functions 
of only one type and occur in sets so that the solution of any one implies a solution 
of the rest. To illustrate the method a simple example is worked out in this paper, 
a molecule of type $XY_2$ being considered, but it is hoped to deal more generally
with equivalent orbitals in a later paper and to give the equations which such 
orbitals must satisfy. Recent advances in computing devices encourage the hope 
that eventually it may be possible to solve these equations, as has already been done 
for atoms, and so by an inverse transformation to derive molecular orbitals. 


2. Molecular orbital wave functions

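The wave equation in atomic co-ordinates for a molecule containing $N$ electrons
and having nuclei of atomic numbers $Z_\alpha$ is (neglecting spin)

$$\Bigl(\sum_j H_j + \sum_{j>k}\frac{1}{r_{jk}} - E\Bigr)\Psi = 0, \tag{2.01}$$

where

$$H_j = -(\tfrac12)\nabla_j^2 - \sum_\alpha \frac{Z_\alpha}{r_{\alpha j}} \tag{2.02}$$

and $E$ is the energy of the electrons, the total energy being $E + \sum_{\alpha>\beta} Z_\alpha Z_\beta / r_{\alpha\beta}$.
$Z_\alpha$ is the atomic number of a typical nucleus and $r_{\alpha j}$ the distance of the $j$th electron
from it. The wave function $\Psi$ and the energy of the electrons $E$ will, of course, depend
on the nuclear configurations determined by the nuclear distances $r_{\alpha\beta}$.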
The molecular orbital method of approximating to the solution of the above
equation is to suppose that another equation can be found of the type

$$\Bigl(\sum_j H_j^{0} - E\Bigr)\Psi = 0, \tag{2.03}$$

where

$$H_j^{0} = -(\tfrac12)\nabla_j^2 - V_j \tag{2.04}$$

and $V_j$ is a function of the $j$th set of co-ordinates alone. The equation (2.03) can then
be solved in terms of functions $\psi$ which satisfy

$$(H^{0} - \epsilon)\psi = 0. \tag{2.05}$$

This is an equation in three dimensional space and its solutions $\psi_1, \psi_2, \ldots$ with energies
$\epsilon_1, \epsilon_2, \ldots$ give distributions in ordinary space which are called molecular orbitals.
The solution of the equation (2.03) then consists of products of the molecular orbital
wave functions, or sums of such products, and the possible energy values consist of
sums of the energies $\epsilon_1, \epsilon_2, \ldots$ of individual orbitals.

With each molecular orbital we may associate one or other of the wave functions
in the spin co-ordinates, denoted by $\alpha(\sigma)$ and $\beta(\sigma)$, the co-ordinate $\sigma$ denoting spin
and taking the two values $\pm 1$. The spin functions may be taken to be orthogonal
and normalized in the usual way.




The wave function of the $N$ electron system, which satisfies (2.03), is then,
including spin, of the type

$$\Phi = \det\{\psi_1(1)\alpha(1)\,\psi_1(2)\beta(2)\ldots\}, \tag{2.06}$$

and the energy is the sum of the energy contributions of the individual orbitals
corresponding to $\psi_1, \psi_2, \ldots$ which are assumed occupied by electrons. Since according
to the exclusion principle not more than two electrons occupy each orbital, the
energy is given by

$$E = \sum_r c_r\,\epsilon_r, \tag{2.07}$$

where $c_r$ is equal to 0, 1, or 2 according to the number of electrons in the orbital of
energy $\epsilon_r$. In this summation it has been supposed that all the orbitals have different
energies, but if some of them should have coincident energies as in cases of
degeneracy, then the above expression must be generalized accordingly, the permissible
values of $c_r$ being those given above for each member of a degenerate set.

There will usually be many solutions of the type (2.06) for the same energy (2.07)
owing to the possibility of assigning electrons to different orbitals with the same
energy.
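The bookkeeping behind (2.07) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the orbital energies are hypothetical numbers, not values from the paper, and the function names `fill_orbitals` and `total_orbital_energy` are invented here. Electrons are placed in orbitals of ascending energy with at most two per orbital, and the energy is the sum of $c_r\epsilon_r$.

```python
def fill_orbitals(energies, n_electrons):
    """Assign electrons to orbitals of ascending energy, at most two per orbital.

    Returns the occupation numbers c_r (each 0, 1 or 2) in the order of `energies`.
    """
    order = sorted(range(len(energies)), key=lambda r: energies[r])
    occupation = [0] * len(energies)
    remaining = n_electrons
    for r in order:
        if remaining == 0:
            break
        occupation[r] = min(2, remaining)  # exclusion principle: never more than two
        remaining -= occupation[r]
    if remaining:
        raise ValueError("not enough orbitals for the electrons")
    return occupation


def total_orbital_energy(energies, occupation):
    """Sum of c_r * epsilon_r over all orbitals, as in (2.07)."""
    return sum(c * e for c, e in zip(occupation, energies))


# Hypothetical orbital energies (arbitrary units), chosen only for illustration.
energies = [-1.5, -0.75, -0.25, 0.5]
occupation = fill_orbitals(energies, 5)
energy = total_orbital_energy(energies, occupation)
```

With five electrons the lowest two orbitals are doubly occupied and the third holds one electron. If two orbitals had the same energy, several assignments would give the same sum, which is the multiplicity of solutions noted in the text.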

One of the great advantages of this method is that the electrons are conceived as
moving in potential fields $V_j$ which depend on the position of each electron alone,
the effect of the other electrons on the one considered being assumed averaged over
their appropriate distributions. The field $V_j$ is assumed to be similar to that of the
nuclear framework and so is unchanged when equivalent nuclei are supposed
interchanged by any symmetry operation. The powerful methods of the group theory then
provide much information about the symmetry properties of the orbitals and so of the
composite wave functions (2.06). These symmetry properties permit a determination
of the possible changes from one molecular state to another and so give the selection
rules for the absorption and emission of light. The energies of the orbitals for fields
$V_j$ of different symmetries appropriate to a wide range of molecules have been worked
out. They usually involve integrals involving $V_j$ and so cannot be determined unless
the field is known, but the difficulty can often be side-stepped by regarding the
integrals as parameters which can be determined from one observable property and
then used to evaluate others.

3. A more general treatment of the orbital theory

The disadvantage of the method just outlined is that it does not make clear what
is the relation between the equation (2.03) and the actual one (2.01) which is to be
solved. A similar difficulty arose in dealing with the structure of atoms. At first
electrons were dealt with one at a time and the method of self-consistent fields was
devised and used extensively by Hartree (1927, 1930, 1947). Later an improvement
was introduced by Fock (1930) who used the variation method to transform equation
(2.01) involving $3N$ co-ordinates to $N$ equations each in three co-ordinates. This
treatment can be applied equally well to molecules. The method to be used in this
paper is that given by the author (1931) in the theory of atoms containing many
electrons.




It is assumed that the solution of the wave equation of a molecule can be expressed
in terms of functions of the co-ordinates of individual electrons (that is, without the
introduction of the relative co-ordinates of electrons). The most general way of
expressing the wave functions of all the electrons is by sums of products of individual
wave functions and, in particular, to get the right properties of antisymmetry in
the co-ordinates (Pauli principle) by a determinant of the type (2.06). When there
are many such determinants corresponding to the same total orbital energy, then
it may be necessary to consider sums of these. For the ground state of a molecule,
or for any state which corresponds to a set of complete electron shells, one
determinant is sufficient. It is also possible to deal by a similar treatment with an electronic
configuration of a molecule corresponding to a number of closed shells and a number
of electrons outside these shells in different orbitals with the same spin. Such a
configuration corresponds to a term of multiplicity $(2S+1)$, where $S$ is half the number
of outer electrons.

To cover this latter case we suppose that there are $p$ orbitals in the closed shells,
each occupied by an electron of $\alpha$ and $\beta$ spin, and $(q-p)$ outer electrons each with
a spin $\alpha$ (or $\beta$). We then have altogether $q$ electrons with $\alpha$ spin and $p$ with $\beta$.
The determinant is then

$$\Phi = \det\{\psi_1(1)\alpha(\sigma_1)\,\psi_2(2)\alpha(\sigma_2)\ldots\psi_q(q)\alpha(\sigma_q)\,\psi_1(q+1)\beta(\sigma_{q+1})\ldots\psi_p(N)\beta(\sigma_N)\}. \tag{3.01}$$

We have so far made no assumptions about the $\psi$'s, the orbital wave functions, but
it is possible to derive from the last $p$ functions a set which are orthogonal to each
other. Such a transformation will not alter $\Phi$ except by a numerical factor. Similar
linear combinations can be made of the first $p$ functions containing $\alpha$ instead of $\beta$
spin factors. Finally linear combinations of the functions so derived and the
remaining $(q-p)$ functions can be made so that all the functions used in $\Phi$ are
orthogonal, either because of their spatial properties or because of their spin factors.
Normalization of each of the component functions will only alter $\Phi$ by a numerical
factor and so we may suppose this adjustment made without loss of generality.
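The statement that a linear recombination of the functions alters the determinant only by a numerical factor can be checked numerically. The sketch below is an illustration, not part of the original argument: the matrix `A` holds hypothetical values of three functions (rows) at three electron co-ordinates (columns), `T` is an arbitrary mixing matrix, and the helper names are invented here. The determinant of the recombined functions equals $\det T$ times the original determinant.

```python
from itertools import permutations


def det(matrix):
    """Determinant by the permutation expansion (adequate for small matrices)."""
    n = len(matrix)
    total = 0
    for perm in permutations(range(n)):
        # Parity of the permutation from its number of inversions.
        inversions = sum(
            1 for a in range(n) for b in range(a + 1, n) if perm[a] > perm[b]
        )
        term = -1 if inversions % 2 else 1
        for row, col in enumerate(perm):
            term *= matrix[row][col]
        total += term
    return total


def recombine(T, A):
    """New functions as linear combinations of the old ones: rows of T times A."""
    size = len(A)
    return [
        [sum(T[i][k] * A[k][j] for k in range(size)) for j in range(size)]
        for i in range(size)
    ]


# Hypothetical values of three functions (rows) at three co-ordinates (columns).
A = [[1, 2, 0],
     [0, 1, 3],
     [2, 0, 1]]
# Hypothetical mixing matrix, e.g. the kind produced by an orthogonalization.
T = [[1, 0, 0],
     [1, 1, 0],
     [2, -1, 3]]

original = det(A)
mixed = det(recombine(T, A))
factor = det(T)
```

Here `mixed == factor * original` holds exactly, so the antisymmetric wave function built from the recombined functions is the same function of the electron co-ordinates up to the constant `factor`.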

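The Euler variation equation corresponding to the wave equation (2.01) is

$$\delta\int\bar\Phi\Bigl(\sum_j H_j + \sum_{j>k}\frac{1}{r_{jk}} - E\Bigr)\Phi\,d\tau = 0, \tag{3.02}$$

or

$$\int\Bigl(\sum_i\delta_i\bar\Phi\Bigr)\Bigl(\sum_j H_j + \sum_{j>k}\frac{1}{r_{jk}} - E\Bigr)\Phi\,d\tau = 0, \tag{3.03}$$

where $\delta_i$ indicates a variation of functions containing the $i$th co-ordinates wherever
they appear in $\bar\Phi$.

Now

$$\bar\Phi\Phi = \rho = \det\{\rho(i,j)\}, \tag{3.04}$$

where

$$\rho(i,j) = \sum_{l=1}^{N}\bar\phi_l(i)\,\phi_l(j) \tag{3.05}$$

and the functions $\phi_l$ include an orbital factor $\psi$ and a spin factor. There are $N$ such
functions, the first $q$ containing an $\alpha$ spin factor and the remainder a $\beta$ spin factor.
The determinant in (3.04) contains $N$ rows and columns, $i$ and $j$ separately taking
all values from 1 to $N$.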





Equation (3.03) can be written in this notation as

$$\int\Bigl(\sum_i\delta_i\Bigr)\Bigl(\sum_j H_j + \sum_{j>k}\frac{1}{r_{jk}}\Bigr)\rho\,d\tau = E\int\Bigl(\sum_i\delta_i\Bigr)\rho\,d\tau, \tag{3.06}$$

where $\delta_i$ now indicates a variation of the functions $\bar\phi$ whenever they contain the $i$th
co-ordinates in $\rho$. Thus

$$\delta_i\,\rho(i,j) = \sum_{l=1}^{N}\delta\bar\phi_l(i)\,\phi_l(j). \tag{3.07}$$

Similarly $H_j$ operates on the $\phi$ functions in $\rho$ which contain the $j$th co-ordinates,
so that

$$H_j\,\rho(i,j) = \sum_{l=1}^{N}\bar\phi_l(i)\,H_j\phi_l(j), \tag{3.08}$$

and

$$\delta_i H_j\,\rho(i,j) = \sum_{l=1}^{N}\delta\bar\phi_l(i)\,H_j\phi_l(j). \tag{3.09}$$

Thus $\delta_i H_j\rho$ represents a determinant in which terms such as those of (3.07) and
(3.08) occur in the $i$th row and the $j$th column, the common member being (3.09).
If the terms occurring in this determinant are denoted by $\omega_{ij}$, where $i$ indicates the
row and $j$ the column, then

$$\int\omega_{ik}\,\omega_{kj}\,d\tau_k = \omega_{ij} \tag{3.10}$$

for all values of $i$ and $j$ including the $i$th row and the $j$th column. It is thus possible
to integrate $(\delta_i H_j)\rho$ over all variables except the $i$th and $j$th and the result is

$$\int(\delta_i H_j)\rho\,d\tau = (N-2)!\iint\delta_i H_j\begin{vmatrix}\rho(i,i) & \rho(i,j)\\ \rho(j,i) & \rho(j,j)\end{vmatrix}\,d\tau_i\,d\tau_j
= (N-2)!\Bigl\{\sum_m\sum_n H_{nn}\int\delta\bar\psi_m\,\psi_m\,dx - S_{m,n}\,H_{mn}\int\delta\bar\psi_n\,\psi_m\,dx\Bigr\}, \tag{3.11}$$

where in the last equation the integration over the spin co-ordinates has been carried
out and the symbol $S_{m,n}$ represents a summation over those pairs of functions,
which have the same spin factor in $\Phi$; those with different spin factors are orthogonal
and vanish on integration. The space co-ordinates for the remaining integrals are
denoted by $x$, a suffix no longer being necessary. In the above equation

$$H_{mn} = \int\bar\psi_m\,H\,\psi_n\,dx. \tag{3.12}$$

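In a similar way it is found that

$$\int(\delta_j H_j)\rho\,d\tau = (N-1)!\sum_n\int\delta\bar\psi_n\,H\,\psi_n\,dx, \tag{3.13}$$

$$\int\Bigl(\delta_j\frac{1}{r_{jk}}\Bigr)\rho\,d\tau = (N-2)!\Bigl\{\sum_n\int\delta\bar\psi_n\,V(x)\,\psi_n\,dx - S_{m,n}\int\delta\bar\psi_n\,G_{mn}(x)\,\psi_m\,dx\Bigr\}, \tag{3.14}$$

where

$$G_{mn}(x) = \int\bar\psi_m(x')\,\psi_n(x')\,(1/r)\,dx', \tag{3.15}$$

$r$ being the distance between the points $x$ and $x'$,
is the potential of a distribution $\bar\psi_m\psi_n$ at a point whose co-ordinates are represented
by $x$. $V(x)$ is the potential due to all the distributions

$$V(x) = \sum_n G_{nn}(x). \tag{3.16}$$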

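Similarly $(\delta_i/r_{jk})\rho$ can readily be integrated over all variables except the $i$th, $j$th
and $k$th and $\rho$ is reduced to a three column determinant in these variables. The
remaining integrations then lead to the equation

$$\int\Bigl(\sum_{i\neq j,k}\delta_i\Bigr)\Bigl(\sum_{j>k}\frac{1}{r_{jk}}\Bigr)\rho\,d\tau
= \tfrac12(N!)\Bigl[\sum_{l,m}(lm|G|lm) - S_{l,m}(lm|G|ml)\Bigr]\sum_n\int\delta\bar\psi_n\,\psi_n\,dx
- (N!)\Bigl\{\sum_l S_{m,n}(lm|G|ln)\int\delta\bar\psi_n\,\psi_m\,dx - S_{l,m,n}(lm|G|nl)\int\delta\bar\psi_n\,\psi_m\,dx\Bigr\}, \tag{3.17}$$

where

$$(lm|G|nl) = \iint\bar\psi_l(x_1)\,\bar\psi_m(x_2)\,(1/r_{12})\,\psi_n(x_1)\,\psi_l(x_2)\,dx_1\,dx_2 \tag{3.18}$$

and similar integrals for the other terms. The summation $S_{l,m,n}$ is over those sets of
three functions which have the same spin in $\Phi$. The energy $E$ is given by

$$E = \int\Bigl(\sum_j H_j + \sum_{j>k}\frac{1}{r_{jk}}\Bigr)\rho\,d\tau\Big/\int\rho\,d\tau
= \sum_n H_{nn} + \tfrac12\Bigl\{\sum_{l,m}(lm|G|lm) - S_{l,m}(lm|G|ml)\Bigr\}. \tag{3.19}$$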


When the above results are substituted in equation (3.06), we obtain an integral
containing the differentials $\delta\bar\psi_n$ each multiplied by a factor. If equation (3.06) is
to be satisfied for all variations $\delta\bar\psi_n$ the factors must separately vanish, and the
result is a set of linear differential equations in three co-ordinates instead of
Schrödinger's equation in $3N$ co-ordinates. These equations are

$$[H + V(x)]\,c_n\,\psi_n(x) - S^{(n)}_m\Bigl\{G_{mn}(x) + H_{mn} + \sum_l(lm|G|ln)\Bigr\}\psi_m + S^{(n)}_{l,m}(lm|G|nl)\,\psi_m = 0, \tag{3.20}$$

where $c_n$ is the number of times $\psi_n$ occurs in $\Phi$ and is either 1 or 2; $S^{(n)}_m$ and $S^{(n)}_{l,m}$ are
summations over those functions $\psi_m$ and $\psi_l$ which have the same spin factor as $\psi_n$
in $\Phi$. For a molecule containing only closed shells with every orbital occupied
by an electron with $\alpha$ spin and one with $\beta$ spin, we have $c_n$ equal to 2 and

$$S^{(n)}_m = 2\sum_{m=1}^{\frac12 N};\qquad S^{(n)}_{l,m} = 2\sum_{l=1}^{\frac12 N}\sum_{m=1}^{\frac12 N}.$$

The set of equations (3.20), being equivalent to the wave equation (2.01), is
accurate subject to the assumption of a solution of the type (3.01).



4. Molecules of known symmetry


In attempting to solve the equations (3.20) we are to remember that the nuclear
framework will in many molecules of interest have certain properties of symmetry.
This is implied in the properties of the Hamiltonian $H$, which is given by

$$H = -(\tfrac12)\nabla^2 - \sum_\alpha\frac{Z_\alpha}{r_\alpha}, \tag{4.01}$$

where $r_\alpha$ is the distance of a point from the nucleus of atomic charge $Z_\alpha$.

If we confine the present discussion to a molecule in its normal state consisting
of closed shells, equation (3.20) becomes

$$\{H + V'_n(x) - E_{nn}\}\psi_n = S'_m\{G_{mn}(x) + E_{mn}\}\psi_m, \tag{4.02}$$

where

$$E_{mn} = H_{mn} + S^{*}_l(lm|G|ln) - S'_l(lm|G|nl) \tag{4.03}$$

and

$$V'_n(x) = S^{*}_m\,G_{mm}(x), \tag{4.04}$$

and $S'$ used in (4.02) is a summation over all space orbitals, counted once only,
omitting the orbital $\psi_n$, while $S^{*}$ is a summation over all space orbitals, counted
twice, except $\psi_n$ which occurs only once.

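The equations (4.02) can also be written in the abbreviated form

$$(\mathcal{O})(\psi) = 0, \tag{4.05}$$

where $(\mathcal{O})$ is an operator matrix, the diagonal elements being

$$\mathcal{O}_{nn} = H + V'_n(x) - E_{nn} \tag{4.06}$$

and the remainder being

$$\mathcal{O}_{nm} = -\{G_{mn}(x) + E_{mn}\}; \tag{4.07}$$

$(\psi)$ denotes the array of occupied $\psi$ functions.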

If equation (4.02) is multiplied on the left by $\bar\psi_n$ and integrated over the whole
of space, the result given in (4.03) is obtained. $E_{nn}$ is clearly the energy of an electron
in the orbital $\psi_n$ moving in the presence of the field of the nuclei $(H_{nn})$ and the
field of the other electrons, averaged over their respective charge distributions
$(S^{*}_l(ln|G|ln))$, with an additional contribution $(-S'_l(ln|G|nl))$ owing to the
field $G_{ln}(x)$, which arises from the possibility of interchange between one orbital
and another.

The total energy of the electrons is not obtained simply by adding the energies
of the individual orbitals because the electrostatic contributions, represented by
the last two terms in $E_{nn}$, would be counted twice; it is given by equation (3.19).
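The double counting can be made concrete with a small numerical sketch. It assumes a closed-shell system of two space orbitals with real wave functions, and all integral values are hypothetical. `J[l][n]` stands for the Coulomb integral $(ln|G|ln)$ and `K[l][n]` for the exchange integral $(ln|G|nl)$, with `K[n][n] == J[n][n]`; under that condition the sum $2J - K$ over all space orbitals used below gives the same orbital energy as the starred and primed summations of the text. The total energy is the closed-shell form of (3.19) as worked out here, not a formula quoted from the paper.

```python
def orbital_energies(H, J, K):
    """E_nn = H_nn + sum over space orbitals l of (2*J_ln - K_ln)."""
    n_orb = len(H)
    return [
        H[n] + sum(2 * J[l][n] - K[l][n] for l in range(n_orb))
        for n in range(n_orb)
    ]


def total_energy(H, J, K):
    """Closed-shell form of (3.19): 2*sum(H_nn) + sum over l, m of (2*J_lm - K_lm)."""
    n_orb = len(H)
    repulsion = sum(
        2 * J[l][m] - K[l][m] for l in range(n_orb) for m in range(n_orb)
    )
    return 2 * sum(H) + repulsion


# Hypothetical integrals for two doubly occupied space orbitals.
H = [-2.0, -1.0]
J = [[0.5, 0.25],
     [0.25, 0.5]]
K = [[0.5, 0.125],
     [0.125, 0.5]]

E_orb = orbital_energies(H, J, K)
E_total = total_energy(H, J, K)
naive_sum = 2 * sum(E_orb)  # two electrons per orbital
half_and_half = sum(h + e for h, e in zip(H, E_orb))
```

For these numbers `naive_sum` lies above `E_total` by exactly the electron repulsion term, while `half_and_half`, the sum of $H_{nn} + E_{nn}$ over the space orbitals, reproduces `E_total`.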

A good approximation to the solution of the equations (4.02) for atoms has been
obtained by neglecting the right-hand side of the equation, and consequently also
the non-diagonal matrix elements $(ln|G|nl)$ which occur in $E_{nn}$ owing to the
presence of $G_{ln}(x)$. In justification for this step it may be observed that whereas
all the members $G_{mm}(x)$ occurring in $V'_n(x)$ are positive in all parts of space, the
functions $G_{mn}(x)$ are positive in some regions of space and negative in others; in
fact $G_{mn}(x)$ is the electrostatic potential at any point due to a unit charge distributed
over space according to $\bar\psi_m\psi_n$ and the latter function, when integrated over the
whole of space, vanishes because of the orthogonal property. For a similar reason
the only remaining factor of $\psi_m$ on the right-hand side, $E_{mn}$, is also neglected.

We may adopt a similar procedure for molecules and attempt as a first
approximation to find solutions of

$$\{H + V'_n(x) - \mathscr{E}_n\}\psi_n = 0 \tag{4.08}$$

with

$$\mathscr{E}_n = H_{nn} + S^{*}_l(ln|G|ln). \tag{4.09}$$

The methods of group theory can be applied to equation (4.06) to find the type of
solution for a field $H$ of given symmetry. It is one of the results of this theory
that the electron distribution of closed shells has the same symmetry as that of the
nuclear framework. Thus all distributions $\bar\psi_k\psi_k$ of non-degenerate orbitals
have this symmetry and also $\sum_k\bar\psi_k\psi_k$ when $\psi_k$ belongs to a degenerate set. It
follows that $V'_n(x)$ in (4.06) has the same symmetry as $H$ for all orbitals $\psi_n$ which
are non-degenerate. When $\psi_n$ belongs to a degenerate set, $V'_n(x)$ may not have the
same symmetry properties, but in this case we may suppose the rest of the
functions $\sum_k\bar\psi_k\psi_k$ of the same set so averaged as to produce the right symmetry. This
procedure was necessary in many electron atoms when an electron in a $p$ or $d$ orbital
was being considered. We shall accordingly suppose that the operator in equation
(4.06) has the same symmetry properties as $H$ in all cases.

If, for example, the potential field has the symmetry $C_{2v}$ like a triangular molecule
of the type $XY_2$, which has an axis of symmetry ($z$), a plane of symmetry through
it ($yz$) at right angles to the plane of the nuclei and a plane of symmetry ($xz$) through
the nuclei, the possible types of orbital conform to the scheme in table 1. There are
four types of orbital and their behaviour, when rotated through 180° about the
axis of $z$ ($C_2$), or reflected in the $xz$ plane ($\sigma_v$), or reflected in the $yz$ plane ($\sigma'_v$), is given
by the numbers in the table. All the orbitals in this example are single, and so $V'_n(x)$
always has the symmetry of the group.

Table 1. Types of molecular orbital in a field of symmetry $C_{2v}$

           E     C_2    σ_v    σ'_v
  A_1      1      1      1      1
  A_2      1      1     -1     -1
  B_1      1     -1      1     -1
  B_2      1     -1     -1      1

We may thus imagine a set of solutions of equation (4.08) worked out conforming
to the appropriate symmetry requirements such as are given in table 1. Such
solutions of different symmetries will be orthogonal to each other and so will satisfy
the conditions laid down in § 3 above. The determinantal wave function, defined
in equation (3.01), must be built up of orbital wave functions satisfying the right
symmetry requirements as laid down by the group theory.

Once solutions of this kind have been worked out, the more accurate equations
(4.02) can be solved by a process of successive approximation. Thus the appropriate
functions $\psi_m$ can be substituted on the right-hand side and $G_{mn}(x)$ evaluated,
and then the right-hand side regarded as a perturbation of the $\psi_n$ which
satisfies the left-hand side alone.

Though the functions $\psi_m$ substituted on the right-hand side may have symmetry
properties which are different from $\psi_n$, we note that $G_{mn}(x)\psi_m$ has the same
symmetry as $\psi_n$ if $\psi_m$ belongs to a non-degenerate set. If $\psi_m$ belongs to a degenerate
set, then $\sum_m G_{mn}\psi_m$ summed over the members of the set has the same symmetry
as $\psi_n$. Further, if $\psi_m$ and $\psi_n$ have different symmetry properties, then $E_{mn}$ vanishes.
Hence the right-hand side of (4.02) has the same symmetry as $\psi_n$ and only those
numbers $E_{mn}$ remain for which $m$ and $n$ have the same symmetry.

Thus a set of functions may be derived which solve the equations (4.02) in the
sense that they give self-consistency and each function has the same symmetry
properties as the approximate solution derived from (4.08). Each of the orbitals so
determined will have an energy

$$E_{nn} = H_{nn} + S^{*}_l(ln|G|ln) - S'_l(ln|G|nl) \tag{4.10}$$

and the total energy of the system will be given by equation (3.19).

5. Directed orbitals

The functions determined by the above method give distributions which extend
throughout the nuclear framework and are thus molecular orbitals in the sense in
which that term is widely used. For spectroscopic purposes molecular orbitals have
advantages in that the energy levels of the individual electrons may be supposed
known so that electron transitions to other excited states may be understood and
described, while the selection rules may be easily deduced from the symmetry
properties of the orbitals. From the point of view of valency the method has
disadvantages in that the chemist is accustomed to think of electrons in chemical
bonds as localized. Also the bonds are known from experiment to be orientated in
definite directions. This feature has been well reproduced by the method of Pauling
(1931) and by Slater (1931) of taking linear combinations of atomic orbitals and
obtaining orbitals with directional properties. Thus an $s$ and three $p$ orbitals can be
combined to give an orthogonal set of four equivalent orbitals directed towards the
vertices of a regular tetrahedron (cf. Hultgren 1932; Kimball 1940 for further
examples). An illuminating discussion of the particular case of the methane molecule
has been given by Van Vleck (1933), assuming that molecular orbitals can be
expressed as linear sums of atomic orbitals. The treatment has recently been taken
further in an interesting way for this and other molecules by Coulson (1949). It
would be an advantage if directed orbitals could be obtained in terms of molecular
orbitals and shown to be part of the same logical scheme. It is the object of this
and the following paper to deal with this problem.

We consider in the first place a simple example of a molecule such as $XY_2$ of $C_{2v}$
symmetry and inquire whether two equivalent orbitals can be derived, each associated
with an $XY$ bond. Let the two orbitals be denoted by $\chi_1$ and $\chi_2$. Then under the
operations of the group, as given in table 1, we find that $\chi_1$ and $\chi_2$ change as in table 2.



Table 2. The behaviour of equivalent orbitals in a field of $C_{2v}$ symmetry

                          $E$        $C_2$      $\sigma_v$   $\sigma_v'$
  $\chi_1$                $\chi_1$   $\chi_2$   $\chi_1$     $\chi_2$
  $\chi_2$                $\chi_2$   $\chi_1$   $\chi_2$     $\chi_1$
  $(\chi_1, \chi_2)$      2          0          2            0
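The comparison of the last line of table 2 with the character table of the group can be sketched numerically. The block below is only an illustration: it assumes the standard $C_{2v}$ character table in the column order $E$, $C_2$, $\sigma_v$, $\sigma_v'$ (which is taken here to agree with table 1 of the paper), and the function name `reduce_c2v` is hypothetical.

```python
# Character table of C2v in the column order E, C2, sigma_v, sigma_v'.
# Standard Mulliken labels; assumed to agree with table 1 of the paper.
C2V = {
    "A1": (1, 1, 1, 1),
    "A2": (1, 1, -1, -1),
    "B1": (1, -1, 1, -1),
    "B2": (1, -1, -1, 1),
}


def reduce_c2v(chars):
    """Return how often each irreducible representation occurs in `chars`."""
    order = 4  # number of operations in C2v, each in a class of its own
    return {
        name: sum(a * b for a, b in zip(row, chars)) // order
        for name, row in C2V.items()
    }


# Characters of the set (chi_1, chi_2): the number of orbitals left unmoved.
counts = reduce_c2v((2, 0, 2, 0))
print(counts)
```

With this column order the set reduces to one totally symmetric orbital and one orbital that changes sign under $C_2$.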


The character scheme of the set $\chi_1$, $\chi_2$ is given in the last line. From this it follows
by comparison with table 1 that the set must be made up of orbitals of types $A_1$
and $B_1$. We suppose, therefore, that there are two molecular orbitals, which we label
as $\psi_1$ and $\psi_2$, one being of symmetry $A_1$ and the other of $B_1$. For simplicity we will
consider these orbitals alone apart from any others which may be occupied in the
molecule, and suppose that each contains two electrons, one of $\alpha$-spin and the other
of $\beta$-spin. The equations satisfied by $\psi_1$ and $\psi_2$ are obtained from (4·02), viz.

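$$\{H + V_1'(x) - E_1\}\psi_1 = G_{21}(x)\,\psi_2,$$
$$\{H + V_2'(x) - E_2\}\psi_2 = G_{12}(x)\,\psi_1, \qquad (5\cdot01)$$

$V_1'(x)$ being the electrostatic field on an electron in orbital $\psi_1$, and similarly for
$V_2'(x)$. $E_1$ and $E_2$ are here written for $E_{11}$ and $E_{22}$.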

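Now let $\chi_1$ and $\chi_2$ be linear sums of $\psi_1$ and $\psi_2$ which are to be orthogonal. If
$\psi_1$ and $\psi_2$ are also supposed normalized, then $\chi_1$ and $\chi_2$ satisfy these conditions and
are also normalized, if

$$\chi_1 = (1/\sqrt{2})(\psi_1 + \psi_2), \qquad \chi_2 = (1/\sqrt{2})(\psi_1 - \psi_2). \qquad (5\cdot02)$$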

It follows that $\chi_1$ and $\chi_2$ can equally well be used in the molecular wave function $\Phi$
and the same methods used to determine the equations satisfied by $\chi_1$ and $\chi_2$.
Thus from equation (4·02) we have at once

$$\{H + v_1'(x) - e_{11}\}\chi_1 = \{g_{21}(x) + e_{21}\}\chi_2, \qquad (5\cdot03)$$

$$\{g_{12}(x) + e_{12}\}\chi_1 = \{H + v_2'(x) - e_{22}\}\chi_2, \qquad (5\cdot04)$$

where $v_1'(x)$ contains terms such as

$$g_{11}(x) = \int \chi_1(x')\,\chi_1(x')\,(1/r)\,dx'$$

and $e_{12}$ is the energy term derived from (4·03) using the $\chi$ functions. Thus

$$e_{12} = h_{12} + \sum_l (l1\,|\,g\,|\,l2) - \sum_l (l1\,|\,g\,|\,2l), \qquad (5\cdot06)$$

where $h_{12}$ is the matrix element of $H$ with respect to the $\chi$ functions and $(\kappa\lambda\,|\,g\,|\,\mu\nu)$
similarly means an integral of $(1/r_{12})$ over the appropriate $\chi$ functions.

The equations (5·01) for the $\psi$ functions are similar in form to those above, except
that $E_{12}$ vanishes because the two $\psi$ functions have different symmetry properties
(and $H_{12} = 0$). On the other hand we find that

$$h_{11} = \tfrac{1}{2}(H_{11} + H_{22}) = h_{22}, \qquad (5\cdot07)$$

$$h_{12} = \tfrac{1}{2}(H_{11} - H_{22}). \qquad (5\cdot08)$$





The electrostatic contribution to the energy terms is invariant under the transformation and

$$(12\,|\,g\,|\,12) - (12\,|\,g\,|\,21) = (12\,|\,G\,|\,12) - (12\,|\,G\,|\,21),$$

so that we have
$$e_{11} = e_{22} = \tfrac{1}{2}(E_1 + E_2), \qquad (5\cdot10)$$

whereas
$$e_{12} = e_{21} = \tfrac{1}{2}(E_1 - E_2). \qquad (5\cdot11)$$
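The values (5·10) and (5·11) follow from applying the substitution (5·02) to a diagonal array of energy parameters. A minimal numerical sketch is given below; the energies `E1` and `E2` are hypothetical numbers in arbitrary units, and the helper `transform` is introduced only for illustration.

```python
import math


def transform(t, E):
    """Return e = t E t^T for real square matrices given as nested lists."""
    n = len(t)
    return [
        [
            sum(t[m][mu] * E[mu][nu] * t[k][nu]
                for mu in range(n) for nu in range(n))
            for k in range(n)
        ]
        for m in range(n)
    ]


s = 1 / math.sqrt(2)
t = [[s, s], [s, -s]]        # chi_1, chi_2 in terms of psi_1, psi_2, as in (5.02)
E1, E2 = -0.70, -0.50        # hypothetical molecular-orbital energies
E = [[E1, 0.0], [0.0, E2]]   # E_12 = 0 because the symmetries differ
e = transform(t, E)
print(e)
```

The diagonal elements come out as the mean of the two energies and the off-diagonal elements as half their difference, so the coupling between the two equivalent orbitals is small when the two molecular orbitals are nearly degenerate.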

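The equations for the $\chi$'s therefore become

$$\{H + v_1'(x) - \tfrac{1}{2}(E_1 + E_2)\}\chi_1 = \{g_{21}(x) + \tfrac{1}{2}(E_1 - E_2)\}\chi_2,$$
$$\{H + v_2'(x) - \tfrac{1}{2}(E_1 + E_2)\}\chi_2 = \{g_{12}(x) + \tfrac{1}{2}(E_1 - E_2)\}\chi_1. \qquad (5\cdot12)$$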

The same result can, of course, be obtained by a direct transformation of the
equations (5·01), using the relations

$$g_{11}(x) = \tfrac{1}{2}(G_{11} + G_{22}) + G_{12},$$
$$g_{22}(x) = \tfrac{1}{2}(G_{11} + G_{22}) - G_{12},$$
$$g_{12}(x) = g_{21}(x) = \tfrac{1}{2}(G_{11} - G_{22}). \qquad (5\cdot13)$$
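The relations (5·13) are bilinear identities and can be checked on a discrete model. The sketch below makes several assumptions that are not in the paper: a one-dimensional grid, two arbitrary trial functions `psi1` and `psi2`, and a softened kernel $1/(1+|x-x'|)$ in place of $1/r$ so that the sums stay finite.

```python
import math

xs = [-2.0 + 0.5 * i for i in range(9)]          # grid points (assumed)
psi1 = [math.exp(-x * x) for x in xs]            # even trial function (hypothetical)
psi2 = [x * math.exp(-x * x) for x in xs]        # odd trial function (hypothetical)

s = 1 / math.sqrt(2)
chi1 = [s * (a + b) for a, b in zip(psi1, psi2)]  # substitution (5.02)
chi2 = [s * (a - b) for a, b in zip(psi1, psi2)]


def field(f, g, x):
    """Discrete analogue of the integral of f(x') g(x') / r over x'."""
    return sum(fa * gb / (1.0 + abs(x - xp)) for fa, gb, xp in zip(f, g, xs))


x0 = 0.75                                         # point at which the fields are compared
G11, G22, G12 = field(psi1, psi1, x0), field(psi2, psi2, x0), field(psi1, psi2, x0)
g11, g22, g12 = field(chi1, chi1, x0), field(chi2, chi2, x0), field(chi1, chi2, x0)
print(g11, g22, g12)
```

Because only the bilinear structure is used, the same check passes for any choice of trial functions and kernel.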


The interesting feature of equations (5·12) is that they are identical except for
notation. $\chi_2$ is similar to $\chi_1$, for it is its mirror image in the $yz$ plane of symmetry
of the molecule. It is to be noted also that the function $g_{12}(x)$ involves the product
of $\chi_1$ and $\chi_2$ in the integrand, and when there is little overlap, it will be small. If
further the orbitals $\psi_1$ and $\psi_2$ have nearly the same energies, $(E_1 - E_2)$ will be small,
so that the whole of the right-hand side of (5·12) will be small. Hence to a first
approximation we may write

$$\{H + v_1'(x) - E\}\chi_1 = 0, \qquad (5\cdot14)$$

and the problem is reduced to finding a localized orbital (doubly occupied) in one
bond only, $XY$, of the molecule $XY_2$, subject to the electrostatic effect of the other $Y$
nucleus and a similar distribution in the other $XY$ bond. Since the latter bond
will partly screen the other $Y$ nucleus, the effect of the second bond may perhaps
be represented by a simple electrostatic distribution such as a dipole. The localized
orbital determined by (5·14) may thus be regarded as a molecular orbital embracing
two nuclei and perturbed by the presence of a similar orbital in the second bond.

In order that the $\chi$ directed orbitals may differ appreciably from the molecular
orbitals, it is necessary that $\psi_1$ and $\psi_2$ should overlap to a substantial amount. If,
for example, $\psi_1$ and $\psi_2$ were states which had energies differing widely from each
other, then the nodal surfaces of one would be more concentrated near the nuclei
than the other. In the outer parts of the molecule there would be little difference
between $\chi_1$ and $\chi_2$. The terms $g_{12}(x)$ and $\tfrac{1}{2}(E_1 - E_2)$ occurring on the right-hand side
of equation (5·12) would then both be large and the approximation equation (5·14)
would no longer be valid. Hence we see that the condition for directed orbitals is
that the energy difference $(E_1 - E_2)$ between molecular orbitals must be small.

To obtain directed orbitals from molecular orbitals all that has been done is to
apply certain mathematical transformations; the molecule remains exactly as
before. The molecule can thus be described equally well by molecular orbitals or
by directed orbitals. From the wave theory point of view either is equally valid.
It is rather a question of what particular property of the molecule we are interested 
in. If it is the energy levels and the differences in energy due to excitation, then the 
molecular orbital method gives the answer. We are not then concerned so much with 
the location of the electrons and they may be dispersed throughout the molecule. But 
when it is a question of the electron distribution in bonds, then a superposition of 
orbitals to give directed properties becomes appropriate. The energy of individual 
electrons is then of less importance and a knowledge of distribution implies less 
precise information about energy. But if an electron is excited, it is not possible to
say from which (directed) bond it came. It may have come from any one of a set of
equivalent bonds, which is another way of saying that it has been excited from
a molecular orbital.


References

Coulson, C. A. 1949 J. Chim. Phys. (in the Press; the author is indebted for an advance
copy of the manuscript).
Fock, V. 1930 Z. Phys. 61, 126.
Hartree, D. R. 1928 Proc. Camb. Phil. Soc. 24, 89, 111.
Hartree, D. R. 1930 Proc. Camb. Phil. Soc. 25, 225.
Hartree, D. R. 1947 For full references see Reports on Progress in Physics, 11, 113.
Hückel, E. 1930 Z. Phys. 60, 423.
Hückel, E. 1937 Z. Elektrochem. 43, 752, 827.
Hultgren, R. 1932 Phys. Rev. 40, 891.
Kimball, G. E. 1940 J. Chem. Phys. 8, 188.
Lennard-Jones, J. E. 1929 Trans. Faraday Soc. 25, 668.
Lennard-Jones, J. E. 1931 Proc. Camb. Phil. Soc. 27, 469.
Mulliken, R. S. 1932a Phys. Rev. 40, 65.
Mulliken, R. S. 1932b Phys. Rev. 41, 49, 761.
Mulliken, R. S. 1933 Phys. Rev. 43, 279.
Mulliken, R. S. 1941 For further references see Reports on Progress in Physics, 8, 231.
Pauling, L. 1931 J. Amer. Chem. Soc. 53, 1367.
Pauling, L. 1946 The Nature of the Chemical Bond. Cornell University Press.
Slater, J. C. 1931 Phys. Rev. 37, 481.
Van Vleck, J. H. 1933 J. Chem. Phys. 1, 177, 219.



The molecular orbital theory of chemical valency 
II. Equivalent orbitals in molecules of known symmetry 

By Sir John Lennard-Jones, F.R.S.

(Received 17 December 1948)

The paper aims at providing a connecting link between the theory of molecular orbitals 
and the theory of localized bonds. An examination is made of the fundamental equations
which must be satisfied by molecular orbitals in fields of known symmetry, particularly in
molecules of the type $XY_n$. It is shown by the methods of the group theory that these
equations can be transformed to others which involve sets of equivalent functions. These
are associated with equivalent orbitals which have the property of being identical as regards 
distribution in space and differ only in their orientation. It is shown that under certain 
conditions these can be regarded as localized orbitals associated with particular bonds. 
General formulae are obtained and applied to the particular cases of molecules of trigonal, 
tetrahedral and octahedral symmetry. 

1. Introduction 

The object of the preceding paper (Lennard-Jones 1949 ) was to find the relation 
between the electron-pair theory of valency, which deals with the interaction of 
electrons in localized bonds, and the molecular orbital theory of valency, which 
treats each electron as moving in the field of the whole molecular framework. In order 
to make progress with either of these theories, it is customary to assume, as a starting 
point, that each electron can be assigned to an orbital, which can be described by a 
function of three spatial co-ordinates. In the one case these orbitals are assumed to 
be localized between an atom and an interacting neighbour, while in the other the 
orbitals are distributed throughout the molecule, representing in fact stationary 
states of an electron in the field of all the nuclei and the averaged field of the other 
electrons. In each method the wave function of the whole molecule is expressed as 
a combination of functions of the individual electrons. To conform to Pauli's prin- 
ciple, these one-electron wave functions are combined in the form of a determinant. 

In the preceding paper equations were obtained for the molecular orbitals to which 
the electrons can be assigned in a molecule in its normal, unexcited state. It was 
then shown that in a symmetrical molecule such as $XY_2$ these equations could be
transformed so as to be satisfied by a distribution about one of the bonds $XY$ and
a similar distribution about the other $XY$ bond. These distributions were called
equivalent orbitals, because each was the mirror image of the other in the plane of
symmetry of the molecule. In this paper the results are generalized to apply to
a molecule of the type $XY_n$ in which there is a central atom $X$ and $n$ other atoms or
groups distributed in some symmetrical way about it.

2. Equations satisfied by molecular orbitals 

Equations were given in part I (equation (3-20)) to determine the wave functions 
of the molecular orbitals of a molecule either in its ground state or in an excited state, 
provided that this state arises from closed shells and outer electrons possessing the 



same spin. While the substance of the present paper will be concerned mainly with 
molecules in their ground states, or more accurately with those which consist of a set 
of closed shells of paired electrons, it may be of interest to consider the form of the 
equations for the more general case. 

When there are more electrons with one kind of spin than another the equations 
for molecular orbitals are unsymmetrical, and it is necessary to be careful as to the 
precise meaning to be attached to each term and each summation. An alternative 
way of writing the equations (3·20) of part I is as follows:

[H + V{x)-\ + ei) f^{z) - S (65^^55 + (*) + + JUfm 

m 

+ S = 0. (2-01) 

m 

In these equations $e^{\alpha}_n = 1$ or $0$; $e^{\beta}_n = 1$ or $0$, (2·02)

according as $\psi_n$ is associated with an $\alpha$ spin or not, and similarly with a $\beta$ spin or not.
$H$ is the Schrödinger operator for a single electron moving in the field of the nuclei
of the molecule and $H_{mn}$ its matrix element relative to $\psi_m$ and $\psi_n$. The definition of
the various operators is as follows:

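$$G_{mn}(x_1) = \int \bar\psi_m(x_2)\,\psi_n(x_2)\,(1/r_{12})\,dx_2, \qquad (2\cdot03)$$

where $x_1$ and $x_2$ stand for the spatial co-ordinates of two electrons,

$$V(x) = \sum_l (e^{\alpha}_l + e^{\beta}_l)\,G_{ll}(x), \qquad (2\cdot04)$$

$$J_{mn} = \sum_l (e^{\alpha}_l + e^{\beta}_l)\,(lm\,|\,G\,|\,ln), \qquad (2\cdot05)$$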

= (2-06) 

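with $$(\kappa\lambda\,|\,G\,|\,\mu\nu) = \iint \bar\psi_\kappa(x_1)\,\bar\psi_\lambda(x_2)\,(1/r_{12})\,\psi_\mu(x_1)\,\psi_\nu(x_2)\,dx_1\,dx_2. \qquad (2\cdot07)$$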


We note, in fact, that is the average value of G„^„(x) when taken over all orbitals 

occupied by electrons with a spin, viz. 


Tcc 

^ mn 






(2-08) 


There are similar definitions of I^mn I^mn sums over orbitals containing 
electrons with aov j3 spins. 

If we suppose that there are $p$ orbitals which are occupied by electrons with both
$\alpha$ and $\beta$ spins, and that there are $q$ outer electrons with one kind of spin only, say
$\alpha$ spin, then it is possible to express the equations (2·01), of which there are $(p+q)$
altogether, in the form

$$(\mathcal{H})(\psi) = 0, \qquad (2\cdot09)$$

where $(\mathcal{H})$ is an array of operators in matrix form and $(\psi)$ represents the set of
$(p+q)$ orbitals, expressed as a single-column matrix. The matrix $(\mathcal{H})$ is of $(p+q)$
dimensions and takes the form , 


where ^ is of dimensions and ^ of q dimensions. 



16 


Sir John Leimard- Jones 
If we “Writ© nnip^") ” J'(^) ^mth “ ^mn"^ 

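we can express the diagonal elements as

$$\mathcal{H}_{nn} = (e^{\alpha}_n + e^{\beta}_n)\{H + V'_{nn}(x) - H_{nn} - J_{nn}\} + e^{\alpha}_n K^{\alpha}_{nn} + e^{\beta}_n K^{\beta}_{nn}, \qquad (2\cdot10)$$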

and the non-diagonal elements as 

^mn~ — '^mn) "t (2'11) 

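The equations take a simpler and more symmetrical form when all the electrons
are paired, for then the matrix becomes of the $p$-dimensional type only. In this case we can write

$$\mathcal{H}_{nn} = 2\{H + V'_{nn}(x) - E_{nn}\}, \qquad (2\cdot12)$$

$$\mathcal{H}_{mn} = -2\{G_{mn}(x) + E_{mn}\}, \qquad (2\cdot13)$$

where
$$E_{mn} = H_{mn} + J_{mn} - K_{mn}, \qquad (2\cdot14)$$

so that the equations are
$$\mathcal{H}_{nn}\,\psi_n + {\sum_m}'\,\mathcal{H}_{mn}\,\psi_m = 0. \qquad (2\cdot15)$$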


We note that $E_{nn}$ is the energy of an electron in a stationary state in the field of the
nuclei and the space-charge distribution of the electrons in all other orbitals, less
a contribution due to the field $G_{mn}(x)$, which arises from the possibility of interchange
of electrons between orbitals.

We have already discussed in part I methods of solving these equations. All that
need be said here is that if the exchange property of the electrons is neglected, the
equations reduce to

$$\{H + V'_{nn}(x) - E_{nn}\}\psi_n = 0, \qquad (2\cdot16)$$

and solutions of this equation correspond to stationary states in the field of the
nuclei and the electrostatic field of the electrons in all other occupied orbitals,
represented by the term $V'_{nn}(x)$. The electrostatic field due to a set of closed shells
will have the symmetry of the nuclear framework, and so for non-degenerate states
the appropriate wave function will belong to one of the irreducible representations
of the symmetry group. There is a difficulty in the case of degenerate states, for then
$V'_{nn}(x)$ includes not only the field of a set of closed shells but also the field of the
remaining electrons in the same shell. The latter field will not, in general, have the
same symmetry as the nuclear framework. To preserve the symmetry properties of
the wave functions in degenerate sets it will be necessary to suppose the field on each
electron in the set so averaged that each moves in the same field, and, moreover, that
this field has the full symmetry of the molecule. The functions so obtained from
equation (2·16) will then be self-consistent, not only in the Hartree sense, but also
as regards their symmetry properties, for these will then conform to those of the
permissible irreducible representations. If these solutions are used as first approximations
of equation (2·15), nothing in the subsequent calculations will upset the
symmetry properties, though the values of the energy parameters and the wave
functions will be changed.




3. Sets of equivalent orbitals

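The equations (2·01) were obtained from a determinantal wave function of the
form

$$\Phi = \mathrm{Det}\{\psi_1(1)\alpha(1)\;\psi_1(2)\beta(2)\;\ldots\;\psi_p(2p-1)\alpha(2p-1)\;\psi_p(2p)\beta(2p)$$
$$\qquad\times\;\psi_{p+1}(2p+1)\alpha(2p+1)\;\ldots\;\psi_{p+q}(2p+q)\alpha(2p+q)\}, \qquad (3\cdot01)$$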

and the properties of the system will not be altered by any transformation which
leaves this wave function unchanged. Thus any orthonormal transformation of the
functions $\psi_1, \psi_2, \ldots$, which constitute its elements, will not change $\Phi$. It is unchanged
when the nuclear framework is subject to any of its symmetry operators, for this is
equivalent to multiplying the various columns of the determinant by numerical
factors and taking linear sums. The determinant is unaltered except for a numerical
factor, and for unitary transformations this factor is unity.
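The invariance of the determinant can be illustrated on the smallest possible case. The sketch below is not taken from the paper: it uses two hypothetical orbitals evaluated at two electron positions, mixes them with a rotation (a real orthogonal substitution of unit determinant), and compares the determinants before and after.

```python
import math


def det2(m):
    """Determinant of a 2 x 2 matrix given as nested lists."""
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]


def matmul(a, b):
    """Product of two matrices given as nested lists."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
        for i in range(len(a))
    ]


# Rows: orbitals psi_1, psi_2; columns: electron positions x_1, x_2.
# The numerical values are hypothetical.
M = [[0.8, 0.3],
     [0.1, -0.6]]

theta = 0.37  # arbitrary mixing angle
R = [[math.cos(theta), math.sin(theta)],
     [-math.sin(theta), math.cos(theta)]]

before = det2(M)
after = det2(matmul(R, M))  # determinant built from the mixed orbitals
print(before, after)
```

The equality holds because the determinant of a product is the product of the determinants, and the rotation has determinant unity.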

Though such transformations do not alter the system intrinsically, they can be
used to simplify the method of solution of the equations (2·01). Thus we endeavour
to find for a molecule of known symmetry a set of orbitals which are equivalent in
the sense that they are interchangeable under the operations of the group. It is
possible to determine from the character table appropriate to any type of molecular
symmetry which types of molecular orbitals can be superimposed to produce a set
of equivalent orbitals. The method is to find the character system of the equivalent
orbitals under all the operations of the group and then to compare the result with
the character system of the molecular orbitals (the irreducible representations of the
group). In general it is found that a set of equivalent orbitals for a molecule of the
type $XY_n$ can be expressed as a sum of a molecular orbital which has the perfect
symmetry of the group (the identity representation, usually denoted by $\Gamma_1$) and
others of single or degenerate type, which behave differently under the operations
of the group.

Suppose that $E, A, B, \ldots, P$ represent operations of the symmetry group to which
a molecule of type $XY_n$ belongs, $E$ being as usual the identity operator. Then the
irreducible representations of the group are given by a set of matrices of the kind
shown in table 1. Each entry such as $(a_j)$ represents a matrix of dimensions equal to
the degeneracy of the representation $\Gamma_j$, and $(e_j)$ represents a unit matrix of the same
dimensions. If $\Gamma_1$ is the identity representation, all the entries in the first row
are unity.

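Table 1. The representations of a symmetry group of type $XY_n$

                   $E$              $A$             $B$            ...   $P$
  $\Gamma_1$       $(e_1)$          $(a_1)$         $(b_1)$        ...   $(p_1)$
  $\Gamma_2$       $(e_2)$          $(a_2)$         $(b_2)$        ...   $(p_2)$
  $\Gamma_j$       $(e_j)$          $(a_j)$         $(b_j)$        ...   $(p_j)$
  $\chi$-set       $(\epsilon_n)$   $(\alpha_n)$    $(\beta_n)$    ...   $(\pi_n)$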

If a set of equivalent orbitals $\chi_1, \chi_2, \ldots, \chi_n$ exists for a molecule $XY_n$, the effect of
the operations $E, A, B, \ldots, P$ of the group will be to interchange the orbitals
in various ways. Hence the set will be represented by a series of matrices of the
type $(\epsilon_n), (\alpha_n), \ldots, (\pi_n)$ shown in the last row of the table. Each will be of dimensions
$n$, and each will contain only unity and zeros, unity occurring once in each row and
once in each column.
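Since each such matrix is a permutation matrix, its character is simply the number of orbitals which the operation leaves in place. A short sketch follows, using as an assumed example three equivalent orbitals and the three types of operation of a trigonal molecule; the index lists in `ops` and the helper names are illustrative choices.

```python
def permutation_matrix(p):
    """Return the matrix whose row i has a single 1 in column p[i]."""
    n = len(p)
    return [[1 if j == p[i] else 0 for j in range(n)] for i in range(n)]


def character(matrix):
    """Trace of a square matrix: the number of orbitals left unmoved."""
    return sum(matrix[i][i] for i in range(len(matrix)))


# Assumed example: three equivalent orbitals, indexed 0, 1, 2.
ops = {
    "E": [0, 1, 2],        # identity
    "C3": [2, 0, 1],       # cyclic interchange of all three orbitals
    "sigma_v": [0, 2, 1],  # reflexion which keeps one orbital and swaps two
}
matrices = {name: permutation_matrix(p) for name, p in ops.items()}
chars = {name: character(m) for name, m in matrices.items()}
print(chars)
```

Every row and every column of each matrix contains unity exactly once, and the traces give the character system 3, 0, 1.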

From a comparison of the character system of the $\chi$-set with that of the $\Gamma_1, \Gamma_2, \ldots$
representations, it is possible, if a set of equivalent orbitals exists, to show that the
same set of characters can be obtained by a superposition of some of the $\Gamma$'s. Thus
we may write

$$(\chi) = \sum_j c_j\,\Gamma_j, \qquad (3\cdot02)$$

where $c_j$ is some numerical factor, usually unity or zero. Since the dimension of the
matrix representation of the $\chi$-set is $n$, the dimension of the sum on the right-hand
side must also be $n$. The interpretation of this result is that a set of equivalent orbitals
can be found by superposition of molecular orbitals, whose symmetry properties
are the same as the representations $\Gamma_j$ included in the above formula.
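The numerical factors in a relation of the type (3·02) can be found from the usual orthogonality of characters. The sketch below applies this to four equivalent orbitals in a tetrahedral field; the character table of $T_d$ is the standard one, and the names `reduce_characters` and `coefficients` are hypothetical.

```python
# Classes of T_d in the order E, 8C3, 3C2, 6S4, 6sigma_d (standard table).
CLASS_SIZES = (1, 8, 3, 6, 6)
IRREPS = {
    "A1": (1, 1, 1, 1, 1),
    "A2": (1, 1, 1, -1, -1),
    "E": (2, -1, 2, 0, 0),
    "T1": (3, 0, -1, 1, -1),
    "T2": (3, 0, -1, -1, 1),
}


def reduce_characters(chars, irreps=IRREPS, sizes=CLASS_SIZES):
    """Return the multiplicity of each irreducible representation in `chars`."""
    order = sum(sizes)  # number of operations in the group
    result = {}
    for name, row in irreps.items():
        total = sum(g * a * b for g, a, b in zip(sizes, row, chars))
        result[name] = total // order
    return result


# Characters of four equivalent orbitals directed to the vertices of a
# tetrahedron: the number of orbitals left unmoved by an operation of each class.
chi_set = (4, 1, 0, 0, 2)
coefficients = reduce_characters(chi_set)
print(coefficients)
```

The multiplicities add up, with the degeneracies as weights, to the dimension of the set, here 1 + 3 = 4.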

The relation (3·02) has been used to indicate the types of directed atomic orbitals
which can be obtained from a superposition of s, p, d, ..., functions (Kimball 1940).
It is equally valid for molecular orbitals and it is now our purpose to show that
by its use a transformation of the equations satisfied by molecular orbitals can
be carried out.

Though the characters of $(\chi)$ and $\sum_j c_j\Gamma_j$ are the same, the matrices of the latter
representation, when written out in full in their $n$-dimensional form, will not
necessarily be the same as the matrices $(\epsilon_n), (\alpha_n), (\beta_n), \ldots, (\pi_n)$ representing the
$\chi$-set. Let the matrices of $\sum_j c_j\Gamma_j$ be denoted by $(E_n), (A_n), (B_n), \ldots, (P_n)$. It follows
from the group theory that there must be a relation between corresponding matrices
of the form

$$(A_n) = T(\alpha_n)T^{-1}, \qquad (B_n) = T(\beta_n)T^{-1}, \quad \ldots \qquad (3\cdot03)$$

where $T$ is some transformation operator; it will, of course, be represented by a
matrix $(t)$ of $n$ dimensions. Further, if $(\psi)$ denotes a set of molecular orbitals which
has the representation $\sum_j c_j\Gamma_j$, then the relation between the $\chi$'s and the $\psi$'s is given by

$$(\chi) = T(\psi). \qquad (3\cdot04)$$

Using this transformation the equations to determine the equivalent orbitals $(\chi)$
can be obtained from those satisfied by the molecular orbitals $(\psi)$ given in (2·01).
Thus the appropriate equations are

$$\mathcal{T}(\mathcal{H})(\psi) = 0, \qquad (3\cdot05)$$

where $\mathcal{T}$ indicates that every $\psi$ function is subject to a transformation $T$ both where
it appears in the matrix elements of $(\mathcal{H})$ and where it appears in $(\psi)$; or in other
words, every $\psi$ is changed to a $\chi$.




In order to illustrate the process it will be sufficient to consider the form taken by
(3·05) for one set of equivalent orbitals and to limit the discussion to a set of orbitals
which are doubly occupied. Equation (3·05) then becomes

$$(\mathcal{H}')(\chi) = 0, \qquad (3\cdot06)$$

where $(\mathcal{H}')$ is an operator matrix with

$$\mathcal{H}'_{nn} = 2\{H + v'_{nn}(x) - e_{nn}\}, \qquad (3\cdot07)$$

$$\mathcal{H}'_{mn} = -2\{g_{mn}(x) + e_{mn}\}, \qquad (3\cdot08)$$

and the functions $g_{mn}(x)$, $v'_{nn}(x)$ are obtained from $G_{mn}(x)$, $V'_{nn}(x)$, appearing in (2·03) and
(2·04), by changing each $\psi$ to one of the $\chi$ functions.

To complete the transformation it is necessary to calculate the value of $e_{nn}$ and $e_{mn}$
in terms of quantities which depend on the molecular orbitals. It is found possible
to express them in terms of the quantities $E_{nn}$ and $E_{mn}$ defined in equation (2·14).
Thus $e_{mn}$ is the quantity derived from $E_{mn}$ when in the various integrals from which
it is calculated the equivalent orbital wave functions are substituted for molecular
orbital wave functions and the relation (3·04) is used. For the case when the occupied
molecular orbitals form closed shells, we have from (2·14)

$$e_{mn} = h_{mn} + j_{mn} - k_{mn}. \qquad (3\cdot09)$$

Now $H_{mn}$ and $J_{mn}$ are the matrix elements with respect to $\psi_m(x_i)$ and $\psi_n(x_i)$ of
$H$ and the function $\rho(j,j)$ respectively, where

$$\rho(j,j) = \rho^{\alpha}(j,j) + \rho^{\beta}(j,j) \qquad (3\cdot10)$$

and
$$\rho^{\alpha}(j,j) = \sum_l e^{\alpha}_l\,\bar\psi_l(x_j)\,\psi_l(x_j). \qquad (3\cdot11)$$

Similarly (3*12) 

where $$\rho^{\alpha}(j,i) = \sum_l e^{\alpha}_l\,\bar\psi_l(x_j)\,\psi_l(x_i). \qquad (3\cdot13)$$


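Now the summation in the last equation covers a set of molecular orbitals which
include all the members of certain degenerate irreducible representations. In
such a case $\rho(j,j)$ and $\rho^{\alpha}(j,i)$ are invariant under any unitary transformation $T$.
It follows that

$$k_{mn} = \mathcal{T}K_{mn} = \iint \Big[\sum_\mu \bar t_{m\mu}\,\bar\psi_\mu(x_i)\Big]\,\rho^{\alpha}(j,i)\,(1/r_{ij})\,\Big[\sum_\nu t_{n\nu}\,\psi_\nu(x_j)\Big]\,dx_i\,dx_j = \sum_{\mu,\nu} \bar t_{m\mu}\,t_{n\nu}\,K_{\mu\nu}, \qquad (3\cdot14)$$

with a similar result for $h_{mn}$ and $j_{mn}$. Hence

$$e_{mn} = \sum_{\mu,\nu} \bar t_{m\mu}\,t_{n\nu}\,E_{\mu\nu}, \qquad (3\cdot15)$$

where $t_{m\mu}$ and $t_{n\nu}$ are elements of the matrix $(t)$ which represents the substitution $T$,
and $\bar t_{m\mu}$ the conjugate elements when the matrix elements are complex.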

This expression will usually simplify since the matrix elements $H_{\mu\nu}$, $J_{\mu\nu}$ and $K_{\mu\nu}$
vanish whenever the functions $\psi_\mu$ and $\psi_\nu$ have different symmetries, that is, belong
to different irreducible representations of the symmetry group. A similar result holds
when $\psi_\mu$ and $\psi_\nu$ are different members of a degenerate set, provided they are suitably
chosen. Accordingly, for a set of molecular orbitals which belong to different representations,
equation (3·15) simplifies to

$$e_{mn} = \sum_\mu (\bar t_{m\mu}\,t_{n\mu})\,E_{\mu\mu}. \qquad (3\cdot16)$$

With this expression for $e_{mn}$, or the more general one (3·15), the equations to
determine the set of equivalent orbitals are

$$\{H + v'_{nn}(x) - e_{nn}\}\chi_n = {\sum_m}'\,\{g_{mn}(x) + e_{mn}\}\chi_m, \qquad (3\cdot17)$$

and whereas the equations for the molecular orbitals (2·15) involve a set of different
functions $\psi_1, \psi_2, \ldots$, these equations involve only one unknown function, for $\chi_m$
differs from $\chi_n$ only in its orientation; in fact, the equations included in the set (3·17)
can be obtained from one another by cyclical changes of the suffixes, or what amounts
to the same thing, by supposing the molecule rotated so that the various $\chi$-functions
are interchanged. The set of equations is, therefore, equivalent to one equation and
the solution of this implies a solution of all. The problem is to find a distribution $\chi_n$
which, together with all the other similar but differently orientated distributions
$\chi_m$, renders the equation (3·17) self-consistent.

Once a set of equivalent orbitals has been found to solve the equation with
appropriate values of the parameters $e_{nn}$ and $e_{mn}$, the appropriate molecular orbitals
can be derived by using the substitution which is inverse to equation (3·04), viz.

$$(\psi) = T^{-1}(\chi), \qquad (3\cdot18)$$

and the energy levels can be similarly derived from a relation which is the inverse
of (3·15) and of the type

$$E_{\mu\nu} = \sum_{m,n} (t_{m\mu}\,\bar t_{n\nu})\,e_{mn}. \qquad (3\cdot19)$$
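The passage from the molecular-orbital parameters to those of the equivalent orbitals and back again can be checked numerically for a real substitution matrix. The sketch below takes as an assumed example the trigonal matrix quoted later in the paper as (4·04); the energies `E11` and `E22` are hypothetical, and `forward` and `backward` are illustrative names.

```python
import math

r = math.sqrt
t = [[r(1 / 3), r(2 / 3), 0.0],
     [r(1 / 3), -r(1 / 6), -r(1 / 2)],
     [r(1 / 3), -r(1 / 6), r(1 / 2)]]    # trigonal substitution matrix (real)

E11, E22 = -1.0, -0.4                     # hypothetical energy parameters
E = [[E11, 0.0, 0.0], [0.0, E22, 0.0], [0.0, 0.0, E22]]


def forward(t, E):
    """e_mn = sum over mu, nu of t[m][mu] * t[n][nu] * E[mu][nu] (real t)."""
    n = len(t)
    return [[sum(t[m][mu] * t[k][nu] * E[mu][nu]
                 for mu in range(n) for nu in range(n))
             for k in range(n)] for m in range(n)]


def backward(t, e):
    """E_mu,nu = sum over m, n of t[m][mu] * t[n][nu] * e[m][n] (real t)."""
    n = len(t)
    return [[sum(t[m][mu] * t[k][nu] * e[m][k]
                 for m in range(n) for k in range(n))
             for nu in range(n)] for mu in range(n)]


e = forward(t, E)
E_back = backward(t, e)
print(e[0][0], e[0][1])
```

The round trip recovers the diagonal array of molecular-orbital energies, which is what makes it possible to work with the single equation for one equivalent orbital and recover the energy levels afterwards.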


4. Special cases

It is instructive to find the form taken by the above equations in molecules of
particular symmetry types. Molecules of $C_{2v}$ symmetry, of which triangular ones
like $XY_2$ are examples, have already been considered in part I. Other types of
special interest because of their wide occurrence in chemistry are those of trigonal,
tetrahedral and octahedral symmetry.

(i) Molecules $XY_3$ of trigonal symmetry ($C_{3v}$)

For molecules of the $C_{3v}$ type there are two symmetry operators of a rotation
of 120° clockwise or anticlockwise about the axis of symmetry and these are denoted
by $2C_3$. There are also three planes of symmetry through the axis of symmetry ($3\sigma_v$).
There are three types of molecular orbital; two of them ($A_1$ and $A_2$) are single and the
third ($E$) is doubly degenerate. The character scheme is shown in table 2. (For
a description of the notation used in this section reference may be made to Mulliken
(1933) or to Eyring, Walter & Kimball (1946).)




The substitution matrices of three equivalent orbitals, $\chi_1$, $\chi_2$ and $\chi_3$, under the
operations of the group will be respectively of the types

$$E:\ \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}, \qquad C_3:\ \begin{pmatrix} 0 & 0 & 1 \\ 1 & 0 & 0 \\ 0 & 1 & 0 \end{pmatrix}, \qquad \sigma_v:\ \begin{pmatrix} 1 & 0 & 0 \\ 0 & 0 & 1 \\ 0 & 1 & 0 \end{pmatrix}. \qquad (4\cdot01)$$

These matrices may be denoted by $(P)_\chi$, where $P$ stands for any of the operators of
the group. The character system is thus that given in the last line of table 2. It
follows that the equivalent orbitals can be obtained by a superposition of one $A_1$
orbital and two degenerate orbitals of type $E$.

Table 2. Types of molecular orbital in a field of symmetry $C_{3v}$

               $E$     $2C_3$    $3\sigma_v$
  $A_1$        1       1         1
  $A_2$        1       1         -1
  $E$          2       -1        0
  $\chi$       3       0         1


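In order to find the matrix of the transformation $T$ it is necessary to examine the
matrices of operations of the group in the representation $(A_1 + E)$. Two typical
matrices are

$$(A_1 + E):\quad \begin{pmatrix} 1 & 0 & 0 \\ 0 & -\tfrac{1}{2} & \sqrt{\tfrac{3}{4}} \\ 0 & -\sqrt{\tfrac{3}{4}} & -\tfrac{1}{2} \end{pmatrix}, \qquad \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & -1 \end{pmatrix}, \qquad (4\cdot02)$$

and the others, apart from $E$ which is a diagonal matrix, are obtained from them by
writing the matrices of $C_3^2$, $\sigma_v C_3$ and $\sigma_v C_3^2$. These matrices may be denoted by
$(P)_{A_1+E}$. It is then found that the substitution $T$ which gives equivalence between
corresponding matrices, in the form

$$(P)_\chi = (t)\,(P)_{A_1+E}\,(t)^{-1}, \qquad (4\cdot03)$$

is
$$(t) = \begin{pmatrix} \sqrt{\tfrac{1}{3}} & \sqrt{\tfrac{2}{3}} & 0 \\ \sqrt{\tfrac{1}{3}} & -\sqrt{\tfrac{1}{6}} & -\sqrt{\tfrac{1}{2}} \\ \sqrt{\tfrac{1}{3}} & -\sqrt{\tfrac{1}{6}} & \sqrt{\tfrac{1}{2}} \end{pmatrix}. \qquad (4\cdot04)$$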
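The equivalence (4·03) can be verified directly for the two typical matrices. In the sketch below the matrix $(t)$ is real and orthogonal, so its inverse is taken as its transpose; the labels `C3` and `sigma_v` attached to the two typical matrices are an assumption based on the order of the matrices in (4·01).

```python
import math


def matmul(a, b):
    """Product of two matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def transpose(a):
    return [list(col) for col in zip(*a)]


r = math.sqrt
t = [[r(1 / 3), r(2 / 3), 0.0],
     [r(1 / 3), -r(1 / 6), -r(1 / 2)],
     [r(1 / 3), -r(1 / 6), r(1 / 2)]]

# Typical matrices of the (A1 + E) representation.
D = {
    "C3": [[1, 0, 0], [0, -0.5, r(0.75)], [0, -r(0.75), -0.5]],
    "sigma_v": [[1, 0, 0], [0, 1, 0], [0, 0, -1]],
}


def to_chi(d):
    """Form t d t^T and round each element to the nearest integer."""
    product = matmul(matmul(t, d), transpose(t))
    return [[round(v) for v in row] for row in product]


P = {name: to_chi(d) for name, d in D.items()}
# Largest departure of t t^T from the unit matrix (orthogonality check).
tt = matmul(t, transpose(t))
defect = max(abs(tt[i][j] - (1 if i == j else 0)) for i in range(3) for j in range(3))
print(P)
```

Both products come out as permutation matrices of the three equivalent orbitals.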


The equations for the equivalent orbitals are then given by equations (3·17) with
the following values for $e_{mn}$:

$$e_{11} = e_{22} = e_{33} = \tfrac{1}{3}(E_{11} + 2E_{22}), \qquad (4\cdot05)$$

$$e_{12} = e_{13} = e_{23} = \tfrac{1}{3}(E_{11} - E_{22}), \qquad (4\cdot06)$$

where $E_{11}$ and $E_{22}$ are the energy parameters which would appear in the equations
(2·15) for the molecular orbitals $A_1$ and $E$ respectively;* the energy parameters $E_{12}$
vanish because of the different symmetry of $A_1$ and $E$.

* The use of the symbol $E$ to denote an orbital type and an energy $E$ is unfortunate, but
this is the notation generally accepted. The energy parameters in this paper always have
distinguishing suffixes.





(ii) Molecules $XY_4$ of tetrahedral symmetry ($T_d$)

In a similar way it is found that four equivalent orbitals in tetrahedral symmetry
are obtained from a superposition of $A_1 + T_2$, where $A_1$ is an orbital with complete
tetrahedral symmetry and $T_2$ is a triply degenerate set. The matrix for the
substitution $T$ is given by

$$(t) = \tfrac{1}{2}\begin{pmatrix} 1 & 1 & 1 & 1 \\ 1 & 1 & -1 & -1 \\ 1 & -1 & 1 & -1 \\ 1 & -1 & -1 & 1 \end{pmatrix}, \qquad (4\cdot07)$$

and the energy parameters by

$$e_{11} = e_{22} = e_{33} = e_{44} = \tfrac{1}{4}(E_{11} + 3E_{22}), \qquad (4\cdot08)$$

$$e_{12} = e_{13} = e_{14} = e_{23} = e_{24} = e_{34} = \tfrac{1}{4}(E_{11} - E_{22}), \qquad (4\cdot09)$$

where $E_{11}$ and $E_{22}$ are the energy parameters associated respectively with the
orbitals $A_1$ and $T_2$ in equations (2·15). As in the previous example, $E_{12}$ and similar
energy parameters all vanish because of the different symmetries of $A_1$ and $T_2$. There
are thus four similar equations for the equivalent orbitals $\chi_1$, $\chi_2$, $\chi_3$, $\chi_4$, of which
the following is typical:

$$\{H + v'_{11}(x) - e_{11}\}\chi_1 = g_{21}(x)\chi_2 + g_{31}(x)\chi_3 + g_{41}(x)\chi_4 + e_{12}(\chi_2 + \chi_3 + \chi_4). \qquad (4\cdot10)$$
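The tetrahedral values of the energy parameters, and the behaviour of the last term of the typical equation when the two levels coincide, can be sketched as follows. The energies are hypothetical numbers, and `energy_parameters` is an illustrative helper implementing the simplified sum over diagonal molecular-orbital energies for a real matrix $(t)$.

```python
t = [[0.5 * v for v in row] for row in
     [[1, 1, 1, 1],
      [1, 1, -1, -1],
      [1, -1, 1, -1],
      [1, -1, -1, 1]]]                 # tetrahedral substitution matrix


def energy_parameters(t, levels):
    """e_mn = sum over mu of t[m][mu] * t[n][mu] * levels[mu] for real t."""
    n = len(t)
    return [[sum(t[m][mu] * t[k][mu] * levels[mu] for mu in range(n))
             for k in range(n)] for m in range(n)]


E11, E22 = -1.2, -0.8                  # hypothetical A1 and T2 energy parameters
e = energy_parameters(t, [E11, E22, E22, E22])

# Accidental degeneracy: all four molecular orbitals at the same energy.
e_degenerate = energy_parameters(t, [E22] * 4)

off_diagonal = [e[m][n] for m in range(4) for n in range(4) if m != n]
print(e[0][0], off_diagonal[0], e_degenerate[0][1])
```

All six off-diagonal parameters are equal, and they vanish together in the degenerate case, so that the coupling term proportional to $e_{12}$ drops out of the equation.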


If the energy parameters $E_{11}$ and $E_{22}$ associated with $A_1$ and $T_2$ should be equal,
indicating accidental degeneracy, then the last term in the equation would vanish.
This condition was assumed to be necessary in the original treatment of directed
tetrahedral orbitals by Pauling (Pauling 1931; Slater 1931; for a discussion of
this assumption, see Van Vleck & Sherman 1935). It is seen from the equation
not to be an essential condition for equivalent orbitals, but it is probable that
when this condition holds there may be greater concentration of the $\chi$ orbitals
about their respective axes. These axes, about which the orbitals will have trigonal
symmetry, coincide with the respective $XY$ bonds. When the orbitals are concentrated
about these axes and fall away rapidly in density elsewhere then the value
of $\chi_1$ will be small on the axes of $\chi_2$, $\chi_3$ and $\chi_4$, at any rate in the neighbourhood of
the $Y$ atoms and beyond. The left-hand side of equation (4·10) will then be small for
such regions. Under such conditions the $g(x)$ functions will also be small because
they involve integrals of the products of different $\chi$-functions. On the other hand,
$\chi_2$, $\chi_3$ and $\chi_4$ will have appreciable values on their own axes. Hence consistency in
these statements about concentration can only be obtained when $e_{12}$ is small.
Presumably the converse holds, viz. that when $e_{12}$ is large the orbitals are not
concentrated locally about an individual axis but are spread through the whole
molecule.

Under suitable conditions an approximate solution of the equations (4·10) may
be obtained by neglecting the right-hand side. The potential energy terms in $v'_{11}(x)$
represent the electrostatic effects of $\chi_2$, $\chi_3$ and $\chi_4$ on $\chi_1$ and imply a strong screening
of all the $Y$ atoms except one. The $\chi_1$ orbital is then effectively a bicentric one
subject to the nucleus of $X$ suitably screened, and to one of the $Y$'s and to the
electrostatic action of the other three bonds.





(iii) Molecules $XY_6$ of octahedral symmetry ($O_h$)

For molecules of octahedral symmetry ($O_h$) there are the 24 elements of symmetry
of the group $O$ together with a centre of symmetry, and so with the process of
inversion there are 48 symmetry operators. There are ten types of molecular orbital,
five of them being even in the process of inversion and five uneven. They are usually
denoted by $A_1$, $A_2$, $E$, $T_1$ and $T_2$ with the suffixes $g$ and $u$ according as they are even
or uneven.

From a comparison of the character system of six equivalent orbitals and that of
the molecular orbitals of the $O_h$ group, it is found that

$$(\chi) = A_{1g} + E_g + T_{1u},$$

$A_{1g}$, as usual, being the orbital with the character system unity, $E_g$ being a doubly
degenerate orbital and $T_{1u}$ a triply degenerate one.

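The substitution matrix is

$$(t) = \begin{pmatrix}
\sqrt{\tfrac{1}{6}} & 0 & -\sqrt{\tfrac{1}{3}} & 0 & 0 & \sqrt{\tfrac{1}{2}} \\
\sqrt{\tfrac{1}{6}} & 0 & -\sqrt{\tfrac{1}{3}} & 0 & 0 & -\sqrt{\tfrac{1}{2}} \\
\sqrt{\tfrac{1}{6}} & \tfrac{1}{2} & \sqrt{\tfrac{1}{12}} & \sqrt{\tfrac{1}{2}} & 0 & 0 \\
\sqrt{\tfrac{1}{6}} & \tfrac{1}{2} & \sqrt{\tfrac{1}{12}} & -\sqrt{\tfrac{1}{2}} & 0 & 0 \\
\sqrt{\tfrac{1}{6}} & -\tfrac{1}{2} & \sqrt{\tfrac{1}{12}} & 0 & \sqrt{\tfrac{1}{2}} & 0 \\
\sqrt{\tfrac{1}{6}} & -\tfrac{1}{2} & \sqrt{\tfrac{1}{12}} & 0 & -\sqrt{\tfrac{1}{2}} & 0
\end{pmatrix}, \qquad (4\cdot11)$$

the first column giving the coefficient of $A_{1g}$, the next two those of $E_g$, and the last
three those of the $T_{1u}$ orbitals. If $\chi_1$, given by the coefficients of the first row, is directed
along an axis labelled $x$, then $\chi_2$ is directed along $-x$, $\chi_3$ along $y$, $\chi_4$ along $-y$, $\chi_5$
along $z$ and $\chi_6$ along $-z$.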

Using these matrix elements in equation (3-16) and remembering that $E_{\mu\nu}$ vanishes
when the suffixes $\mu$ and $\nu$ refer to orbitals of different symmetry or to orbitals which
are different members of a degenerate set, we find that $e_{nn}$ has the same value for all
values of $n$ (from 1 to 6), viz.

$$e_{nn} = \tfrac{1}{6}(E_{11} + 2E_{22} + 3E_{33}), \tag{4-12}$$

where $E_{11}$, $E_{22}$ and $E_{33}$ refer respectively to the energy parameters of the orbitals
$A_{1g}$, $E_g$ and $T_{1u}$. Similarly, there are three pairs of equal energy parameters of
the type

$$e_{12} = e_{21} = \tfrac{1}{6}(E_{11} + 2E_{22} - 3E_{33}), \tag{4-13}$$

and twelve pairs of the type

$$e_{13} = e_{31} = \tfrac{1}{6}(E_{11} - E_{22}). \tag{4-14}$$

The energy parameters given by (4-13) refer to pairs of orbitals which are directed 
in opposite directions. The rest refer to pairs of orbitals whose axes are perpendicular 
to each other. 
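
The relations (4-12) to (4-14) can be checked numerically. The sketch below assumes that the energy parameters transform as $e_{mn} = \sum_\mu t_{m\mu} E_{\mu\mu} t_{n\mu}$ and uses one orthogonal choice of substitution matrix; the sign and ordering conventions of the columns, the function name `energy_parameters` and the numerical energies are illustrative assumptions, not taken from the paper.

```python
import math

# One orthogonal choice of substitution matrix for six octahedral orbitals.
# Rows: chi_1..chi_6. Columns: A1g, two Eg members, three T1u members.
# Column signs and ordering are an assumption made for illustration.
r6, r3, r12, r2 = (1 / math.sqrt(n) for n in (6, 3, 12, 2))
T = [
    [r6, 0.0, -r3, 0.0, 0.0, r2],    # chi_1 along +x
    [r6, 0.0, -r3, 0.0, 0.0, -r2],   # chi_2 along -x
    [r6, 0.5, r12, r2, 0.0, 0.0],    # chi_3 along +y
    [r6, 0.5, r12, -r2, 0.0, 0.0],   # chi_4 along -y
    [r6, -0.5, r12, 0.0, r2, 0.0],   # chi_5 along +z
    [r6, -0.5, r12, 0.0, -r2, 0.0],  # chi_6 along -z
]


def energy_parameters(E11, E22, E33):
    """Return the 6x6 array e[m][n] = sum_k T[m][k] * E_k * T[n][k]."""
    E = [E11, E22, E22, E33, E33, E33]  # degenerate members share one energy
    return [[sum(T[m][k] * E[k] * T[n][k] for k in range(6))
             for n in range(6)] for m in range(6)]


E11, E22, E33 = -3.0, -2.0, -1.5          # arbitrary illustrative energies
e = energy_parameters(E11, E22, E33)
e_diag = (E11 + 2 * E22 + 3 * E33) / 6    # right-hand side of (4-12)
e_opp = (E11 + 2 * E22 - 3 * E33) / 6     # right-hand side of (4-13)
e_perp = (E11 - E22) / 6                  # right-hand side of (4-14)

opposite = [(0, 1), (2, 3), (4, 5)]
perpendicular = [(i, j) for i in range(6) for j in range(i + 1, 6)
                 if (i, j) not in opposite]
```

With this choice the three opposite pairs share one value and the twelve perpendicular pairs share another, as stated in the text.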

The equations to determine the six equivalent orbitals are then all similar to

$$\{H + v'_{11}(x) - e_{11}\}\chi_1 = \sum_{m=2}^{6} g_{1m}\chi_m + e_{12}\chi_2 + e_{13}(\chi_3 + \chi_4 + \chi_5 + \chi_6). \tag{4-15}$$

The interpretation of this equation is clear. The term $v'_{11}(x)$ is the potential energy
of an electron in the presence of the electrostatic distribution of the other equivalent
orbitals; four of the functions $g_{1m}$ are similar except for orientation and represent
the potential energy due to an electron distribution given by a product of $\chi_1$ and
$\chi_m$; the energy coefficients are equal for the four equivalent orbitals at right angles
to $\chi_1$, as would be expected.

We note, in particular, that both $e_{12}$ and $e_{13}$ vanish whenever $E_{11} = E_{22} = E_{33}$.
An example of this occurs in the theory of directed atomic orbitals where for a
hydrogen-like atom octahedral orbitals can be obtained by superposition of $(s)$,
$(p)^3$ and $(d)^2$ orbitals, all of which have the same energy for the same total quantum
number.

An interesting property of equivalent orbitals may be inferred from the energy
coefficients $e_{mn}$. Whenever group theory indicates that equivalent orbitals may be
obtained from the superposition of two types of molecular orbitals, then there are
only two energy levels $E_{11}$ and $E_{22}$, and accordingly only two coefficients $e_{11}$ and
$e_{12}$. This implies that the orbitals are symmetrically related to each other in the sense
that the members of every pair are similarly related to each other. Thus for molecules
of trigonal symmetry in a plane, three equivalent orbitals are obtained and the angle
between each pair is the same, viz. 120°. For tetrahedral symmetry four equivalent
orbitals are so related that the angle between any pair is the same. But when, on
the other hand, equivalent orbitals are made up of three types of molecular orbital,
as in the example of octahedral symmetry, the equivalent orbitals are not similarly
related to each other. There are then three energy parameters $E_{11}$, $E_{22}$ and $E_{33}$ and
accordingly three parameters $e_{11}$, $e_{12}$ and $e_{13}$. Any one such orbital is then related to
the rest in two (and only two) ways. Thus in octahedral symmetry each orbital is
similarly related to four of the others and then differently related to the sixth.
A similar result is obtained for the case of four equivalent orbitals in a planar
molecule $XY_4$, where the bonds are directed along the axes of a square.


5. The electron distribution in symmetrical molecules


When the electronic configuration of a molecule can be described by a set of
molecular orbitals determined by equations (2-15) and by a determinantal function
$\Phi$, given by equation (3-01), the electronic distribution can be obtained by suitable
integrations of $\bar\Phi\Phi$, as was shown by Dirac (1931) to be possible for atoms. The
expression $\bar\Phi\Phi\,d\tau_1 \ldots d\tau_N$ is the probability of a given configuration of $N$ electrons, that
is, the probability that an electron will be found in a given element of volume $d\tau_1$
with prescribed spin, another in $d\tau_2$, and so on. The probability of finding $(N-1)$
electrons in a prescribed configuration irrespective of the position of the $N$th is
obtained by integrating over the variables of the $N$th electron. The probability of
finding two electrons simultaneously in prescribed elements of volume is obtained
by integrating over $(N-2)$ electrons. As we are not interested in the particular
electrons which appear in the given elements of volume, it is necessary to integrate
over the remaining variables in such a way that each configuration $d\tau_3 \ldots d\tau_N$ appears
only once. The two-electron distribution function is accordingly proportional to


$$\begin{vmatrix} \sigma(1,1) & \sigma(1,2) \\ \sigma(2,1) & \sigma(2,2) \end{vmatrix}\, d\tau_1\, d\tau_2, \tag{5-01}$$

where

(5-02)




and the $\rho$-functions are the summations over the wave functions of the occupied
orbitals in ordinary spatial co-ordinates, as defined in equations (3-13) and (3-11).

It is of some interest to examine the above distribution in two special cases. In 
one case all the orbitals are singly occupied with electrons possessing the same spin, 
and in the other they are doubly occupied with electrons having paired spins. Both 
of these cases are included in the following treatment. 

When the integration is carried out over the spin co-ordinates, the distribution 
function becomes 

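$$\sum_{l}(\epsilon_\alpha + \epsilon_\beta)\,\bar\psi_l(1)\psi_l(1)\,\sum_{m}(\epsilon_\alpha + \epsilon_\beta)\,\bar\psi_m(2)\psi_m(2) - \sum_{l,m}(\epsilon_\alpha^2 + \epsilon_\beta^2)\,\bar\psi_l(1)\psi_l(2)\,\bar\psi_m(2)\psi_m(1), \tag{5-03}$$

the last summation implying that contributions arise only from orbitals having the
same spin. When the orbitals are singly occupied with the same spin, the appropriate
distribution function is obtained by putting $\epsilon_\alpha = 1$, $\epsilon_\beta = 0$, and when the orbitals
are doubly occupied, it is necessary to put $\epsilon_\alpha = 1$, $\epsilon_\beta = 1$. When this is reduced by
cancelling all possible terms, the expression becomes

$$2\epsilon_\alpha\epsilon_\beta\sum_{l}\bar\psi_l(1)\psi_l(1)\,\bar\psi_l(2)\psi_l(2) + (\epsilon_\alpha + \epsilon_\beta)^2\sum'_{l,m}\bar\psi_l(1)\psi_l(1)\,\bar\psi_m(2)\psi_m(2) - (\epsilon_\alpha^2 + \epsilon_\beta^2)\sum'_{l,m}\bar\psi_l(1)\psi_l(2)\,\bar\psi_m(2)\psi_m(1). \tag{5-04}$$

The first term, involving $\sum$, arises from paired electrons in the same orbital, the
second and third terms, involving $\sum'$, from electrons in different orbitals.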

Now from the earlier sections of this paper it is clear that it is immaterial whether
the wave functions used in the above formulae refer to molecular orbitals or to
equivalent orbitals. Thus for singly-occupied orbitals with the same spin, the
probability distribution is

$$\Big\{\sum'_{\lambda,\mu}\bar\chi_\lambda(1)\chi_\lambda(1)\,\bar\chi_\mu(2)\chi_\mu(2) - \sum'_{\lambda,\mu}\bar\chi_\lambda(1)\chi_\lambda(2)\,\bar\chi_\mu(2)\chi_\mu(1)\Big\}\,dv_1\,dv_2. \tag{5-05}$$

This gives the probability of finding an electron in an element of volume $dv_1$, and
another in an element of volume $dv_2$.

In certain molecules we may expect the orbitals to be localized in different
bonds. If there is pronounced localization, the value of any one of the $\chi$'s may have
a well-defined maximum, and about this maximum there may be a region of space
within which $\chi$ is appreciable, and outside of which $\chi$ has a negligible value. In such
a case let the region of appreciable $\chi_\lambda$ be denoted by $\omega_\lambda$. The region may or may
not overlap another region $\omega_\mu$. If it does not, the distribution function (5-05) lends
itself to a simple physical interpretation.

Let $dv_1$ be in the region $\omega_\lambda$. Then the function (5-05) is vanishingly small unless
$dv_2$ lies in one of the other regions $\omega_\mu$ different from $\omega_\lambda$. Moreover, it is evident that
under these conditions the main contribution to the probability distribution (5-05)
comes from the first term. The electrostatic interaction of the electrons, which is
given by the integration of $e^2/r_{12}$ multiplied by the probability distribution, in that
case arises mainly from those configurations in which two electrons are in different
$\omega$-regions and implies repulsion. In such circumstances the molecule would behave
as though the $\omega$-regions repelled each other.

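For molecules which contain pairs of electrons in sets of orbitals and consist of
complete shells, the corresponding probability function is

$$2\sum_{\lambda}\bar\chi_\lambda(1)\chi_\lambda(1)\,\bar\chi_\lambda(2)\chi_\lambda(2) + 4\sum'_{\lambda,\mu}\bar\chi_\lambda(1)\chi_\lambda(1)\,\bar\chi_\mu(2)\chi_\mu(2) - 2\sum'_{\lambda,\mu}\bar\chi_\lambda(1)\chi_\lambda(2)\,\bar\chi_\mu(2)\chi_\mu(1). \tag{5-06}$$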

The first term arises from paired electrons in the same orbital and the other terms
from electrons in different orbitals. This formula can be interpreted by means of the
model described above. For orbitals localized in well-defined $\omega$-regions, this function
has its maximum values for elements of volume $dv_1$ and $dv_2$ which are either in the
same region (first term) or in different regions $\omega_\lambda$ and $\omega_\mu$ (second term). The third
term is always small unless regions $\omega_\lambda$ and $\omega_\mu$ overlap. The main contributions to the
electrostatic energy then come from the repulsions of electrons in the same orbital
or from repulsions of electrons in different orbitals. The attraction arising from the
third term is small. Hence, again, the molecule would behave as though the $\omega$-regions
repelled each other. Though this particular representation of $\chi$-functions is highly
idealized, it yet gives some physical insight into the forces which tend to give
stability to molecules of the type $XY_n$, particularly in resisting deformation from
the symmetrical form. Thus the first term of (5-06) will in any event give the mutual
repulsion of a pair of electrons in a $\chi$-orbital, which is associated with each of the
XY bonds. The interaction of bonds is due to the distribution given by the second
and third terms of (5-06). The second term, being positive, will produce repulsion;
the third, being negative, attraction. When the $\chi$-functions are localized in particular
bonds, it seems likely that the repulsive contribution will predominate and cause
the XY bonds to be symmetrically disposed in space.
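
The behaviour described for localized orbitals can be illustrated numerically. The sketch below is only a toy model: two narrow one-dimensional Gaussians stand in for orbitals localized in separate $\omega$-regions (they are orthogonal only to within a negligible overlap), and the function names are hypothetical. It evaluates the singly-occupied, same-spin form (5-05) and the closed-shell form (5-06) for real orbitals.

```python
import math


def gaussian_orbital(centre, width=0.5):
    """Normalised real 1-D Gaussian standing in for a localized orbital."""
    norm = (1.0 / (math.pi * width ** 2)) ** 0.25
    return lambda x: norm * math.exp(-((x - centre) ** 2) / (2 * width ** 2))


def pair_same_spin(orbitals, x1, x2):
    """Form of (5-05): direct minus exchange terms over pairs lambda != mu."""
    total = 0.0
    for i, a in enumerate(orbitals):
        for j, b in enumerate(orbitals):
            if i != j:
                total += a(x1) ** 2 * b(x2) ** 2
                total -= a(x1) * a(x2) * b(x2) * b(x1)
    return total


def pair_closed_shell(orbitals, x1, x2):
    """Form of (5-06): 2 * same-orbital + 4 * direct - 2 * exchange."""
    same = sum(a(x1) ** 2 * a(x2) ** 2 for a in orbitals)
    direct = 0.0
    exchange = 0.0
    for i, a in enumerate(orbitals):
        for j, b in enumerate(orbitals):
            if i != j:
                direct += a(x1) ** 2 * b(x2) ** 2
                exchange += a(x1) * a(x2) * b(x2) * b(x1)
    return 2 * same + 4 * direct - 2 * exchange


# Two regions centred at x = 0 and x = 4 with negligible overlap.
orbitals = [gaussian_orbital(0.0), gaussian_orbital(4.0)]
same_region_parallel = pair_same_spin(orbitals, 0.0, 0.0)     # vanishes
different_regions_parallel = pair_same_spin(orbitals, 0.0, 4.0)
same_region_paired = pair_closed_shell(orbitals, 0.0, 0.0)    # first term only
different_regions_paired = pair_closed_shell(orbitals, 0.0, 4.0)
```

In this toy model the same-spin distribution vanishes when both points lie at the same place, while the closed-shell distribution is dominated by the first term inside one region and by the second term across regions; the exchange term contributes almost nothing because the regions do not overlap.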

The author is indebted to Mr G. G. Hall for reading the paper in manuscript and
making some valuable suggestions about notation.


References

Dirac, P. A. M. 1931 Proc. Camb. Phil. Soc. 27, 240.
Eyring, H., Walter, J. & Kimball, G. E. 1946 Quantum chemistry. Appendix vii. New York:
Wiley; London: Chapman & Hall.
Lennard-Jones, J. E. 1949 Proc. Roy. Soc. A, 198, 1.
Kimball, G. 1940 J. Chem. Phys. 8, 188.
Mulliken, R. S. 1933 Phys. Rev. 43, 279.
Pauling, L. 1931 J. Amer. Chem. Soc. 53, 1367.
Slater, J. C. 1931 Phys. Rev. 37, 481.
Van Vleck, J. H. & Sherman, A. 1935 Rev. Mod. Phys. 7, 167.



A note on polar air-mass modification 


By R. Frost 

(Communicated by Sir Geoffrey Taylor, F.R.S.; received 6 October 1948; revised 4 February 1949)

Formulae are derived for the increase of temperature and moisture content in polar air
which passes over a warm sea surface. These results are in very good agreement with
observations and should be of use to forecasters.

1. Introduction

In view of the emphasis which modern methods of forecasting place on air-mass
analysis the manner in which the source properties of continental polar air are 
modified by passing over a warm sea surface thereby absorbing heat and moisture 
is of fundamental importance. 

The present writer in 1946 discussed the case in which air, which had been com- 
pletely stirred by convection and turbulence over land to give a dry adiabatic lapse 
rate and a humidity mixing ratio which was constant with height, passed over a 
warm sea whose surface temperature was uniform, and found very good agreement 
between theory and observation. In practice, however, these relatively simple
boundary conditions are infrequently realized. According to Sverdrup (1942), for
example, when winds from the land blow over the sea the surface water is carried 
away from the coast and the consequent upwelling of the subsurface water which 
takes place brings water of greater density and lower temperature to the surface, 
so that in general the temperature of the sea surface increases with distance downwind 
from the coast, whilst observations from continental stations quoted by Petterssen 
(1940) show that in well-stirred continental polar air the lapse rate is generally less 
than the dry adiabatic, whilst the humidity mixing ratio usually decreases fairly 
rapidly with height. 

In spite of this, many forecasters applied the theory to the practical problems of 
forecasting with a fair degree of success, and in an investigation carried out by staff 
of the Meteorological Office (of which a summary is given in an Aviation meteorological 
report of eastern England), it was found that the formulae gave good agreement 
with observation, the formula for the increase of temperature giving better agree- 
ment with observations than that for the increase of water vapour which tended to 
overestimate the increase of humidity mixing ratio. 

In § 2 of the present paper the effect of a sea surface whose temperature increases 
uniformly with distance from the coast is discussed, and in § 3 the evaporation from 
an ocean surface is discussed with the assumption that the humidity mixing ratio 
initially decreases either linearly with height or as a power of the height. It is shown 
that the original formula for the increase of temperature holds to a high degree of 
approximation irrespective of the initial stability of the air provided that the sea 
temperature at the end of the trajectory is used in place of a uniform sea temperature, 



and that the original formula for the increase of water vapour can be modified
without difficulty to take account of an initial decrease of humidity mixing ratio 
with height. 


2. Theory of temperature increase


(a) General 


Let $x$ be measured downwind from the coast and $z$ be measured vertically upwards;
then, neglecting horizontal diffusion, the equation stating that in the steady state
advection and diffusion balance is

$$U\frac{\partial\theta}{\partial x} = \frac{\partial}{\partial z}\left(K\frac{\partial\theta}{\partial z}\right), \tag{2-01}$$

where $U$ is the mean velocity of the air at a height $z$ in the direction of $x$ increasing,
$K$ is the coefficient of eddy diffusion and $\theta$ is the potential temperature of the air.

With $U$ and $K$ represented as in the earlier paper by the following conjugate
power laws

$$U = U_1\left(\frac{z}{z_1}\right)^m, \tag{2-02}$$

$$K = m\,U_1\,z_0^{2m}\,z_1^{-m}\,z^{1-m}, \tag{2-03}$$

where $U_1$ is the mean velocity of the air at a standard height $z_1$, $z_0$ is a length
characteristic of the degree of roughness of the sea surface, and $m$ is a non-dimensional constant
which is a function of the thermal stability, equation (2-01) becomes

$$z^m\frac{\partial\theta}{\partial x} = a\frac{\partial}{\partial z}\left(z^{1-m}\frac{\partial\theta}{\partial z}\right), \tag{2-04}$$

where $a = m z_0^{2m}$.

The implications of the above power-law form of $K$ have been discussed by the
present writer (1948).

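The solution of (2-04) which satisfies the boundary conditions $\theta = \theta_0$, a constant
over land, and $\theta = \theta_1$, a constant over the sea, is

$$\theta = \theta_0 + (\theta_1 - \theta_0)I, \tag{2-05}$$

where

$$I = 1 - \frac{1}{\Gamma\left(\frac{m}{2m+1}\right)}\int_0^{\xi} e^{-t}\,t^{-\frac{m+1}{2m+1}}\,dt \tag{2-06}$$

and

$$\xi = \frac{z^{2m+1}}{(2m+1)^2\,a\,x}. \tag{2-07}$$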


Using a value of $m = \tfrac{1}{7}$ appropriate to an adiabatic lapse rate* and a value of the
roughness parameter $z_0 = 1$ cm. appropriate to a rough sea surface as in the earlier
paper (1946),

$$\xi = \frac{343\,z^{9/7}}{81\,x}, \tag{2-08}$$

with $z$ and $x$ measured in cm. The values of $\xi$ for selected values of $x$ and $z$ computed
from equation (2-08) are given in table 1, and values of $\xi$ for selected values of $x$ at a
height of 10 m. are shown in figure 1.

* Sir Geoffrey Taylor, in a private communication, suggests that as the instability would
increase with increasing distance downwind, the index $m$ should decrease downwind, and this
suggestion is supported by the observations in table 4 which show that the errors between
observation and theory (with $m$ constant) increase with increasing distance. For distances
of between 100 and 500 km. which are considered in this paper, the variation of $I$ with $m$ is
small, and no serious errors are introduced by using a constant value of the index. Even if
the law of variation of $m$ with $x$ were known, however, the mathematical difficulties of solution
of (2-04) with $m$ varying with $x$ would be prohibitive.

Table 1

z \ x       1 km.         10 km.        100 km.       1000 km.
1 m.        1.6 × 10^-2   1.6 × 10^-3   1.6 × 10^-4   1.6 × 10^-5
10 m.       3.0 × 10^-1   3.0 × 10^-2   3.0 × 10^-3   3.0 × 10^-4
100 m.      5.9           5.9 × 10^-1   5.9 × 10^-2   5.9 × 10^-3
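
The entries of table 1 can be reproduced with a few lines of arithmetic. The sketch below assumes $\xi = z^{2m+1}/\{(2m+1)^2 a x\}$ with $a = m z_0^{2m}$, $m = 1/7$ and $z_0 = 1$ cm., all lengths being in centimetres; the function name `xi` and the constant names are illustrative.

```python
M = 1.0 / 7.0    # index m assumed for an adiabatic lapse rate
Z0_CM = 1.0      # roughness length z0 in cm (rough sea surface)


def xi(z_cm, x_cm, m=M, z0_cm=Z0_CM):
    """Similarity variable xi = z^(2m+1) / ((2m+1)^2 * a * x), lengths in cm."""
    a = m * z0_cm ** (2 * m)
    return z_cm ** (2 * m + 1) / ((2 * m + 1) ** 2 * a * x_cm)


CM_PER_M = 100.0
CM_PER_KM = 1.0e5

# First column of table 1: x = 1 km at heights of 1, 10 and 100 m.
xi_1m = xi(1 * CM_PER_M, 1 * CM_PER_KM)
xi_10m = xi(10 * CM_PER_M, 1 * CM_PER_KM)
xi_100m = xi(100 * CM_PER_M, 1 * CM_PER_KM)
```

Because $\xi$ is inversely proportional to $x$, the remaining columns follow by dividing by 10, 100 and 1000.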


Figure 1. Graphical representation of tables 1 and 2. (Vertical scales: 0 to 0.07 and
0 to 0.7; horizontal scale: log $\xi$ from 0 to -6.)

(b) Case of $\theta = \theta_0$ over land and $\theta = \theta_1 + bx$ over the sea, where $\theta_1 > \theta_0$

In the present note a solution of (2-04) will first be obtained which satisfies $\theta = \theta_0$,
a constant over land, and $\theta = \theta_0 + bx$ over the sea, where $b$ is a constant.

Inspection of the dimensions of the terms in (2-04) suggests that $\theta$ is a function
of $\xi$, but a simple solution $\theta = f(\xi)$ makes $\theta$ a constant both when $\xi \to 0$ and when
$\xi \to \infty$. It suggests, however, that it should be possible to find a solution

$$\theta = \theta_0 + bx f(\xi), \tag{2-09}$$

where

$$f(\xi) \to 1 \text{ when } \xi \to 0 \quad\text{and}\quad f(\xi) \to 0 \text{ when } \xi \to \infty. \tag{2-10}$$


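Denoting differentiation with respect to $\xi$ by dashed letters, $f(\xi)$ must satisfy

$$\xi f'' + \left(\xi + \frac{1+m}{1+2m}\right)f' - f = 0. \tag{2-11}$$

Writing

$$f(\xi) = \left(\xi + \frac{1+m}{1+2m}\right)\phi(\xi), \tag{2-12}$$

$\phi(\xi)$ must satisfy

$$\xi\left(\xi + \frac{1+m}{1+2m}\right)\phi'' + \left\{2\xi + \left(\xi + \frac{1+m}{1+2m}\right)^2\right\}\phi' = 0, \tag{2-13}$$

and hence

$$\phi(\xi) = C\int_{\xi}^{\infty}\frac{e^{-t}\,t^{-\frac{1+m}{1+2m}}}{\left(t + \frac{1+m}{1+2m}\right)^2}\,dt + p,$$

where $C$ and $p$ are constants. Thus

$$f(\xi) = \left(\xi + \frac{1+m}{1+2m}\right)\left\{C\int_{\xi}^{\infty}\frac{e^{-t}\,t^{-\frac{1+m}{1+2m}}}{\left(t + \frac{1+m}{1+2m}\right)^2}\,dt + p\right\}. \tag{2-14}$$

Now from (2-10) $f(\xi) \to 0$ when $\xi \to \infty$, and hence $p = 0$. Therefore

$$f(\xi) = C\left(\xi + \frac{1+m}{1+2m}\right)\int_{\xi}^{\infty}\frac{e^{-t}\,t^{-\frac{1+m}{1+2m}}}{\left(t + \frac{1+m}{1+2m}\right)^2}\,dt. \tag{2-15}$$

Integrating by parts twice

$$f(\xi) = C\,\frac{1+2m}{1+m}\left\{\left(\xi + \frac{1+m}{1+2m}\right)\int_{\xi}^{\infty}e^{-t}\,t^{-\frac{1+m}{1+2m}}\,dt - e^{-\xi}\,\xi^{\frac{m}{1+2m}}\right\}. \tag{2-16}$$

Since from (2-10) $f(\xi) \to 1$ when $\xi \to 0$,

$$C = \frac{1}{\Gamma\left(\frac{m}{1+2m}\right)}. \tag{2-17}$$

Hence

$$f(\xi) = \left(1 + \frac{1+2m}{1+m}\,\xi\right)I - \frac{1+2m}{1+m}\,\frac{e^{-\xi}\,\xi^{\frac{m}{1+2m}}}{\Gamma\left(\frac{m}{1+2m}\right)}, \tag{2-18}$$

and

$$\theta = \theta_0 + bx\left\{\left(1 + \frac{1+2m}{1+m}\,\xi\right)I - \frac{1+2m}{1+m}\,\frac{e^{-\xi}\,\xi^{\frac{m}{1+2m}}}{\Gamma\left(\frac{m}{1+2m}\right)}\right\} \tag{2-19}$$
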


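is the solution which satisfies the required conditions.

Since both (2-05) and (2-19) are solutions of (2-04), it follows that

$$\theta = \theta_0 + (\theta_1 - \theta_0)I + bx\left\{\left(1 + \frac{1+2m}{1+m}\,\xi\right)I - \frac{(1+2m)\,e^{-\xi}\,\xi^{\frac{m}{1+2m}}}{(1+m)\,\Gamma\left(\frac{m}{1+2m}\right)}\right\} \tag{2-20}$$

is also a solution of (2-04), and it can readily be seen that (2-20) is the solution of the
differential equation which fits the boundary conditions $\theta = \theta_0$ over land and
$\theta = \theta_1 + bx$ over the sea where $\theta_1 > \theta_0$.

Writing $\theta_1 + bx = \theta_x$, equation (2-20) may be written

$$\theta = \theta_0 + (\theta_x - \theta_0)I - (\theta_x - \theta_1)\,\frac{1+2m}{1+m}\left\{\frac{e^{-\xi}\,\xi^{\frac{m}{1+2m}}}{\Gamma\left(\frac{m}{1+2m}\right)} - \xi I\right\}, \tag{2-21}$$

or

$$\theta = \theta_0 + (\theta_x - \theta_0)I - (\theta_x - \theta_1)H, \tag{2-22}$$

where

$$H = \frac{1+2m}{1+m}\left\{\frac{e^{-\xi}\,\xi^{\frac{m}{1+2m}}}{\Gamma\left(\frac{m}{1+2m}\right)} - \xi I\right\}. \tag{2-23}$$
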

Values of $H$ for selected values of $\xi$, together with the values of $I$ which have
been extracted from table 1 of the earlier paper, are given in table 2 and are shown
graphically in figure 1.

Table 2

ξ     10^-5    10^-4    10^-3    10^-2    10^-1    1
H     0.037    0.047    0.061    0.074    0.071    0.019
I     0.706    0.621    0.510    0.368    0.190    0.027
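
The tabulated values can be checked numerically. The sketch below assumes that $I$ is the regularized upper incomplete gamma function of order $m/(2m+1)$ and that $H = \frac{1+2m}{1+m}\{e^{-\xi}\xi^{m/(1+2m)}/\Gamma(\frac{m}{1+2m}) - \xi I\}$, with $m = 1/7$; the incomplete gamma function is summed from its power series so that only the standard library is needed. The function names are illustrative.

```python
import math

M = 1.0 / 7.0


def upper_gamma_ratio(a, x, terms=200):
    """Gamma(a, x) / Gamma(a), via the power series of the lower function."""
    if x == 0.0:
        return 1.0
    term = 1.0 / a          # n = 0 term of sum x^n / (a (a+1) ... (a+n))
    total = term
    for n in range(1, terms):
        term *= x / (a + n)
        total += term
    lower = math.exp(-x + a * math.log(x)) * total / math.gamma(a)
    return 1.0 - lower


def I_of_xi(xi, m=M):
    """I as a function of xi (assumed form of equation 2-06)."""
    return upper_gamma_ratio(m / (2 * m + 1), xi)


def H_of_xi(xi, m=M):
    """H as a function of xi (assumed form of equation 2-23)."""
    a = m / (2 * m + 1)
    first = math.exp(-xi) * xi ** a / math.gamma(a)
    return (1 + 2 * m) / (1 + m) * (first - xi * I_of_xi(xi, m))


# (xi, H, I) as printed in table 2.
TABLE_2 = [
    (1e-5, 0.037, 0.706), (1e-4, 0.047, 0.621), (1e-3, 0.061, 0.510),
    (1e-2, 0.074, 0.368), (1e-1, 0.071, 0.190), (1.0, 0.019, 0.027),
]
worst_I = max(abs(I_of_xi(x) - i_tab) for x, _, i_tab in TABLE_2)
worst_H = max(abs(H_of_xi(x) - h_tab) for x, h_tab, _ in TABLE_2)
```

Under these assumptions the computed values agree with the printed table to about the rounding of the last figure.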


Since $\theta_1 > \theta_0$, it follows that for very small values of $\xi$ the temperature is given to
a good degree of approximation by

$$\theta \approx \theta_0 + (\theta_x - \theta_0)I, \tag{2-24}$$

which is similar to (2-05) except that $\theta_x$, the temperature of the sea at a distance $x$
downwind, now replaces a uniform sea-surface temperature.

At any distance $x$ downwind it follows from equation (2-22) that the potential
temperature $\theta$ can be expressed by an equation of the type

$$\theta = \theta_0 + (\bar\theta - \theta_0)I, \tag{2-25}$$

where $\bar\theta$ is the weighted mean of the initial and final sea temperature and is given by

$$\bar\theta = \frac{(I - H)\,\theta_x + H\,\theta_1}{I}. \tag{2-26}$$

Table 3, which is derived from figure 1, gives values of $I$ and $H$ at a height of 10 m.
at various distances downwind from the coast.


Table 3

distance in km.    10      50      100     200     300
I                  0.28    0.40    0.44    0.48    0.51
H                  0.076   0.071   0.070   0.064   0.061

distance in km.    400     500     600     800     1000
I                  0.525   0.54    0.55    0.56    0.57
H                  0.059   0.057   0.056   0.055   0.054






From table 3 it can be seen that for calculating the potential temperature at a
height of say 10 m., the final and initial sea temperatures have to be weighted in the
ratio 2½ : 1 at a distance of about 10 km. from the coast, in the ratio of 4 : 1 at a distance
of about 40 km. from the coast, in the ratio 7 : 1 at a distance of about 200 km. from
the coast, and in the ratio of 10 : 1 at a distance of about 1100 km. For most practical
purposes therefore it should be sufficient to know the temperature of the sea at the
end of the trajectory where fortunately it can most readily be measured.
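
The quoted weights can be recovered from table 3. Comparing (2-22) with (2-25) shows that the final and initial sea temperatures enter the weighted mean in the ratio $(I - H) : H$; the short sketch below evaluates that ratio, with the dictionary and function names chosen only for illustration.

```python
# (I, H) at a height of 10 m, keyed by distance in km, as read from table 3.
TABLE_3 = {
    10: (0.28, 0.076), 50: (0.40, 0.071), 100: (0.44, 0.070),
    200: (0.48, 0.064), 300: (0.51, 0.061), 400: (0.525, 0.059),
    500: (0.54, 0.057), 600: (0.55, 0.056), 800: (0.56, 0.055),
    1000: (0.57, 0.054),
}


def weight_ratio(I, H):
    """Weight of the final sea temperature relative to the initial one."""
    return (I - H) / H


ratios = {x: weight_ratio(I, H) for x, (I, H) in TABLE_3.items()}
```

The ratio rises from roughly 2.7 at 10 km. to 6.5 at 200 km. and about 9.6 at 1000 km., in line with the rounded figures given in the text.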

(c) Comparison with observation

Klein, Espy & Palladino (1945), in an investigation into methods of forecasting
temperatures on board weather ships during outbreaks of polar air over the north 
Atlantic, when climatological data showed that the increase of sea temperature with 
distance from the coast was approximately linear, found that AT (the temperature 
difference between the air temperature as measured at a weather ship and the
temperature of the sea surface in the vicinity of the ship) was more highly correlated
with the sea-surface temperature in the vicinity of the ship than with the average 
sea-surface temperature between the land and the ship. Burke (1945), who developed 
a method of forecasting temperatures on board ship during such outbreaks of polar 
air, for which a knowledge of the sea-surface temperature was required, found for 
trajectories of less than 680 km. over the sea that close agreement could be obtained 
by using a mean sea-surface temperature in which the final sea-surface temperature 
was weighted three times as heavily as the initial sea-surface temperature, whilst 
for trajectories of 680 km. or over he found that the closest agreement was obtained 
by using the final sea-surface temperature only. 

In table 4 a comparison between observations given by Burke (1945) and values
calculated from equation (2-24) with the aid of table 3 is given. The agreement is
reasonably good.

Table 4

length of     initial        average lapse rate     mean or final
over-water    surface air    over land between 0    sea-surface     observed       calculated                error by
trajectory    temperature    and 1500 m. as % of    temperature     air temp.      air temp.                 Burke's
in km.        in °K          dry adiabatic          in °K           in °K          in °K         error       method
   240           269               85                  276             273           272.5        -0.5          0
   870           262               73                  276             269           270.0        +1.0        +2.0
  1020           268               80                  279             272           274.5        +2.5        +4.0

In the majority of the observations given by Burke, however, the average lapse
rate in the cold air over the land differed markedly from the dry adiabatic lapse rate
and the effect of this upon the final temperature is difficult to assess quantitatively.
The effects of any initial stratification of the air are (i) to damp down the eddies and
thereby to reduce the flux of heat from the sea to the air and (ii) to confine this
reduced heat within a smaller thickness of the atmosphere, and these effects are in
opposition. This can readily be seen from a diagram. Thus in figure 2 curves a' and b'
represent the variation of temperature with pressure at a given distance downwind
over the sea corresponding to the initial states over the land, a = a dry adiabatic
lapse rate and b = a stable lapse rate. If the flux of heat from the sea to the air were
the same in the two cases the areas enclosed by curves aa' and the log p axis and curves
bb' and the log p axis would be equal. The temperature of the air near the surface at
a given distance downwind over the sea would therefore be greater when the air is
initially stable than when it is initially in neutral equilibrium. As, however, the effect
of any initial stratification is also to reduce the flux of heat, the area between bb'
and the log p axis is less than the area between aa' and the log p axis, and hence the
difference between the final air temperatures is correspondingly reduced. It might
therefore be expected that over a limited distance of travel downwind over the sea,
the distance increasing with proximity to the sea surface, formulae (2-22) and (2-24)
would give a good approximation to the observed temperatures irrespective of the
initial stability of the air.

Table 5

Columns: x, length of over-water trajectory in km.; θ0, initial surface-air temperature;
lapse, initial lapse rate as % of dry adiabatic lapse rate; θ1, initial sea-surface
temperature; θx, final sea-surface temperature; Burke, Burke's mean sea-surface
temperature (3θx + θ1)/4; obs., observed air temperature; calc., calculated air
temperature; error from equation (2-24); error from equation (2-22). Temperatures in °K.

x      θ0      lapse   θ1      θx      Burke   obs.    calc.   error    error
                                                               (2-24)   (2-22)
240    269.0   85      -       -       276     273     272.5   -0.5     -
320    274.0   53      -       -       277     276     274.5   -1.5     -
330    260.0   47      -       -       274     268     267     -1.0     -
355    274.5   47      281.5   283     -       277.5   279     -        +1.5
370    266.0   0       281.5   289.5   -       279.0   277.5   -        -1.5
400    274.0   40      -       -       279     278     276.5   -1.5     -
420    267.0   36      277     284     -       276     275.5   -        -0.5
425    273.0   13      -       -       277     274     275     +1.0     -
430    268.5   31      277     284     -       276     276.5   -        +0.5
440    271.0   29      284     287     -       278.5   279.5   -        +1.0
490    267.5   27      282.5   292.5   -       279.0   280.5   -        +1.5
500    266.5   13      282     286     -       277.0   276.5   -        -0.5
500    264.0   25      282     291     -       278     278     -        0.0
505    268.0   63      -       -       274     272     271     -1.0     -
520    263.5   28      277.5   284     -       275.5   274     -        -1.5
550    263.5   30      277.5   284.5   -       275     274.5   -        -0.5



Table 5 shows that for a height of 10 m. above the surface of the sea Burke's
observations support this expectation for distances of travel up to 550 km. from the
coast. It can be seen from this table that in the cases for which both the initial and
final sea-surface temperatures are available and it is possible to use the more exact
equation (2-22), the mean error is 0° K and the mean absolute error is 0.9° K. In
the other cases for which, in the absence of a final sea-surface temperature, Burke's
mean sea temperature has been used, the mean error is -0.75° K, which suggests
that Burke has given too much weight to the initial sea-surface temperature in
obtaining the mean.
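
The error statistics quoted above follow directly from the last two columns of table 5. The sketch below lists the errors as read from that table (calculated minus observed, in °K) and forms the means; the variable names are illustrative.

```python
# Errors in deg K as read from the last two columns of table 5.
errors_eq_2_22 = [1.5, -1.5, -0.5, 0.5, 1.0, 1.5, -0.5, 0.0, -1.5, -0.5]
errors_eq_2_24 = [-0.5, -1.5, -1.0, -1.5, 1.0, -1.0]


def mean(values):
    """Arithmetic mean of a non-empty list."""
    return sum(values) / len(values)


mean_error_2_22 = mean(errors_eq_2_22)
mean_abs_error_2_22 = mean([abs(v) for v in errors_eq_2_22])
mean_error_2_24 = mean(errors_eq_2_24)
```

The ten cases treated with equation (2-22) average to zero with a mean absolute error of 0.9, and the six cases treated with (2-24) and Burke's mean sea temperature average to -0.75.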

For distances greater than 550 km. comparison of Burke’s observations with 
temperatures calculated from equation (2-22) or (2-24) shows that the calculated 
values are too low. The errors are roughly proportional to the difference between the 
actual lapse rate and the dry adiabatic lapse rate and increase with increasing 
distance over 500 km. 




For forecasting temperature at a height of 1.2 m. (4 ft.) it is probable that formulae
(2-22) and (2-24) would give a good approximation to the observed temperature for
trajectories of more than 1000 km.

Figure 2. Temperature increase due to initial stability illustrated diagrammatically.
(Horizontal axis: temperature.)

3. Theory of moisture increase


(a) General 

The partial differential equation (2-04) may be used for the discussion of the flux
of water vapour providing $\theta$, the potential temperature in the equation, is replaced
by $\mu$, the humidity mixing ratio, i.e.

$$z^m\frac{\partial\mu}{\partial x} = a\frac{\partial}{\partial z}\left(z^{1-m}\frac{\partial\mu}{\partial z}\right), \tag{3-01}$$

where $a = m z_0^{2m}$.

The solution of the equation which satisfies the boundary conditions $\mu = \mu_0$,
a constant over land, and $\mu = \mu_s$, a constant over the sea, is as in (2-05)

$$\mu - \mu_0 = (\mu_s - \mu_0)I, \tag{3-02}$$

where $I$ is given by (2-06). Using as before a value of $m = \tfrac{1}{7}$ for unstable lapse rates
and a value of the roughness parameter $z_0 = 1$ cm. appropriate to a rough sea surface,
the values of $I$ and $\xi$ at a height of 4 ft. for selected values of $x$ computed from
equations (2-06) and (2-07) are given in table 6. The last line gives values of $\xi^{-2/3}$
which are required at a later stage.

Table 6

x (km.)     100           200           300           400
ξ           2 × 10^-4     10^-4         6.7 × 10^-5   5 × 10^-5
I           0.598         0.620         0.637         0.650
ξ^(-2/3)    2.93 × 10^2   4.64 × 10^2   6.07 × 10^2   7.36 × 10^2

x (km.)     500           700           900           1000
ξ           4 × 10^-5     2.8 × 10^-5   2.2 × 10^-5   2 × 10^-5
I           0.660         0.671         0.679         0.682
ξ^(-2/3)    8.57 × 10^2   1.08 × 10^3   1.27 × 10^3   1.36 × 10^3


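(b) Case of $\mu = \mu_0 - \lambda z$ over the land and $\mu = \mu_s$, a constant over the sea

A solution of equation (3-01) will now be derived which satisfies the boundary
conditions $\mu = \mu_s$ over the sea and $\mu = \mu_0 - \lambda z$ over the land.

Inspection of the dimensions of the terms in (3-01) suggests that $\mu$ is a function of
$\xi$, but a simple solution $\mu = f(\xi)$ makes $\mu$ a constant both when $\xi \to 0$ and when $\xi \to \infty$.
It suggests, however, that it should be possible to find a solution

$$\mu - \mu_0 = (\mu_s - \mu_0)I - \lambda z f(\xi). \tag{3-03}$$

Denoting differentiation with respect to $\xi$ by dashed letters, $f(\xi)$ must satisfy

$$\xi^2 f'' + \left(\xi^2 + \frac{3+m}{1+2m}\,\xi\right)f' + \frac{1-m}{(1+2m)^2}\,f = 0. \tag{3-04}$$

Writing

$$f(\xi) = \xi^{-\frac{3+m}{2(1+2m)}}\,e^{-\xi/2}\,w(\xi),$$

then $w(\xi)$ must satisfy

$$w'' + \left[-\frac{1}{4} - \frac{3+m}{2(1+2m)\,\xi} + \frac{\frac{1}{4} - \left(\frac{m}{2(1+2m)}\right)^2}{\xi^2}\right]w = 0. \tag{3-05}$$

This is, however, the confluent hypergeometric equation (Whittaker & Watson 1927,
chapter 16), of which two independent solutions are

$$M_{k,n}(\xi) = \xi^{\frac{1}{2}+n}\,e^{-\xi/2}\left\{1 + \frac{\frac{1}{2}+n-k}{1!\,(1+2n)}\,\xi + \frac{(\frac{1}{2}+n-k)(\frac{3}{2}+n-k)}{2!\,(1+2n)(2+2n)}\,\xi^2 + \ldots\right\} \tag{3-06}$$

and

$$M_{k,-n}(\xi) = \xi^{\frac{1}{2}-n}\,e^{-\xi/2}\left\{1 + \frac{\frac{1}{2}-n-k}{1!\,(1-2n)}\,\xi + \frac{(\frac{1}{2}-n-k)(\frac{3}{2}-n-k)}{2!\,(1-2n)(2-2n)}\,\xi^2 + \ldots\right\}, \tag{3-07}$$

where

$$k = -\frac{3+m}{2(1+2m)}, \qquad n = \frac{m}{2(1+2m)}, \tag{3-08}$$

and hence the general solution is

$$f(\xi) = \xi^{-\frac{3+m}{2(1+2m)}}\,e^{-\xi/2}\left\{A\,M_{k,n}(\xi) + B\,M_{k,-n}(\xi)\right\}, \tag{3-09}$$

where $A$ and $B$ are arbitrary constants which have to be determined by the boundary
conditions.

Now since when $z \to 0$, $\xi \to 0$ and hence $\mu \to \mu_s$, it follows that

$$\lim_{z \to 0} z f(\xi) = 0, \tag{3-10}$$

or

$$\lim_{z \to 0} z\,\xi^{-\frac{3+m}{2(1+2m)}}\left\{A\,\xi^{\frac{1}{2}+\frac{m}{2(1+2m)}} + B\,\xi^{\frac{1}{2}-\frac{m}{2(1+2m)}}\right\} = 0, \tag{3-11}$$

or

$$\lim_{z \to 0}\left[A\,z^m\{(2m+1)^2 a x\}^{\frac{1-m}{1+2m}} + B\,\{(2m+1)^2 a x\}^{\frac{1}{1+2m}}\right] = 0, \tag{3-12}$$

whence $B = 0$.

Now when $x \to 0$, $\xi \to \infty$ and $\mu \to \mu_0 - \lambda z$, and hence

$$\lim_{\xi \to \infty} f(\xi) = 1, \tag{3-13}$$

i.e.

$$\lim_{\xi \to \infty} A\,\xi^{-\frac{3+m}{2(1+2m)}}\,e^{-\xi/2}\,M_{k,n}(\xi) = 1, \tag{3-14}$$

where $M_{k,n}(\xi)$ is the Whittaker function, and making use of the asymptotic expansion
for $M_{k,n}(\xi)$,

$$M_{k,n}(\xi) \sim \frac{\Gamma(1+2n)}{\Gamma(\frac{1}{2}+n-k)}\,e^{\xi/2}\,\xi^{-k}, \tag{3-15}$$

whence

$$A\,\frac{\Gamma(1+2n)}{\Gamma(\frac{1}{2}+n-k)} = 1, \tag{3-16}$$

or

$$A = \frac{\Gamma(\frac{1}{2}+n-k)}{\Gamma(1+2n)} = \frac{\Gamma\left(\frac{2(1+m)}{1+2m}\right)}{\Gamma\left(\frac{1+3m}{1+2m}\right)}, \tag{3-17}$$

and hence the solution of equation (3-01) which satisfies the required boundary
conditions is

$$\mu - \mu_0 = (\mu_s - \mu_0)I - \lambda z\,e^{-\xi/2}\,\xi^{-\frac{3+m}{2(1+2m)}}\,\frac{\Gamma\left(\frac{2(1+m)}{1+2m}\right)}{\Gamma\left(\frac{1+3m}{1+2m}\right)}\,M_{k,n}(\xi). \tag{3-18}$$

For distances downwind from the coast of over 100 km. and for small heights above
the surface of the sea $\xi$ is very small, and hence from (3-06) $\mu$ may be written to
a high degree of approximation by

$$\mu - \mu_0 = (\mu_s - \mu_0)I - \lambda z\,\frac{\Gamma\left(\frac{2(1+m)}{1+2m}\right)}{\Gamma\left(\frac{1+3m}{1+2m}\right)}\,\xi^{-\frac{1-m}{1+2m}}. \tag{3-19}$$
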


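(c) Power-law decrease of humidity mixing ratio with height

If the initial humidity mixing ratio, instead of decreasing linearly with height,
decreased as a power of the height, i.e. $\lim_{x \to 0}\mu = \mu_0 - \lambda z^p$ where $p < 1$, a similar
analysis to the above gives

$$\mu - \mu_0 = (\mu_s - \mu_0)I - \lambda z^p\,e^{-\xi/2}\,\xi^{-\frac{1+2p+m}{2(1+2m)}}\,\frac{\Gamma\left(\frac{1+p+2m}{1+2m}\right)}{\Gamma\left(\frac{1+3m}{1+2m}\right)}\,M_{k,n}(\xi), \tag{3-20}$$

where

$$k = -\frac{1+2p+m}{2(1+2m)}, \qquad n = \frac{m}{2(1+2m)}, \tag{3-21}$$

and the approximate solution in this case is

$$\mu - \mu_0 = (\mu_s - \mu_0)I - \lambda z^p\,\frac{\Gamma\left(\frac{1+p+2m}{1+2m}\right)}{\Gamma\left(\frac{1+3m}{1+2m}\right)}\,\xi^{-\frac{p-m}{1+2m}}. \tag{3-22}$$
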


Application 

If in (3' 19) m is given a value of then 

li-Pa = (Ps-Z^o) 7- Az X 0-98^-», (3-23) 

and hence the effect of an initial humidity mixing ratio which decreases linearly with 
height is to decrease the humidity mixing ratio as given by the simple rule (3-02) 
by approximately (3.34) 


Hann (1929) gave the foEowing formula for the average variation of vapour pressure 

with height: 2 

e = e„ 10“63oooo, (3-26) 

I 

where Bq is the vapour pressure at the ground and z is measured in cm. 

This formula for heights up to about 1000 m. gives results not very different from 
the formula adopted by Kaminski (1900) in the Climatological atlas of the Russian 
Empire for the reduction of values of vapour pressure to sea-level, 

6o = e{l + 4xlO“®;2), (3-26) 

and since the vapour pressure near the surface is very nearly proportional to the 
humidity mixing ratio, either formula gives to a very good degree of approximation 


= -4x10-62:), . (3-27) 

which is the initial condition assumed in the discussion of this problem. Thus 

As: — X 4 X 10-62;. (3-28) 

At a height of 4 ft. above the surface therefore the humidity mixing ratio as given 
by the simple formula (3*02) is in excess of the actual humidity mixing ratio by 


approximately 


5 X 10-^ X 


(3*29) 


In a case discussed in the Aviation meteorological report for eastern England, during 
a spell of ' easterlies " in January 1941, air left the continent with a temperature and 
dew-point of 21 and 17° F respectively, and passed over the North Sea whose 
surface temperature was 42° F. The observed values of the temperature of the air 
arriving at the east coast varied from 34 to 36° F and the dew-point from 30 to 3 1 ° F. 
Now from equation (2.05) and table 6 the calculated temperature after a trajectory
of 400 km. is

$$T = 21 + 0.65(42 - 21) = 34.6^\circ\,\mathrm{F},$$

in good agreement with the observed values. Similarly from equation (3.02) and
table 6 the calculated humidity mixing ratio corresponding to the above temperature
is

$$\mu = 1.9 + 0.65(5.65 - 1.9) = 4.33,$$

which corresponds to a dew-point of 36.0° F which is clearly far too high. If, however,
allowance is made for the initial decrease of humidity mixing ratio with height, then
from equations (3.02) and (3.29), the humidity mixing ratio is given by

$$\mu = 1.9 + 0.65(5.65 - 1.9) - 1.9\times5\times10^{-4}\times7.35\times10^{2} = 4.33 - 0.7 = 3.63,$$

which corresponds to a dew-point of 30.7° F, in very good agreement with the
observed values.
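The arithmetic of this worked case can be retraced in a few lines. This is a sketch under stated assumptions rather than the author's procedure: the names are hypothetical, the factor 0.65 is the value taken from table 6 for a 400 km track, and the height factor $5\times10^{-4}$ and distance factor $7.35\times10^{2}$ are simply the numbers quoted in the text for formula (3.29), whose general form is not reproduced here.

```python
FRACTION = 0.65            # modification factor for a 400 km trajectory (table 6)
T_INITIAL, T_SEA = 21.0, 42.0      # air and sea temperatures, deg F
MU_INITIAL, MU_SEA = 1.9, 5.65     # initial and sea-surface mixing ratios
HEIGHT_FACTOR = 5e-4       # about 4e-6 per cm x 122 cm (4 ft), as quoted
DISTANCE_FACTOR = 7.35e2   # value quoted in the text for 400 km


def simple_rule(initial, surface, fraction=FRACTION):
    """Simple modification rule: initial + fraction * (surface - initial)."""
    return initial + fraction * (surface - initial)


temperature = simple_rule(T_INITIAL, T_SEA)
mu_simple = simple_rule(MU_INITIAL, MU_SEA)
# Correction for a mixing ratio that initially decreases linearly with height.
correction = MU_INITIAL * HEIGHT_FACTOR * DISTANCE_FACTOR
mu_corrected = mu_simple - correction

print(f"temperature       = {temperature:.2f} deg F")
print(f"simple mixing     = {mu_simple:.3f}")
print(f"correction        = {correction:.3f}")
print(f"corrected mixing  = {mu_corrected:.3f}")
```

The correction of about 0.7 is what brings the computed dew-point down from the mid-thirties to the observed range of 30 to 31° F.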

It can be seen from equation (3.29) that for a distance of travel of 376 km. the
correction for the initial decrease of humidity mixing ratio with height is $0.35\mu_0$,
and hence for this distance the final humidity mixing ratio is

$$\mu = 0.65\mu_s + 0.35\mu_0 - 0.35\mu_0 = 0.65\mu_s,$$

or in other words the dew-point of the air at 4 ft. at this distance is independent of
the initial dew-point of the air and depends only upon the sea temperature.

It is considered that formula (3.02), as modified by (3.29), should provide a valuable
aid to forecasters.

I am indebted to the Director of the Meteorological Office for permission to publish 
this paper. 

References

Burke, C. J. 1943 J. Met., Milton (Mass.), 2, no. 2.

Frost, R. 1946 Proc. Roy. Soc. A, 186, 20.

Frost, R. 1948 Quart. J. R. Met. Soc. 74, 316.

Hann, J. 1939 Lehrbuch der Meteorologie, 4 Auflage. Leipzig: Chr. Herm. Tauchnitz.

Kaminski, A. 1900 Atlas Climatologique de l’Empire de Russie. St Petersbourg: l’Observatoire
Physique Central Nicolas.

Klein, W. H., Espy, R. B. & Palladino, A. J. 1945 U.S.A.A.F. Rep. no. 971, Washington.

Petterssen, S. 1940 Weather analysis and forecasting, p. 179. New York: McGraw-Hill
Book Co.

Sverdrup, H. U. 1942 Oceanography for meteorologists. New York: Prentice Hall Inc.

Whittaker, E. T. & Watson, G. N. 1927 A course of modern analysis. Cambridge University
Press.



Unified field theory in a curvature-free 
five-dimensional manifold 

By J. G. Bennett, R. L. Brown and M. W. Thring

Institute for the Comparative Study of History, Philosophy and the 
Sciences, Kingston, Surrey. 

(Communicated by N. F. Mott, F.R.S. Received 3 November 1948)


Instead of identifying fields with the curvature of a metric, the present theory shows that
they may be identified with the manner in which the four-way measuring system of the
physical observer O is embedded in a flat five-dimensional manifold provided that due account
is taken of the imperceptibility of the fifth dimension. In this system fields are introduced
by treating the direction cosines, ${}^kl_j$, of the four directions of measurement and of the
imperceptible direction as variable functions of position in the manifold. The track of an
unconstrained body P is taken as a straight line (cosmodesic) in the manifold, but the
‘projection’ of it which O observes in his four-co-ordinate system is in general curved. Thus the
equation describing the element ds of P’s cosmodesic in O’s four-co-ordinate system ($\Delta x^\mu$) is

<fa»cos» A-2<fesinA {(J = AxflAicK 

When O applies the variational condition to ds which expresses the fact that the cosmodesic
is straight, he concludes that it has a space-time curvature with two distinct components,
one dependent upon $\Lambda$, which is the angle between the cosmodesic and an universal direction,
and upon*'^6, the other acting equally on all P bodies whatever the value of A and depending 
only on These ‘ accelerations ’ are shown to correspond to electromagnetic and gravitational 

fields respectively, and the inverse square law of force is shown to hold for spherically sym- 
metrical fields of both types as a consequence of the condition of coherence of the measuring 
system. 

When the cause of the positional variation of the ${}^kl_j$ is a heavy body, having a constrained
rotation, it is shown to give rise to the magnetic field that a body of charge equal to its
gravitational mass would have, without the corresponding electrostatic field.

The ${}^kl_j$’s are restricted by the requirement that the angles between the absolute fifth
direction, the direction imperceptible for O, and the direction orthogonal to O’s four measuring
directions, are all null.

A list of symbols used in this paper is given in an appendix on p. 60.

1. Geometrical representation of freedom and constraint

1.1. Types of field theory

Field theories are framed to give a mathematical account of the observed curvilinear
paths of unconstrained moving bodies. There are various ways in which such theories
can be constructed. A primary distinction can be drawn according to whether the
paths are regarded as ‘really’ curved, or ‘really’ straight. If they are ‘really’
curved, they may be described in terms of central forces ‘acting at a distance’.
This is the Newton-Maxwell type of theory, and it fails to give a complete account
of the observed facts. Alternatively, the curvature can be ascribed to the metric
framework to which the observations are referred. This leads to so-called ‘geometrical
theories’. These can be constructed to give an account, accurate within the present
limits of verifiability, of unconstrained gravitational motions. They fail, however,
to offer any simple and readily visualized representation of such motions, and still less
do they make it easy to represent the electromagnetic field or the connexion between
these two disclosed by the gravitomagnetic effect.

A third line is to take the paths of all unconstrained bodies as absolutely straight,
thus adopting a simple and natural extension of Newton’s first law. In this case
it is necessary to postulate an absolute reference manifold, itself free from curvature,
without which absolute straightness has no meaning. Both the path and the reference
manifold being regarded as free from curvature, the observed curvature of the
motion must, in this type of theory, be ascribed solely to some property of the
observer and his measuring system and the manner in which this system is embedded
in the reference manifold. In the present paper a theory of the ‘absolute straight
path’ type is developed in which the reference manifold is five-dimensional and
curvature-free. This appears to conflict with the demonstration that the geometrical
interpretation of the gravitational field requires a non-Euclidean metric (Schwarzschild
equation) that cannot be embedded in a Euclidean manifold of less than ten
dimensions (Eddington 1924). It will, however, be shown that this difficulty does
not arise if the path of the unconstrained body and the measuring system of the
observer are independently related to the reference manifold.

1.2. Statement of basic postulates

In order to give a mathematical account of the results of identifying the curvature 
of observed paths with properties of the observer and his measuring system it is 
necessary to express these properties in a geometrical form. They are: as regards 
the observer, the fact that his measurements are, at any given point, confined to 
only four (three of space and one of time) out of the possible five independent direc- 
tions in the cosmic manifold; and, as regards the measuring system, the fact that 
any situation in the five-dimensional manifold must, by a generalized projection, 
be referred back to the measuring system. The rigidity of O’s measuring system 
enables the directions of measurement at two spatially separate points to be related 
to one another. The manner in which this is effected, combined with the restriction 
upon the measurements of the observer, gives rise to the equations for the field. In 
this way, a theory is constructed which is entirely free from action at a distance, 
a result which has not been obtained even in geometrical as opposed to central 
force theories. 

These considerations can be formulated in terms of postulates defining four con- 
stituent elements in a generalized dynamical system. 

(a) A five-dimensional flat cosmic manifold and an ‘absolute observer’ Q, able
to measure ‘true’ intervals RS between any two points in the manifold. If then Q
sets up a rectangular, orthogonal co-ordinate system (${}^1Q, {}^2Q, \ldots, {}^5Q$) for the whole
manifold, with origin R, the interval RS will satisfy the relation

$$RS^2 = -({}^1Q)^2 - ({}^2Q)^2 - ({}^3Q)^2 + ({}^4Q)^2 + ({}^5Q)^2. \qquad (1.1)$$

In this equation the ${}^kQ$, the co-ordinates in the Q system of the point S, are all real
numbers and the difference between the space-like (${}^1Q, {}^2Q, {}^3Q$) and time-like
(${}^4Q, {}^5Q$) directions is indicated by the Minkowskian device of positive and negative
additions of the squares of these numbers. The fifth (time-like) co-ordinate can
conveniently be called ‘anti-time’ or ‘eternity’. The term anti-time will be used
in the present paper. The property of irreversibility associated with temporal
processes does not arise in field theory; nevertheless it may be worth noting that
the anti-time postulate suggested itself to us from the thermodynamical consideration
that conservation and irreversibility hold only for temporal processes.
Symmetry suggests that there should also be a direction along which these effects
are reversed and we find in fact that for anti-time displacements entropy is
conserved but the total energy content of a closed system may have different values.

(b) A physical observer O who is ‘anti-time blind’; i.e. who can only make measurements
of space-like intervals (rigid rulers) and time-like intervals (clocks) but can
neither observe nor make measurements in a certain direction which is near to the
universal anti-time direction.* O depends for his observations upon his own rigid
measuring system, and it is assumed that, referred to the cosmic manifold, this
need be neither rectilinear nor orthogonal at any given point. We shall describe the
kind of time constancy of shape and size, possessed by O’s constructional materials,
by the term relative rigidity to distinguish it from absolute rigidity which would
apply to configurations self-congruent for all possible linear transformations in the
cosmic manifold. We can readily see that the relative rigidity of O’s experience does
not exist for Q. Let R be taken as origin of O’s co-ordinate system. The measurements
given by O’s rulers ($X^1, X^2, X^3$) and clocks ($X^4$) starting from the point R are
thus taken along curved lines through the point; these are assumed to lie close to
but not necessarily to coincide with the rectilinear orthogonal axes of Q at the
point R. Thus what appears to be rigid for O changes shape for Q. This implies
that the distance between two of O’s time lines, which for him is constant, is not in
general constant for Q. It is by the conditions which have to be introduced to
ensure the stationary time character and isotropic space character of O’s measuring
system that the conception of ‘restricted rigidity’ is given a geometrical
interpretation.

These conditions can be formulated in terms of the conservation principle that
for any possible observer a stationary system does no work. An equivalent, if less
obvious, statement is that the density of a body on which no work is done remains
constant for any observer. From this it follows that the volume of a ‘relatively rigid’
body (e.g. O’s ruler) remains stationary for Q, although by Q’s measurements, the
distance between any two points changes with O’s time. In other words, the ruler
which is ideally rigid for O behaves as if it were ideally plastic for Q. As seen by Q
the dimensions of an element of a rigid body change in such a way that the alteration
in the direction of the field-producing body compensates those in the orthogonal
directions so that the volume remains constant. For O, of course, the dimensions do
not change.

(c) An unconstrained body P, the observation of which is the only means available
to the observer for detecting the presence of a field. P is without space extension,
and its temporal existence can be represented by a straight line in the Q-co-ordinate
system. To indicate the ‘absolute’ character of the straightness of P’s path we shall
use the term ‘cosmodesic’. To demonstrate the fields it is sufficient to show that
the apparent track for O of a P whose cosmodesic passes through the origin R has
the appropriate curvature.

* It will appear later that besides being small the angle between these two directions must
be a null angle, i.e. an angle whose cosine is unity.

(d) A field-producing system M whose space-distributed existence is the
non-vanishing of the components of null angles between ${}^5Q$, $\varpi$ and $L_5$ at all points in the
cosmic manifold. M is not necessarily rigidly connected to O, and the null angle
may be composed of several simple ‘fields’ centred at a set of points occupying a
definite volume of space, i.e. M may be of finite size or a point.

Field theory then becomes the science of the relations between simple unconstrained
point bodies moving in cosmodesics and the space-extended rigid systems
used by physical observers for making measurements.

1.3. Notation and conventions

It is convenient to replace the purely real (in the mathematical sense) sets of 
numbers and by the mixed sets’" defined by 

'I'qsifQ, ^q = ^Q; (^4- = 1,2,3). (Ma) 

1.4. Universal anti-time

We assume that the general mass-system of the universe determines an unique
direction of anti-time ${}^5Q$ in the cosmic manifold. All ${}^5Q$’s at all times and in all places
are therefore parallel. From this it follows that the cosmodesic of an unconstrained
body P always makes a fixed angle with ${}^5Q$, so that we can take an angle $\Lambda$ such that
$\tfrac12\pi - \Lambda$ is the angle made by the cosmodesic of P with ${}^5Q$. ${}^4Q$ is the direction of time
for Q, and the choice of the ${}^4Q$ direction from among all possible time-like directions
in the four-world through P orthogonal to ${}^5Q$ is equivalent to fixing the velocity
of Q relative to the mass-system of the universe. Owing to the isotropy of space
${}^1Q$, ${}^2Q$ and ${}^3Q$ can be arbitrarily selected as any three imaginary axes orthogonal
to ${}^4Q$ and ${}^5Q$.

1.5. The rigid measuring system of O

We now define the measuring system of O by means of five directions at any point
(e.g. the origin R), having vector components nearly equal to those of the ${}^kQ$.
These five directions will be designated by the unit vectors $L_1, L_2, L_3, L_4, L_5$, where
$L_4$ is the direction at R along which O measures time; $L_5$ is the direction at R along
which displacement involves no change in the clock and ruler readings of O.
There can be only one such direction because the manifold is five-dimensional and
O can make four independent measurements. Ruler and clock measurements made
along $L_\mu$ will be expressed by the symbol $\Delta x^\mu$, where μ = 1, 2, 3 and 4, but not 5.
We also introduce at the origin R a direction $\varpi$ having unit vector components ${}^j\varpi$ in
the Q-system; $\varpi$, like $L_5$, makes a small angle with ${}^5Q$ but it is
defined as being orthogonal to the four vectors $L_1, L_2, L_3, L_4$ at R. We have thus at
each point of the cosmic manifold accessible to O three anti-time-like directions
(1) ${}^5Q$ which is universal, (2) $L_5$ and (3) $\varpi$, the latter two being local and serving to
fix the rigidity of O’s measuring system.

* The summation convention will be adopted as follows:
indices j, n, s, v, w, p take values 1, 2, 3, 4, 5;
μ, ν, σ take values 1, 2, 3, 4;
π, ψ, ρ take values 1, 2, 3.
Indices will be written as affixes when they refer to Q and as suffixes when they refer to O. Thus
${}^kl_j$ signifies the cosine of the angle between ${}^kQ$ and the $L_j$ at the same point where $L_j$ is a vector
associated with O.

1.6. Null intervals and null angles

It is an obvious property of a complex geometry that it is possible to have null
intervals

$$RS^2 = -({}^1Q)^2 - ({}^2Q)^2 - ({}^3Q)^2 + ({}^4Q)^2 + ({}^5Q)^2 = 0, \qquad (1.2)$$

where the ${}^kQ$ are all real and finite.

Similarly, we call the relation between a pair of unit vectors a ‘null angle’ when
they have the same or different components and the cosine of the angle between
them, defined in the usual way as their scalar product, is unity. The angle will be
called a ‘zero angle’ when all the components are identical.
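Both definitions can be made concrete with a small numerical sketch. It is illustrative only and rests on assumptions not spelled out in the surviving text: the helper names are hypothetical, the ‘mixed’ components are taken to be the three space co-ordinates multiplied by i (which reproduces the sign pattern of the interval formula), and the scalar product is the plain bilinear sum of products with no complex conjugation.

```python
def interval_squared(Q):
    """Squared interval for real co-ordinates Q = (Q1..Q5), signature (-,-,-,+,+)."""
    q1, q2, q3, q4, q5 = Q
    return -q1**2 - q2**2 - q3**2 + q4**2 + q5**2


def mixed(Q):
    """Mixed components: space parts multiplied by i (an assumed convention)."""
    return [1j * Q[0], 1j * Q[1], 1j * Q[2], Q[3], Q[4]]


def dot(u, v):
    """Bilinear scalar product of two component lists (no conjugation)."""
    return sum(a * b for a, b in zip(u, v))


# A null interval: real, finite co-ordinates whose squared interval vanishes.
null_point = (3.0, 4.0, 0.0, 5.0, 0.0)
print("interval^2 =", interval_squared(null_point))

# A null angle: `tilted` differs from the fifth axis, yet it is a unit vector
# and its scalar product with the fifth axis is unity.
EPS = 0.01
fifth_axis = mixed((0.0, 0.0, 0.0, 0.0, 1.0))
tilted = mixed((EPS, 0.0, 0.0, EPS, 1.0))
print("tilted . tilted     =", dot(tilted, tilted))
print("fifth_axis . tilted =", dot(fifth_axis, tilted))
```

The tilted vector has non-zero first and fourth components whose squares cancel, which is exactly the situation the null-angle conditions describe.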

1.7. Gravitational field

The gravitational field is now defined as the situation which satisfies the condition
that at all points in the manifold $L_5$ coincides with ${}^5Q$ and $\varpi$ makes a null angle with
${}^5Q$ with non-zero components that are themselves functions of ${}^kQ$. In other words

$${}^5l_5 = 1, \qquad (1.3)$$

but ${}^\mu\varpi$ = small quantity not zero such that

$$\sum_{\mu=1}^{4}({}^\mu\varpi)^2 = 0 \quad\text{and}\quad {}^5\varpi = 1. \qquad (1.4)$$

Since the direction of $\varpi$ varies for different values of $x^\mu$ (μ = 1, 2, 3, 4) it follows
that the four-way measuring system of O does not lie in a fourfold. On the other hand,
since the direction of stationary measurements for O ($L_5$) coincides with ${}^5Q$,
displacement of the system along ${}^5Q$ will leave all measurements unchanged. It will
be shown in the next section that ${}^5l_4$ is related to the ‘potential energy’ of the field
in which O observes P as moving and that it is possible to express the components
of $\varpi$ in terms of ${}^5l_\mu$ so that the equations of the gravitational field emerge in the
required form. This turns on the double limitation on O, his rigid measuring system
and his anti-time blindness, the combination of which gives the same results as
Newton’s theory with the small correction introduced by Einstein.

1.8. Electrostatic field

The electrostatic field is defined as the situation which satisfies the conditions
$\varpi$ coincident with ${}^5Q$ and $L_5$ makes a null angle with ${}^5Q$ having non-zero components
which are functions of ${}^kQ$.

For this case, in general, displacement of the system along ${}^5Q$ involves a change in
O’s observable co-ordinates $x^\mu$.

The components ${}^kl_5$ of the unit vector $L_5$ in the Q co-ordinate system,
i.e. its direction cosines relative to the ${}^kQ$, are real and imaginary numbers satisfying

$${}^5l_5 = 1 \quad\text{and}\quad ({}^1l_5)^2 + ({}^2l_5)^2 + ({}^3l_5)^2 + ({}^4l_5)^2 = 0. \qquad (1.5)$$

In the next section it will be shown that ${}^4l_5$ is directly related to the electrostatic
potential energy observed by O for a P-body carrying unit charge. The angle
$\tfrac12\pi - \Lambda$ which the cosmodesic of P makes with ${}^5Q$ is related to the charge E of
P ($\tan\Lambda \propto E/m_0$). We have thus the necessary elements for constructing the
electrostatic field equations. The magnetic vector potential appears when the rotation of the
vector set $L_k$ relative to the ${}^kQ$ is given a velocity rotation relative to the Q and O
(i.e. a Minkowski rotation about ${}^5Q$) corresponding to the motion of the charged
field-producing body M relative to Q and O. Electromagnetic fields thus arise where
the four-way measuring system of O lies in a fourfold, but the direction of anti-time
for O is not unique.

1.9. Physical rigidity and geometrical torsions

Both types of field must be identified with the interpretation of the fact that the 
measuring system of 0 is constrained. 

The notion of a rigid body is derived from our common experience of persistent 
material objects. Since we are not concerned with any processes proceeding in the 
interior of such a body, it is irrelevant to consider its atomic structure. It may, 
however, be observed that the measuring instruments used in dynamical observa- 
tions are constructed of materials approximating as closely as possible to ideal rigid 
bodies, and the assumption that such instruments are available is common to all 
types of field theory. 

The significant properties which we have to discriminate for the purpose of our 
analysis are those of rigidity and constraint. 

It will be shown that the presence of a field is equivalent to the curvatures of the
two-dimensional surfaces, traced in the five-dimensional manifold by the ends of
a rigid ruler; this curvature arising because, unless the ruler is at right angles to the
direction of the field, its two ends are in regions where the angles between the $L_k$
and the ${}^kQ$ are different. It can also be shown that the variation of these angles
with the variation of $x^\mu$ implies that the $L_k$ forms an axis system with torsions
(Cartan 1932).

In the gravitational case, the magnitude of the unobservable displacement A*® 
along the invisible fifth direction corresponding to the interval BS is not uniquely 
determined by the numbers Aa:-", but depends, for example, upon whether the interval 
is traversed by taking Ax^ or Ax* first. In the electromagnetic case the curvature of 
the envelope of Lg is not unique but depends upon the ratio of d^q and d*q, i.e. upon 
the angle A. 

2. The interpretation of absolute displacements
as physical measurements

2.1. General field conditions

In the previous section we have established two independent geometrical
constructions both referred to the Q-co-ordinate system of the cosmic manifold. One
is the cosmodesic of the unconstrained point body P, determined by the angle
$\tfrac12\pi - \Lambda$ which it makes with ${}^5Q$. The second is the rigid measuring system of O,
determined by the unit vectors $L_j$ each making small, null or zero angles with the
corresponding ${}^jQ$ axes through the given point. These constitute the whole equipment
required for constructing a general unified field theory.

Let the unit vector $L_j$ have direction cosines ${}^kl_j$ with reference to the orthogonal
reference frame ${}^kq$ such that

= ( 2 - 1 ) 

The ${}^kl_j$ are assumed to be functions of ${}^kq$. This assumption is in fact the introduction
of the most general kind of field possible. In order to obtain results which will be
applicable to O’s observations and measurements, it is necessary to specify the
conditions under which the ${}^kl_j$ may be expressed as functions of the space-time
quantities $x^\mu$ where the $x^\mu$ are measured along curves in the cosmic manifold to
which the $L_\mu$ are tangent vectors (μ = 1, 2, 3, 4). This is done in equation (2.2).
First, however, we must develop the general rigidity conditions limiting the choice
of the ${}^kl_j$’s.

Consider an infinitesimal element ds = RS of the cosmodesic of P passing through
R, the origin of the co-ordinate system. The point S has infinitesimal displacements
$d^kq$ from R. Let $\Delta x^j$ be the infinitesimal displacements along the five directions
$L_j$ corresponding to $d^kq$. By definition of the ${}^kl_j$,

$$d^kq = {}^kl_j\,\Delta x^j, \qquad (2.2)$$

whence

$$\Delta x^j = {}_jl^k\,d^kq, \qquad (2.3)$$

where ${}_jl^k$ is the cofactor of ${}^kl_j$ in $\|{}^kl_j\|$ divided by the value of this determinant.
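The passage from displacements along O's directions to displacements in the Q-system and back is an ordinary matrix inversion by cofactors. The following sketch shows this for an arbitrary set of direction cosines close to the identity; the numerical entries, the helper names and the choice of which entries are imaginary are illustrative assumptions, not values from the paper.

```python
def minor(matrix, row, col):
    """Matrix with the given row and column removed."""
    return [[v for c, v in enumerate(r) if c != col]
            for i, r in enumerate(matrix) if i != row]


def det(matrix):
    """Determinant by Laplace expansion along the first row."""
    if len(matrix) == 1:
        return matrix[0][0]
    return sum((-1) ** c * matrix[0][c] * det(minor(matrix, 0, c))
               for c in range(len(matrix)))


def reduced_cofactors(l):
    """Return m with m[j][k] = (cofactor of l[k][j]) / det(l)."""
    n, d = len(l), det(l)
    return [[(-1) ** (j + k) * det(minor(l, k, j)) / d for k in range(n)]
            for j in range(n)]


EPS = 0.01
# l[k][j]: cosine between Q's axis k+1 and O's direction L_(j+1).
# Entries linking a space axis to a time-like one are made imaginary.
l = [[1.0 if k == j else EPS * (k - j) * (1j if (k < 3) != (j < 3) else 1.0)
      for j in range(5)] for k in range(5)]
m = reduced_cofactors(l)

dx = [0.3, -0.2, 0.1, 1.0, 0.05]                                     # displacements along L_j
dq = [sum(l[k][j] * dx[j] for j in range(5)) for k in range(5)]      # forward relation
dx_back = [sum(m[j][k] * dq[k] for k in range(5)) for j in range(5)] # inverse relation

print("largest round-trip error:", max(abs(a - b) for a, b in zip(dx, dx_back)))
```

Laplace expansion is used here only because the matrix is 5 × 5 and the formula mirrors the wording of the text; a library routine would be the natural choice for anything larger.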

We now make use of the approximative assumption that the components of the
$L_j$ are nearly the same as those of the ${}^jQ$. Thus if $\epsilon$ is a small number, we have, at
most

^Tsr, ^^ 4 , ^Iq, ^l^ ^ 0{e),’\ 

4^, % %, % = 0(6), j 

whence it follows that = 1 + O(e^). 

The orthogonality conditions required by the rigidity of O’s measuring system 
(cf. § 1*5) can now be stated in differential form. 

(i) xss is orthogonal to L^ giving 

iTZT 4- + 2^ \ + 3^27 -h ^vj \ = iO(e®), (2*5) 

with three similar equations for the and 

(ii) The are mutually orthogonal, giving 

\ + \ + \% + \% + = 0{e% (2*6) 


with two similar equations for the pairs (2, 3) and (3, 1). 




(iii) is orthogonal to the projection of L* into = 0, giving 

\+\+W + \% = i0{^), (2-7) 

with two similar equations for the pairs (2, 4) and (3, 4). 

These conditions restrict the choice of the directions for both types of field.
They are the minimum requirements which must be satisfied if O is to make
measurements which are self-consistent in time. It can readily be seen that this is physically
equivalent to the postulate that O’s measuring system coheres in such a way as to
be self-identical in time to first-order accuracy.


2.2. The condition that the ${}^kl_j$’s can be expressed as functions of $x^\mu$

2.2.1. Gravitational field

Here we have $L_5$ in the direction of ${}^5Q$ so that

$${}^5l_5 = 1, \quad {}^\mu l_5 = 0 \quad\text{and hence}\quad {}_\mu l^5 = 0. \qquad (2.8)$$

It follows that equations (2.3) do not contain $d^5q$. Hence there is no loss of
generality in considering only cosmodesics lying in ${}^5q = 0$.

Now in the gravitational field the $L_\mu$ do not lie in ${}^5q = 0$, and hence the $\Delta x^\mu$
(when expressed as functions of $d^kq$) must be integrable for all possible $d^\nu q$. The
conditions for this are

$$\frac{\partial({}_\mu l^\nu)}{\partial\,{}^\sigma q} = \frac{\partial({}_\mu l^\sigma)}{\partial\,{}^\nu q} \quad (\nu \neq \sigma). \qquad (2.9)$$

Equations (2.9) are thus the general differential equations of condition in the
gravitational field. When they are satisfied the ${}^kl_j$ which are defined as functions of ${}^kq$
can be expressed as functions of $x^\mu$.


2.2.2. General formulae for the co-factors ${}_jl^k$

Eor the purpose of developing (2-10) explicitly it is convenient to obtain the 
co-factors jp evaluated to 0{e®). We find, using (2*4) and “^6 = 


P = -3\ + i -ZA+ 0(e®) (j^k)', 
P = 2-% + iHpi^+0{e^). 

n=l 

n^k J 


( 2 - 10 ) 


Making use of (2*6) and (2*7) we obtain the alternative expressions 

p = ^l,, + %i% + ’’h) + 0{€^)A 


p = 

Z* = % + H.% 


n 


+0{^), 

+ 0(e3).J 


( 2 - 11 ) 


Equation (2’6) can also be put in the form that equals p, a result which will be 

used later. 





2.2.3. The co-factors in the gravitational field
For the co-factors, from (2.8), (2.10) and (2.11) we have

+0(e3) (g+,), 

= % + 0{^), 

/= % +0{e^), 

/ = -%+iVn+Oi€^), 

n»l 

6^= 0, 

,1^ = 1. 

2.2.4. Electromagnetic field
Here $\varpi$ is in the direction of ${}^5Q$ so that


( 2 - 12 ) 


,lf = 1, / = 0, (2-13) 

${}^5l_5$ being unity as before. In the electromagnetic field the $L_\mu$ necessarily lie in ${}^5q = 0$,
so that displacements along ${}^5Q$ have no meaning for O. Hence it is not necessary
that all the $\Delta x^\mu$ shall be integrable with respect to such displacements in order that
the ${}^kl_j$ shall be expressible in terms of O’s co-ordinates $x^\mu$. It is, however, necessary
that they shall all be integrable with respect to displacements $d^\mu q$; and also that
$\Delta x^4$ shall be integrable with respect to displacements $d^5q$ as well as displacements
$d^\mu q$, for otherwise a charged body situated at the same place as O and having
instantaneously zero velocity would disagree with O about the simultaneity of arrival of
signals. We have therefore


M.JJL and litJ-sl 
di^q c'’q c^q c^q ' 


(2-U) 


2.2.5. Co-factors in the electromagnetic field
For the co-factors from (2.11), (2.13) we have

/ = /‘Z,+ C>(€3), 

11=1 )• (2‘16) 
n^fi 

/ = 0 , 

jZ® = 1. 

The symmetry of the two kinds of field can be seen at once by comparing (2.12)
and (2.15). It remains to show that they have the correct properties to entitle them
to be interpreted as gravitational and electromagnetic fields respectively.


3. Potential energy and the generalized Lagrangian

3.1. The relation between constrained and free systems

Our task is to translate out of the universal co-ordinate system the relations
between the two sets O and P in such a way as to express them in terms of O’s
physical measurements. This must be done without setting up a co-ordinate system
for O, because, as we have seen in § 1.7, O’s measuring system need not lie in a
fourfold. Moreover, such a procedure would leave us without the means of allowing for
O’s ‘anti-time’ blindness. We must therefore confine ourselves to the system of
directions along which O makes his measurements at a point and then find the means
of integrating the differential equations which express the fact that O uses a rigid
space-extended structure.

If we fix our attention on the vector $L_4$ along which O measures time with his
clocks, we have a space-distributed variation in the components of the null angles
between $L_5$ and ${}^5Q$ and between $\varpi$ and ${}^5Q$ respectively. These components ${}^\mu l_5$ and ${}^\mu\varpi$ being
variable functions of the $x^\mu$ it must follow that the $L_4$ vectors through different points
are not parallel to one another. It follows that the $L_4$’s make a varying angle with the
cosmodesic of P, and this in turn leads to O’s observation of P’s motion as curvilinear
and accelerated.

In order to use this result in the analysis, we have to express the variational
condition for the straightness of the cosmodesic

$$\delta\int ds = 0 \qquad (3.1)$$

in terms of O’s physical measurements, i.e. ofi and 

In defining O’s measuring system in § 2 we included an anti-time-like measurement
$\Delta x^5$ (being the displacement along $L_5$ measured on the same scale and in
homogeneous magnitudes with the $\Delta x^\mu$). This is permissible in the differential form, but $\Delta x^5$
must be eliminated before integration, since the proof of § 2 that the $\Delta x^\mu$ are
integrable (i.e. have a single value independent of the path of measurement) does not
apply to $\Delta x^5$.

Now

$$ds^2 = \sum_{k=1}^{5}(d^kq)^2 = \sum_{k=1}^{5}({}^kl_j\,\Delta x^j)^2$$

from (2.2). In order to eliminate $\Delta x^5$, we use the relation

$${}^5l_j\,\Delta x^j = d^5q = ds\sin\Lambda,$$

whence, since as shown in (1.3) and (1.5) ${}^5l_5$ is always unity,

$$\Delta x^5 = ds\sin\Lambda - {}^5l_\mu\,\Delta x^\mu. \qquad (3.2)$$

Thus

$$ds^2 = \sum_{k=1}^{5}\{{}^kl_\mu\,\Delta x^\mu + {}^kl_5(ds\sin\Lambda - {}^5l_\mu\,\Delta x^\mu)\}^2,$$

whence, using (2.4) to obtain approximate values permitted by the postulated
smallness of the l’s, we obtain the relation*

$$ds^2\cos^2\Lambda - 2\,ds\sin\Lambda\Big[\sum_{\kappa=1}^{4}{}^\kappa l_5\,{}^\kappa l_\mu\,\Delta x^\mu\Big] = \{{}^5l_\mu\,{}^5l_\nu - 2\,{}^5l_\mu\,{}^kl_5\,{}^kl_\nu + {}^kl_\mu\,{}^kl_\nu\}\,\Delta x^\mu\,\Delta x^\nu \qquad (3.3)$$

between ds and $\Delta x^\mu$, in which the coefficient of $\Delta x^\mu$ is correct to $O(\epsilon^3)$ and that of
$\Delta x^\mu\Delta x^\nu$ to $O(\epsilon^4)$.

This is the quadratic equation for the interval ds along the cosmodesic of P as it
is found in O’s measuring system.

* This equation contains the same terms as that assumed by Mosharrafa (1948) as the metric
equation for a ‘meta-Riemannian’ space, and his proof that it gives rise to terms which are
interpreted as gravitational and magnetic vector potentials can therefore be applied here. In
the present paper, however, the equation arises naturally as a consequence of the basic
assumptions of § 1.2.
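The elimination of the fifth displacement can be checked numerically. The sketch below is not taken from the paper; it assumes only the two properties of the fifth direction used in the derivation, namely that its fifth direction cosine is unity and that the squares of its five direction cosines sum to unity, and it verifies that the left and right sides of the resulting quadratic relation agree for arbitrary displacements. All names and numerical values are illustrative.

```python
EPS = 0.01

# l[k][j]: cosine between Q's axis k+1 and O's direction L_(j+1).
l = [[1.0 if k == j else EPS * (1 + k + 2 * j) for j in range(5)] for k in range(5)]
# Column for L_5: fifth component unity, remaining squares summing to zero.
for k, value in enumerate([1j * EPS, 0.0, 0.0, EPS, 1.0]):
    l[k][4] = value

dx = [0.3, -0.2, 0.1, 1.0, 0.05]   # displacements along L_1..L_5

# Displacements in the Q-system, the squared interval, and ds*sin(Lambda) = d5q.
dq = [sum(l[k][j] * dx[j] for j in range(5)) for k in range(5)]
ds2 = sum(v * v for v in dq)
ds_sin = dq[4]
ds2_cos2 = ds2 - ds_sin ** 2

# Left side: ds^2 cos^2 - 2 ds sin * [sum over kappa = 1..4 and mu = 1..4].
bracket = sum(sum(l[kap][4] * l[kap][mu] for kap in range(4)) * dx[mu]
              for mu in range(4))
lhs = ds2_cos2 - 2 * ds_sin * bracket


def coefficient(mu, nu):
    """Coefficient of dx[mu]*dx[nu] on the right side, k summed over 1..5."""
    return (l[4][mu] * l[4][nu]
            - 2 * l[4][mu] * sum(l[k][4] * l[k][nu] for k in range(5))
            + sum(l[k][mu] * l[k][nu] for k in range(5)))


rhs = sum(coefficient(mu, nu) * dx[mu] * dx[nu]
          for mu in range(4) for nu in range(4))

print("left side :", lhs)
print("right side:", rhs)
```

Note that the value chosen for the fifth displacement drops out of the comparison, which is the point of the elimination: only the four measurable displacements and the interval remain.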


3.2. The Lagrangian

In order to exhibit the connexion between the variational equation for the
straightness of the cosmodesic and the equations of motion we can define quantities
L, $v^\mu$, and V by the equations

L dscosA 


1 - 


moC* AiC* ’ 

Aa:* " c ~ ’ 

yi. 

— = 1^=1, 
c 


s {vff = VK 


(3-4) 


Inserting from these in (3.1) we obtain the form

$$\delta\int\Big(1 - \frac{L}{m_0c^2}\Big)\sec\Lambda\,\Delta x^4 = 0. \qquad (3.5)$$

Since $\Lambda$ and $m_0$ are time and space constant, this is equivalent to the classical
variational equation $\delta\int L\,dt = 0$ for the motion of a particle in a field of force.

We can accordingly derive the potential energy of the motion from (3.3) by the
following quadratic in L:

(3.6)

In order to interpret this equation in terms of O’s physical measurements, we may 
consider the approximate forms which are appropriate for the various types of field. 


3.3. The gravitational field

Applying the equations of §2*2 and the approximations permitted by (2-4) to 
equation (3-3) we have 

ds^ cos^ A = S { 1 — + 1 S 

Using (2*6) and (2*7) to evaluate the coefficients of the last term, this gives 

ds^ cos2 A = S {1 - {xjr =1= ^), (3*7) 

which reduces to 

ds^ cos2 A = S - {%? - {%Axtf. (3*7) 



Where % is not zero, and we are dealing with the case of spherical symmetry about 
a point distant r from It, this equation can be transformed into spherical polar 
co-ordinates by writing 

% ] 

” 1 ^’ 

ir = 

(this is legitimate since S 4= 0, so that S (e^)^ = 1 and hence may be ' 

A “1 1^-1 

regarded as a set of direction cosines in space at the point R, independent of r) 
and dr2 + j-ad612+r2sin2d#2 = -S(A!ri^)2. 

f-X 

Then (3-7) takes the form 

ds^ cos2 A = {1 - (SZJ®} (Aa;*)2 - r^dd^ - sin^ dd^^ - dr\l -H (3.9) 


Equation (3.9) is recognized immediately as having the same form as the Schwarzschild 
equation, provided the function ^^4 = f(r, θ, φ) is correctly assigned. It must 
again be emphasized that (3.9) is not a metric equation in the sense of determining 
the space constants of a metrical manifold. It is simply the statement of what O 
will find when he makes measurements in his own rigidly constructed system upon 
the motion of the unconstrained body P. It follows that the apparent track of P 
as observed by O satisfies all the physical requirements of a body falling in a gravitational 
field, including the Einstein correction to the Newtonian theory. In the next 
section we shall derive the function *^4 from the differential equations of condition 
obtained in § 2. 
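
For comparison, the Schwarzschild line element in its usual textbook form is set out below. This is a reference sketch only: the symbols G, M, c and t belong to the standard presentation and are not the notation of the present paper, and the identification of the paper's squared field function with 2GM/c²r is an assumption made purely for illustration.

```latex
ds^2 = \left(1 - \frac{2GM}{c^2 r}\right) c^2\,dt^2
     - \left(1 - \frac{2GM}{c^2 r}\right)^{-1} dr^2
     - r^2\,d\theta^2 - r^2 \sin^2\theta\,d\phi^2 ,
\qquad
\left(1 - \frac{2GM}{c^2 r}\right)^{-1} \approx 1 + \frac{2GM}{c^2 r}
\quad \text{when } \frac{2GM}{c^2 r} \ll 1 .
```

To first order the radial coefficient thus has the form 1 plus a small term, and this small term is what supplies the Einstein correction to the Newtonian orbit.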

It will facilitate the work of physical interpretation if we show how the potential 
energy is derived from (3-6). When we omit the Einstein correction terms (corre- 
sponding to dr^{\)^ in (3-9)), (3-6) becomes 


Thus P = moc^ll - y [1 - (%)^ - 5]} . 

which may be compared with the classical Lagrangian 

clearly the potential energy Ω_g of any body in a field of this type is proportional to 
its inertial rest-mass, and is always negative (giving an attractional force). This 
follows from the fact that ^^4 is a real number and we have approximately 


a 


a~ 




(3-10) 



Unified field theory in a curvature-free five-dimensional manifold 51 

3-4. The electromagnetic field 

Inserting = 0 and making use of the approximation of (2-4) we get (3’6) in 
the form 


mocV \ ^ 


S tan A = ( S = S 

I U-i / /i=-i 


by (2*6) and (2-7). 

Now we can make use of the fact that by hypothesis the potential and kinetic 
energies are both small compared with unity and complete the square on the left- 
hand side to obtain 

L== moC^[l - y (l -5) _ (I tan a] . 

In this expression, to exhibit a familiar form, we have retained terms under the 
radical of the same order as those neglected in the approximations. 

We have thus a field of force with potential energy given by 


fig = mgC^tanA 




( 3 - 11 ) 


Now A is a property of the unconstrained body P and is independent of the field 
(being simply the angle by which its cosmodesic diverges from perpendicularity 
to ^Q). We can therefore write 

$m_0 c^2 \tan A = CE$, (3.12) 

where C is a universal constant and E is the electric charge upon P. 

From this it can be seen that we are dealing with a field which does not act on 
bodies in direct proportion to their inertial mass. In particular, a body whose cosmodesic 
lies in = 0 has A = 0 and therefore zero potential energy everywhere. 
Moreover, fields of this type can be attractive or repulsive according to the signs of 
the numbers tan A and ^1^, one of which belongs to P itself and the other to the field. 

In order to complete the demonstration that we have all the properties of electric 
fields, we need to show how to express Ω_E in terms of a vector potential. Since, 
according to the basic assumption for this type of field, L^, Lg, Lg and aU lie in 
= 0 and are mutually orthogonal, the can only correspond to the direction 
cosines of a Minkowski velocity rotation. Referring the latter to the system, we 
can assign real components (^?7, to the velocity and define the mixed quan- 

tities ^^fc, % and u by the equations 
qifTJ 

_ = fu, % - 1, + + (3-13) 

c 

Then the "Z^ are given by v^’Z^ = 1 — , 1 


% - - 2 - V’ 
H. = fu = -n., 


H - 1 -— 


if ^7]) 


{3-14) 



substituting these values in (3- 11), assuming that and u are 0(e) and omitting 
terms in 0(e®), we obtain 

= S + 'l'U%) + ( S + \] • (5- IS) 

■ O ' ^=1 ) 

This differs from the classical form for the potential energy of a charged body 
moving in a field which has magnetic four-vector potential 

As%(%,2tt,®M,l), (3-16) 

only by the presence of terms in Of these, the one entering the electrostatic 

3 

component, viz. S is a small correction on the main part \ of this component 
^=1 

because is 0(e) and so is 

The terms in in the magnetic components appear to be large compared with the 
classical term but they do not give rise to any forces because the magnetic 
force is given by curl A and curl(%) vanishes identically. This follows because 

4 3 

2 {%)^ = 0, so that can be expressed &sf^\ where S iff)^ = !• (3‘17) 

jr=l . 

Hence are the components of a space-vector of magnitude \ and direction 
^ = considerations of spatial symmetry it can be seen that this 

direction n coincides with the direction of maximum rate of change of *^5. It follows 

that % are, the components of the gradient of the scalar and hence curl (H^) 

vanishes identically. The term curl ^u% does not of course vanish in this way because 
the vector u is not parallel to n. The demonstration of the electromagnetic field is 
thus complete. 


4. The inverse square law of force 
4.1. The conservation principle 

The expressions we have derived for potential energy have been shown to possess 
the properties appropriate for the gravitational and electromagnetic potential at 
a point. It is necessary to show also that these are distributed in space according to 
the inverse square law of force. We shall here undertake this demonstration to the 
same approximation as we obtained for the co-factor in § 2. 

It is easy to see that the inverse square law is associated with the time constancy 
of the field and the three-dimensionality of space. In view of the distinction we have 
drawn between Q and O, we cannot make an assumption of the type used in classical 
theory, as to the vanishing of the divergence of a gradient, since this is no longer 
obviously an ‘absolute’ property of O’s material objects in the cosmic manifold. 
We have, however, in the physical conservation principle, geometrically interpreted 
according to § 1.2, a property which must apply to both types of field. 

The physical principle of conservation of energy is connected with the condition 
of integrability of the measurements of O. These conditions were given in § 2, 
equations (2.9) and (2.14). We have to demonstrate how these conditions lead to 




the inverse square law for precisely the two kinds of potential energy we have found 
and no others that are not merely combinations or derivations of these. We do not use 
all of equations (2.9) and (2.14), nor shall we work, in the present paper, to a higher 
degree of approximation than in § 2. 

4* 1 -2. Relative rigidity 

A physical observer O measures space with a ruler, which we can represent by a box 
having sides of length ΔX_1, ΔX_2, ΔX_3 along the vectors L_1, L_2, L_3. It will be recalled 
that these three vectors are mutually orthogonal, so the volume of the box is 
ΔX_1 ΔX_2 ΔX_3. Let A, B be the vertices of one of the sides, say ΔX_3, of the box. 
Translate A to A' along L_4, i.e. in the direction through a displacement δx^4. 
Similarly, translate B to B' along the time-line through B having direction 
^^4 through a displacement δx^4. To the first order 

% = "^4+^AZ3. (4.1) 


But in Q’s measurement, using (2-2), and taking A as the origin of co-ordinates, 
co-ordinates of jB = AXg, 'y 

co-ordinates oi A' 8oc^, I 

co-ordinates of JB' = AXg-h^Zi-^ic^J 

whence 

{A'B'Y = + 

so that to 0{Sa*), ' A'B' = +0<Ja^+ 0{{da^f, eejJ , (< 

where {4 

Evidently the box remains, to the first order, a box, and its volume changes by 




U-iaW’ 


where the term 0(Soi^ee^) has been neglected. It follows that the volume of the box 
remains constant for Q if 3 - , 



4*2. The gravitational potential 


To derive the law of force we make use of the condition (2‘9) that 

shall be 

integrable, namely, 

d^q ~ d*q 

{4-7) 

From (2*7, 8, 10) 


(4-8) 

where 


(4-9) 


1 


and V is the velocity (for O) of the freely falling body having cosmodesic 





Also from (2-7) / = % + 0(e»). (4-10) 

Hence (4*7) becomes 

+ (4-H) 

where* 

AT ^ _V?^A._V P — 

dtq ^-yd^qdop fc5i’ 9a:*’ 

whence 9^= + + 

Hence (4-11) may be written 

^^^{%) + 0{eH„eH'). ( 4 . 14 ) 


Differentiate (4-14) with respect to x'^ and sum over Tj 1 ... 3, obtaining, 


= 2^ ( + 0{eel ee^e', e\, eV), {4-16) 

where V = (i-18) 

^ dH 

Now for O’s time x^, S vanishes in virtue of the assumption made in (4*1), and 

we have, to the present order of approximation, f 

« 0(e%2,ee|). (4*17) 

It was shown in § 3 that is the gravitational potential and clearly the Lag- 
rangian implies, if relativistic corrections are neglected as is the case in the present 
section, conservation of energy in the form 

i{\)^ + = constant. (4-18) 

whence (4-17) yields “ 0. (4*19) 

So that the gravitational potential satisfies Laplace’s equation. 
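
The link between Laplace's equation and the inverse square law can be checked numerically. The following is a minimal sketch and not part of the original derivation: it assumes a point-source potential of the form -k/r in three dimensions (the function names and the strength `k` are illustrative choices), and confirms by finite differences that this potential is harmonic away from the source and that the corresponding radial force falls off as 1/r².

```python
import math


def potential(x, y, z, k=1.0):
    """Point-source potential -k/r; k is an arbitrary positive strength (assumed)."""
    return -k / math.sqrt(x * x + y * y + z * z)


def laplacian(f, x, y, z, h=1e-3):
    """Second-order central-difference estimate of the 3-D Laplacian of f."""
    centre = f(x, y, z)
    total = (f(x + h, y, z) + f(x - h, y, z)
             + f(x, y + h, z) + f(x, y - h, z)
             + f(x, y, z + h) + f(x, y, z - h)
             - 6.0 * centre)
    return total / (h * h)


def radial_force(f, r, h=1e-5):
    """Radial force -dV/dr evaluated on the x-axis by central difference."""
    return -(f(r + h, 0.0, 0.0) - f(r - h, 0.0, 0.0)) / (2.0 * h)


if __name__ == "__main__":
    # Laplacian at a point off the source: should be close to zero.
    print("Laplacian at (1, 2, 2):", laplacian(potential, 1.0, 2.0, 2.0))
    # Doubling the distance should reduce the force by a factor close to four.
    ratio = radial_force(potential, 1.0) / radial_force(potential, 2.0)
    print("F(r=1) / F(r=2):", ratio)
```

The negative sign of the computed force corresponds to attraction towards the source, in keeping with the always-negative gravitational potential energy found in § 3.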


4*3. The electrostatic potential 

From the second set of equations in (2.14), one of the conditions of integrability 
of is 

n/ /»! /*! io\ 

(4*20) 


or 


8*j 3*5 ” 3*5 


(4*21) 


* In a more precise treatment to be given later, it will be shown that the integrability conditions 
imply that not all of the Hf {h =)=^ ) can be 0(e); some must be of a greater order of small quantities. 
† I.e. taking O(e') = Oie^). 




by the same steps as for the gravitational potential. Hence 

+ (*- 22 ) 

giving Laplace’s equation approximately as before. 

It will be noted that both the equations governing the law of force follow from the 
condition of relative rigidity given in § 4.1; in a later publication a more exact 
treatment will be given. For the present purpose it is sufficient to record that the 
Laplace equation is evidently a consequence of the condition that x^4 shall be a curve 
in the cosmic manifold uniquely determined at every point of a rigid body. 

5. The gravito-magnetic field 

5.1. The field of a rotating massive uncharged rigid body 

As discussed in § 1, the concept of rigidity is as fundamental to the present theory 
as it is to classical dynamics. We have, moreover, implied that absolute rotations 
exist by postulating a Q system of co-ordinates with respect to which such rotations 
can be measured. Now consider the case in which a relatively light mass δM rotates 
around a heavy mass M in a circular path. If δM is freely falling, i.e. held to M only 
by gravitational or electromagnetic attraction, it describes a cosmodesic, and its 
own field will be that produced by a body of its charge and mass rotating as it 
rotates, the equations of §§ 3 and 4 being sufficient to give this field. Let u (defined 
as in § 3.4, equations (3.13)) represent the Minkowski rotation in the Q system corresponding 
to the velocity of δM. Then this means that at any given moment the displacement 
of O’s L_1 ... L_5 from ^0 ... ®0 at another point R will be that corresponding 
to the charge and mass of δM, but with the Minkowski rotation about ^0 corresponding 
to the velocity of δM. 

Suppose now that δM is connected by a rigid arm to M, but still rotates at such 
a velocity that centrifugal force balances gravitational attraction. At first sight the 
situation would appear to be unchanged, since the motion of δM is kinematically 
identical, and there is no force in the arm. But clearly the potentialities of the situation 
are now quite different; for example, δM would respond quite differently to a 
frictional retardation or an impact with another body. Hence the situation 
with regard to anti-time must be different. The difference is, in fact, that instead of 
the motion of δM producing the difference between L_1 ... L_5 and ^0 ... ®0 corresponding 
to the rotation u about ^0, this difference is rotated by u about the line 
orthogonal to the L_μ, viz. ϖ. This effect will of course apply equally whether u is 
the velocity which δM would have if it were unattached or any other velocity, but 
it is of special importance to show that it does not vanish even in the former case. 

We shall now show that, in the case of the rigid attachment of a rotating, uncharged 
but massive δM to M, δM produces a magnetic field as if it had a charge 
numerically equal to its gravitational mass but without the corresponding electric 
field. 

We define the following sets of co-ordinates and directions. L_μ and x^μ are respectively 
the unit vectors and the measurements of the physical observer O at the 
point R, made under the influence of the constrained motion of δM. ϖ is orthogonal 
to the L_μ. 

L'_μ and x'^μ are the unit vectors and the measurements which O would make at the 
point R, if δM were present but had no constrained motion. Hence 

di^q = ( 6 - 1 ) 

where the Hj are written for the gravitational Hj of (2.2) for the field of δM. ϖ', 
defined as the direction through R orthogonal to L'_μ, coincides with ϖ because the 
velocity of the field is constrained. 

Since L_μ and L'_μ are not orthogonal sets of vectors, it is not possible to express the 
relation between them simply by a Minkowski rotation of the form given in 
equation (3.14). We therefore use orthogonal co-ordinate sets Y^j, Y'^j, which are 
related to one another by the Minkowski rotation u. 

The direction of Y^ξ for ξ = 1, 2, 3 is that of L_ξ, whilst Y^5 is in the direction of ϖ. 
Y^4 is then orthogonal to all the Y^ξ and to Y^5. The Y'^j are similarly related to L'_ξ 
and ϖ'. Now Y^5, Y'^5 have the same direction ϖ, so that 

dY'/' = /‘A„dY>', (5-2) 


where 








(6-3) 


Writing = ’‘li'j.dY'^, 

we have from equations (6-1) and (2-5) 

% = % (f =1,2.3),! 

Ys = k^> ) 

and from the orthogonality of Y'^ to T'® 

i(W =1,' 

1 

2:(%Vi) = o, . 

1 

6 

=0. 

From (5-2) and (5-4) d’^q = (%%)dY^, 

where we have taken = ®A^ = 0, ®As = 1. 

Now by definition (equation (2-3)) 

Aa:'« = J^d'^v = X) d F'», 


(6-4) 

( 6 - 6 ) 

( 6 . 6 ) 

(6-7) 

(6-8) 




and by the construction given above, the corresponding relation between Ax'^ 
anddr«is 

Ax'’ = (by definition of %), (6-9) 

using (5-7) we must have 
which by way of 

leads to = 

Further, it follows directly from the construction that 

If now we return to the general expression for the Lagrangian (3-6) we can see from 
the foregoing that the cross-product terms in Axi^ Ax” are the same as those obtained 
in (3-7) for the gravitational field. There is, however, a linear term in because 
%=¥% and the former therefore are not zero; whence it follows that 

(5-12) 

V=1 

We may choose to be of the form 

%=l-hO(e2), ”l^=0{e) (vdp/i). 

It is necessary, however, to calculate ”1^. From equations (6'10) and (2*8) 

% = (6-13) 

It is easy to evaluate if terms in {%'!'. vPi) are neglected; for by (5-3) and (6-8) 

where we have used the fact that ’A^ = 1 -f 0(«®) in deriving the first term on the 
right-hand side. But, by equation (2-10), 

5^'>’==_v^'+ 2 = s + 

1 1 

n^v 

by equation (5'6). Then since — 0 and neglecting terms of order (e®, lie®) ■ 

or + 0(e%, li®, e®). (5-14) 

FmaUy, *ls = 

or *^5 = 0(e®,it®). (5-15) 

By a similar procedure we find that \ is equal to unity to the order (e®, m®). Sub- 
stituting these values in equations (3-11) and neglecting u'l'\ in comparison with 
as (6-14) and (5-15) have shown to be permissible, we find a potential energy 
for a body having charge E and moving with velocity v in the field of 8M given by 

® 

V = SX«M+0(e®,ii®). 

rs' ti-=i 


(5-16) 




Comparison of (5.16) with (3.16) shows that the first three terms correspond to an 
electromagnetic field of magnitude such that ^ replaces *^5; now by (2.6) ^ is the 
same as — which is the gravitational rotation due to δM; from this it follows that 
a moving gravitational unit of matter δM which attracts an equal mass at 1 cm. 
distance with a force of one dyne, will produce a magnetic field corresponding to an 
electrostatic unit of charge defined the same way and having the same velocity. 
This is Wilson’s (1923) hypothesis to account for the magnetic field of the earth. 
The fourth or electrostatic term is not ^ being an order of magnitude in smaller. 
It follows that the gravitomagnetic field should not exhibit an observable electrostatic 
part. 

In the case of a rotating rigid massive sphere, integration of this magnetic field 
gives a magnetic dipole of moment P given by Blackett’s (1947) relation 

$P = \beta \dfrac{G^{1/2}}{2c}\, U$, (5.17) 

where U is the angular momentum and β = 1. It will be noted that the derivation 
requires that the rotating body should be rigid and that small departures from rigidity 
will have a marked effect on the field. For incompletely rigid bodies like the sun and 
the earth β will therefore not have exactly the value unity. The results of Hales & 
Gough (1947) show that the earth’s magnetic field decreases on descending a deep 
mine. This is in conformity with the hypothesis that the magnetic field is a consequence 
of the gravitational field and cannot be explained on the hypothesis that 
the magnetic field arises from the presence of electric currents flowing in the Earth’s 
core. 
6. Discussion 

The theory put forward in the present paper seeks to derive fields directly from 
the data of experience, namely that physical processes involve ‘observables’ and 
‘unobservables’. This requires a five-dimensional framework which is taken as 
a simple extension of Minkowski’s (1908) ‘absolute world’ by adding a fifth 
orthogonal direction labelled anti-time. We then make the distinction between 
observable effects in time (e.g. kinetic energy) and unobservable effects in anti-time 
(e.g. potential energy). We further adopt the ‘common sense’ notion of 
enduring objects in the form of the idealized physical observer O with his measuring 
system of rigid rulers and clocks and the observed body P idealized as a point 
mass moving without constraint. The absolute co-ordinate system of the reference 
framework is defined by means of a hypothetical absolute observer Q. 

Three subsidiary conceptions which arise naturally from the fundamental ones 
are then formulated. The first is that of ‘true’ straightness defined by straight 
lines in the Q co-ordinate framework. The unconstrained P-body is associated 
with a straight time line called the cosmodesic. The second subsidiary conception 
concerns the rigidity of the measuring system. This is defined by a set of unit 
vectors at every point. If a constraint (i.e. a field of force) is present these vectors 
do not coincide with any set of Q-co-ordinates, but diverge from them by small 
angles. Finally, we have the third notion of ‘anti-time’ blindness. This is implicit 
in any five-dimensional theory as de Broglie (1927) pointed out as far back as 
1927, but its consequences for field theory have apparently never before been 
sufficiently explored. 

With these simple notions it is possible to set up a field theory which exhibits 
a remarkable symmetry between gravitational and electrostatic fields. It gives 
a complete account of electromagnetism and it furnishes a generalized Lagrangian 
from which the gravitomagnetic effect can be derived and the absence of an 
associated electrostatic field explained. 

These results appear to run counter to the demonstration that the Schwarzschild 
metric cannot be embedded in a curvature-free fivefold. The explanation is 
simple: the fundamental equation (3.3) is not that of a metric but of a system of 
directions at a point. We are not concerned with the metric inherent in the 
framework, but with peculiarities in the measuring instruments used by O to 
observe the motion of P. While it is not quite true to say that ‘curvature is in 
the eye of the beholder’, our theory shows that a ‘truly’ straight path may have 
the appearance of an accelerated motion in a central force field, solely due to 
the property of rigidity as constraint without deformation and the anti-time 
blindness of the physical observer. 

The merit of a theory largely turns upon the range of observations of which it 
can give a simple and consistent account. The theory of a five-dimensional world 
has proved of value outside field theory. It is well known from the works of 
Kaluza (1921), Klein (1926, 1927), de Broglie (1927), Flint (1942, 1945), Fisher (1929), 
Rosenfeld (1927), Wilson (1928) and others that there are several attractive 
features in a five-dimensional wave mechanics, the wave function being associated 
with a periodicity in the fifth dimension. Throughout Eddington’s treatment of 
fields as a system of relations between pairs of particles, he makes much use of 
the system of five independent parameters (Eddington 1936 and 1946), but he 
does not regard the fifth co-ordinate, ‘the phase co-ordinate’, as having the same 
physical status as the other four. 

We hope to show in a future communication that a very satisfactory treatment 
of wave mechanics can be given within the framework of the present theory. 
It also suggests the nature of the exchange forces and the meson field theory 
developed for the purposes of nuclear physics. In our view, general field theory 
must be regarded as the foundation upon which any cosmology must be constructed 
in our day. One of the most disappointing features of general relativity has been 
the failure to build a bridge to join it to the rest of physical science. It has been 
our aim to construct a scheme so closely related to our common experience that 
it should be equally applicable to the description of all physical processes. In 
particular, we feel the need to establish a descriptive scheme which shall enable 
dynamics, atomic physics and the statistical systems of thermodynamics to be 
exhibited as a single coherent system. We believe that this is not possible so long 
as four-dimensional space-time, however generalized, is taken as the framework 
to which all physical processes must be referred. We have endeavoured to show 
that a consistent and fruitful world-picture is obtained by extending the space-time 
framework to a five-dimensional scheme free from the complications of a 
Riemannian or affine geometry. 





Appendix. List of symbols 

Q absolute observer with five orthogonal rectilinear co-ordinates 

is called anti-time axis of Q. 

\ ... ^2 mixed co-ordinate set related to ... by equations (Lla). 

(X'-, X^, X®) (X*) ruler and clock measurements of physical observer 0. 

...oc^ mixed sets of variables related to X^ . . . X* by equations ( 1 * la). 

P freely falling body characterized by a straight track (cosmodesic) in 
the five-dimensional manifold. 

— A angle between 0® and the cosmodesic of P. 

E electric charge of P. 

mo inertial mass of P. 

L_1 ... L_4 unit vectors through the origin R along which O’s ruler and clock readings 
increase at maximum rates respectively. 

L_5 unit vector through R along which O’s ruler and clock readings remain 
stationary (the direction of anti-time for O). 

ϖ unit vector orthogonal to L_1, L_2, L_3, L_4, having components (jV7 ... gw) 
with respect to Q. 

j, h, n, s, V, <j), p super- or subscripts which assume the values 1 , 2, 3, 4, 6 . 

/i, V, (T super- or subscripts which assume the values 1,2,3, 4. 

^ super- or subscripts which assume the values 1, 2, 3. 

The double suffix summation convention applies to each set over 
these values. 

% purely real or purely imaginary numbers specifying the direction of Lj, 

in the system. 

Ax^ infinitesimal displacements along L^. 

Ω potential energy of P in a field of force. Ω_g gravitational, Ω_E electromagnetic. 

t time co-ordinate of 0. 

jp the co-factor of Hj in I Hj || divided by the value of this determinant. 

r, θ, φ spherical polar co-ordinates in the space of O. 

direction cosines in space defined by equation ( 3 ' 8 ). 

V velocity of P relative to 0, components {V[, TJ, 1^). 

Vi,V 2 ,v^ defined as iVJc, etc. 

C an universal dimensional constant. 

U (in § 3) a velocity defined by the values of the in an electromagnetic field 

(identified with the velocity of the field-producing body with respect 
to Q and 0). 

A electromagnetic four-vector potential defined by ( 3 * 16). 

(/i> f%> fz) direction cosines in space defined by (3‘17). 
e, ^ small quantities. 

fig the sth differential of e with respect to any 

the sth differential of 8 with respect to any i^q. 

P the field-force vector in classical field theory. 

element of mass of a field-producing heavy body. 




U (in § 5) velocity of dM with respect to Q . (i7i, U^, u^) defined from U 

in, the same way as (vi, v^, v^) defined from V. 

directions of measurement, and measurements respectively which 0 
would have at B under the field of SM if SM had no velocity with 
respect to Q, 
direction cosines of 
xss' direction orthogonal to 

axes along L^. 7® axis along xss. axis orthogonal to 7^ and 7®. 

7'^ axes along 7'® axis along tu' (== tzt). 7'^ axis orthogonal to 7'^ 

and 7'5. 

direction cosines of with respect to 7'^. 

^/JL^ direction cosines of 7'^‘ with respect to 

δ^ν_ω the Kronecker delta (= 0 when ν ≠ ω, = 1 when ν = ω). 


References 

Blackett, P. M. S. 1947 Nature, 159, 658. 
de Broglie, L. 1927 J. Phys. Radium, 8, 65. 
Cartan, E. 1932 Actualités Scientifiques et Industrielles, No. 44. 
Eddington, A. S. 1924 Mathematical theory of relativity, 2nd ed., p. 149. Cambridge University Press. 
Eddington, A. S. 1936 Relativity Theory of Protons and Electrons. Cambridge University Press. 
Eddington, A. S. 1946 Fundamental Theory. Cambridge University Press. 
Fisher, J. W. 1929 Proc. Roy. Soc. A, 123, 489. 
Flint, H. T. 1942 Phil. Mag. 33, 369. 
Flint, H. T. 1945 Phil. Mag. 36, 365. 
Hales, A. L. & Gough, D. I. 1947 Nature, 160, 746. 
Kaluza, Th. 1921 S.B. preuss. Akad. Wiss. p. 968. 
Klein, O. 1926 Z. Phys. 37, 895. 
Klein, O. 1927 Z. Phys. 41, 407. 
Minkowski, H. 1908 Nachr. Ges. Wiss. Göttingen, p. 53. 
Mosharrafa, A. M. 1948 Phil. Mag. 39, 728. 
Rosenfeld, L. 1927 Bull. Acad. Roy. Belge, p. 304. 
Wilson, H. A. 1923 Proc. Roy. Soc. A, 104, 451. 
Wilson, W. 1928 Proc. Roy. Soc. A, 118, 441. 
Wilson, W. & Cattermole, J. 1939 Phil. Mag. 30, 84. 



The heats of formation of free CN and free CH2, and the 
relationship between D(CO), D(CN) and D(N2) 

By L. H. Long, Ph.D., University College, Exeter 
(Communicated by R. G. W. Norrish, F.R.S. Received 3 November 1948) 

The heats of formation of CN and CH2 have been calculated by a number of independent 
methods and shown to have the approximate values -92.5 and -70 kcal. respectively. The 
former value, which is independent of any data on cyanogen, receives close support from 
a recent estimate of the lattice energies of sodium and potassium cyanides, and corresponds 
to ~114 kcal. for the heat of dissociation of C2N2 into two CN radicals. 

The information provided by the heat of formation of CH2 regarding the energies involved 
in the stepwise dissociation of methane furnishes an indication of the true values for the heat 
of atomization of graphite and the dissociation energy of carbon monoxide. 

The relation linking D(CO), D(CN) and D(N2) has been recalculated from the derived 
value for the heat of formation of CN and utilized as far as possible for deciding on the 
correct dissociation energies. Of the four values which have been proposed for D(CO) and 
the three values for D(N2), it is possible to eliminate two of the former and one of the latter. 


Introduction 

The dimensions of the dissociation energies of CO and N2 have been conclusively 
settled in neither case, there being at least four values contested for the former and 
three for the latter. This being so, one cannot afford to overlook the manner in which 
they are interrelated, since the fixing of either value should lead, in conjunction with 
sufficient evidence concerning CN, to the fixing of the other. The relationship takes 
the form 

D(CO) = D(CN) - ½D(N2) + x kcal., (1) 

where x involves D(O2) and the heats of formation of CO and CN from graphite, 
molecular oxygen and molecular nitrogen respectively. 
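
The origin of relation (1) can be made explicit by a simple thermochemical cycle. The derivation below is a sketch in notation introduced here for illustration only: L_C denotes the heat of atomization of graphite and Q_f a heat of formation counted as heat evolved (so that an endothermic formation, as for CN, is negative). Neither symbol is taken from the paper.

```latex
\begin{aligned}
D(\mathrm{CO}) &= L_{\mathrm{C}} + \tfrac{1}{2} D(\mathrm{O_2}) + Q_f(\mathrm{CO}), \\
D(\mathrm{CN}) &= L_{\mathrm{C}} + \tfrac{1}{2} D(\mathrm{N_2}) + Q_f(\mathrm{CN}), \\
D(\mathrm{CO}) &= D(\mathrm{CN}) - \tfrac{1}{2} D(\mathrm{N_2}) + x,
\qquad
x = \tfrac{1}{2} D(\mathrm{O_2}) + Q_f(\mathrm{CO}) - Q_f(\mathrm{CN}).
\end{aligned}
```

Subtracting the second line from the first eliminates L_C, which is why the relation can be used without prior knowledge of the heat of atomization of carbon.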

A relation of this kind has recently been employed by Springall (1947) in an 
attempt to decide between the various values proposed for the heats of atomization 
of carbon and nitrogen. Quite independently, Long & Norrish (1946a) examined the 
question of the heat of atomization of carbon, and concluded from several other lines 
of evidence that the high value around 170 kcal./g.-atom selected by Springall is 
not a permissible figure. Since the respective authors draw opposing conclusions, 
some explanation of the discrepancy is called for. In the present paper an explanation 
is suggested, and further evidence relating to the heats of atomization of carbon 
and nitrogen discussed. 

With respect to the thermochemical values from which x (equation (1)) is derived, 
the heat of formation of CO (Wagman, Kilpatrick, Taylor, Pitzer & Rossini 1945) is 
accurately known from the heat of combustion, and the long-accepted spectroscopic 
value for D(O2) (Herzberg 1939) remains unchallenged. The position regarding the 
heat of formation of the free CN radical is far less satisfactory. Hitherto, this has 
always been calculated from the heat of formation of C2N2 and its heat of dissociation 
into two CN radicals. Apart from the fact that the heat of formation of C2N2 is 



probably not known with an accuracy better than ±10 kcal./mol., no decisive value 
for the energy absorbed in the process 

C2N2 -> 2CN 

is available. Published experimental values exhibit wide disagreement, the extreme 
figures being 77 and 146 kcal.: 

                                        D(NC-CN) (kcal.) 
Hogness & Ts’ai (1932)                  >127 
Kistiakowsky & Gershinowitz (1933)      77 (±4) 
White (1940)                            146 (±4) 
Robertson & Pease (1942a, b)            125-130 

Various authors have selected different values on which to base their calculation. 
Obviously, confirmation for the chosen figure by an independent means is required 
before confidence can be placed in any one of their widely differing conclusions. 

Fortunately, three other lines of experimental evidence are available which have 
not been applied to this problem before. They are all independent of any work on 
C2N2, and rest upon known facts concerning other molecules containing the CN 
group. The heats of formation of these molecules are required, and it would be 
convenient to discuss these first. 


The heats of formation of certain cyanides 
Methyl cyanide 

The heat of combustion of CH3CN per mol. at constant pressure is given as 
312.14 kcal. for the gas by Thomsen (1905) and 304.0 kcal. for the liquid by Lemoult 
(1909). These figures have been corrected by Kharasch (1929) for more accurate 
physical constants, as follows: 

                              heat of combustion (kcal.) 
CH3CN gas (Thomsen)           310.4 (18° C) 
CH3CN liquid (Lemoult)        302.4 (room temp.) 

Since the heat of vaporization at the relevant temperature is 8.05 kcal. (Heim 1933), 
agreement is excellent, in fact, fortuitously good. This value for the heat of combustion 
leads to -19.8 kcal. for the heat of formation of gaseous CH3CN at 18° C, 
which figure corresponds to -21.4 kcal. at 0° K. 
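
The arithmetic behind these statements can be retraced as follows. This is a sketch under stated assumptions, not the author's own working: the heat of formation of CO2 is the graphite figure of 94.06 kcal. quoted later in the paper, while the figure of 68.32 kcal. for liquid water is an assumed modern value that does not appear in the text. The function name is hypothetical.

```python
# Heats of formation counted as heat evolved, kcal per mol.
Q_CO2 = 94.06   # CO2 from graphite, the figure quoted in the paper's cyanogen section
Q_H2O = 68.32   # liquid water; assumed value, not given in the paper


def heat_of_formation(n_co2, n_h2o, heat_of_combustion):
    """Heat of formation from a heat of combustion by Hess's law.

    n_co2 and n_h2o are the moles of CO2 and H2O(l) produced per mole burnt;
    any nitrogen is released as N2, which contributes nothing.
    """
    return n_co2 * Q_CO2 + n_h2o * Q_H2O - heat_of_combustion


# Consistency of the two corrected combustion figures:
# liquid value plus heat of vaporization should reproduce the gas value.
gas_from_liquid = 302.4 + 8.05

# CH3CN + 11/4 O2 -> 2 CO2 + 3/2 H2O + 1/2 N2
q_f_ch3cn = heat_of_formation(2, 1.5, 310.4)

if __name__ == "__main__":
    print("gas value implied by liquid data:", round(gas_from_liquid, 2))
    print("heat of formation of gaseous CH3CN:", round(q_f_ch3cn, 1))
```

The two routes to the gas-phase heat of combustion differ by only a few hundredths of a kilocalorie, which is the agreement the text describes as fortuitously good.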

Iodine cyanide 

The only data for ICN are due to Berthelot (1875), who measured the heat of 
formation from potassium cyanide and elementary iodine in aqueous solution, and 
also the heat of solution of ICN. Berthelot’s figures have been combined with 
modern thermochemical data at the National Bureau of Standards, Washington, to 
give -40.4 kcal. for the molecular heat of formation of solid ICN from its elements 
in their standard states at 25° C.* This figure, together with the heats of sublimation 

* F. D. Rossini, private communication. 




of iodine (Gillespie & Fraser 1936) and ICN (Ketelaar & Kruyer 1943), yields
−47.3 kcal. at 25° C corresponding to −47.1 kcal. at 0° K for the heat of formation
of gaseous ICN from gaseous I2, N2 and graphite, this being the quantity which will
be required subsequently. (In the absence of experimental data for the heat capacity
of solid ICN above 25° C, the fact that measurements of Ketelaar & Kruyer refer to
the temperature range 64 to 163° C and not to 25° C has been ignored. The error
thereby introduced would not amount to more than a few tenths of a kilocalorie.)

Hydrogen cyanide 

The heat of formation of gaseous HCN can be calculated from the heat of combustion,
for which Berthelot (1881) and Thomsen (1905) give figures differing by
only a few tenths of 1 %. According to Kharasch (1929), Thomsen's figures are to
be preferred for gases and very volatile compounds, those of Berthelot usually being
too high. Kharasch further recommends applying a correction of −0.4 % to
Thomsen's figures to bring them into accord with modern determinations. This
reduces Thomsen's figure from 158.62 to 158.0 kcal., which with modern data gives
−29.8 kcal. for the heat of formation of gaseous HCN at 18° C, or −29.9 kcal.
at 0° K.

Cyanogen 

From published and rather discordant data on the heat of combustion of C2N2,
Bichowsky & Rossini (1936) calculate the molecular heat of formation of gaseous
cyanogen from diamond and nitrogen to be −71 kcal. In this the figure 94.45 kcal.
was utilized for the heat of formation of CO2 from diamond. Employing instead the
figure 94.06 kcal. for the heat of formation of CO2 from graphite as reference state,
the value for C2N2 becomes −71.8 kcal. at 18° C or −71.4 kcal. at 0° K.
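
The change of reference state can be checked with a few lines of arithmetic. The sketch below is only illustrative: it assumes that the heat of formation was obtained from the combustion C2N2 + 2O2 → 2CO2 + N2, so that each of the two carbon atoms shifts the result by the difference between the two CO2 figures, and it uses the sign convention of the text (heat evolved counted positive). The function name is hypothetical.

```python
# Heats in kcal., counted positive when heat is evolved (convention of the text).
HF_CO2_FROM_DIAMOND = 94.45   # figure used by Bichowsky & Rossini
HF_CO2_FROM_GRAPHITE = 94.06  # figure adopted in the text
CARBON_ATOMS_IN_C2N2 = 2

def rebase_to_graphite(hf_from_diamond):
    """Shift a heat of formation from the diamond to the graphite reference state."""
    shift_per_carbon = HF_CO2_FROM_DIAMOND - HF_CO2_FROM_GRAPHITE
    return hf_from_diamond - CARBON_ATOMS_IN_C2N2 * shift_per_carbon

hf_c2n2_graphite = rebase_to_graphite(-71.0)
print(round(hf_c2n2_graphite, 1))
```

Under these assumptions the result rounds to the −71.8 kcal. quoted for 18° C.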

The reduction of the foregoing heats of formation to absolute zero was accomplished
with the aid of thermodynamical data from Wagman et al. (1945) for graphite,
hydrogen and nitrogen, Giauque (1931) for gaseous iodine, Thompson (1941) for 
methyl cyanide and cyanogen, Stevenson (1939) for iodine cyanide and Gordon (1937) 
for hydrogen cyanide. 

The heat of formation of the free CN radical

The long wave-length limits of the second regions of continuous absorption of
CH3CN and ICN are 1600 A (Herzberg & Scheibe 1930; Cutler 1948) and 2100 A
(Badger & Woo 1931; Mooney & Reid 1931, 1932) respectively, these regions corresponding
to photodissociation in which, as demonstrated by fluorescence, CN
radicals are produced in the excited B state (Terenin & Neuimin 1934, 1935;
Neuimin & Terenin 1936; Yakovleva 1938). The respective limiting energies are
178.6 and 136.1 kcal. Subtracting in each case the excitation energy of the CN
radical, namely, 73.6 kcal., the values obtained for D(CH3-CN) and D(I-CN)
are 105.0 and 62.5 kcal. respectively. If the products of photodissociation carry
away excess energy with them, whether translational, rotational or vibrational,
then these figures must be reduced correspondingly. From either value the heat of
formation of the CN radical may be deduced.
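
The conversion from a continuum limit to a dissociation energy can be sketched as follows. The conversion factor of 2.8575 × 10⁻³ kcal. per mol. per cm.⁻¹ is an assumption: it is not stated in the text and is chosen here only because it reproduces the quoted limiting energies. The function names are illustrative.

```python
# Assumed conversion factor (not given in the text): kcal. per mol. per cm^-1.
KCAL_PER_WAVENUMBER = 2.8575e-3
# Excitation energy of the B state of CN, in kcal., as quoted in the text.
CN_B_EXCITATION = 73.6

def limit_energy_kcal(wavelength_angstrom):
    """Energy in kcal. per mol. of a quantum of the given wave-length."""
    wavenumber = 1.0e8 / wavelength_angstrom  # cm^-1
    return wavenumber * KCAL_PER_WAVENUMBER

def dissociation_energy(wavelength_angstrom):
    """Continuum-limit energy less the excitation energy carried by CN(B)."""
    return limit_energy_kcal(wavelength_angstrom) - CN_B_EXCITATION

d_ch3_cn = dissociation_energy(1600.0)  # CH3CN continuum limit
d_i_cn = dissociation_energy(2100.0)    # ICN continuum limit
print(round(d_ch3_cn, 1), round(d_i_cn, 1))
```

These are upper limits in the sense explained above, since any kinetic or internal energy of the fragments is included in them.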





For the energy required to remove the first hydrogen atom from the methane
molecule at 0° K, Kistiakowsky & Van Artsdalen (1944) give 100.8 ± 1 kcal. This
quantity involves a figure for D(HBr) which needs some correction. From the
molecular heat of formation of HBr and the heat of vaporization of bromine at
25° C, 8.66 and 7.34 kcal. respectively (Rossini, Wagman, Evans, Blau & Levine
1947), the heat of formation of HBr from hydrogen and gaseous bromine is 12.33 kcal.,
which, with thermodynamical data for H2 (Wagman et al. 1945) and for Br2 and
HBr (Gordon & Barnes 1933), corresponds to 12.22 kcal. at 0° K. Combining this
with D(H2) and D(Br2) (Herzberg 1939), D(HBr) becomes 86.5 kcal., which corrects
the figure given by Gaydon (1948). The figure employed by Kistiakowsky & Van
Artsdalen is 85.8 kcal., so that D(CH3-H) becomes 101.5 kcal. at 0° K. This can be
combined with the heat of formation of methane at 0° K (Prosen, Pitzer & Rossini
1945) as follows:


                                                   kcal.
2C(graphite) + 1½H2 + ½N2 → CH3CN(gas)            − 21.4
CH3CN(gas) → CH3 + CN(X ²Σ)                       −105.0
CH3 + H → CH4(gas)                                +101.5
½H2 → H                                           − 51.6
CH4(gas) → C(graphite) + 2H2                      − 15.987

C(graphite) + ½N2 → CN(X ²Σ)                      − 92.5
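
The correction applied to the methane figure before it enters this cycle can be retraced numerically. The sketch below assumes that the bond energy of Kistiakowsky & Van Artsdalen depends on D(HBr) additively, so that a change in D(HBr) is carried over unchanged to D(CH3-H); the variable names are illustrative.

```python
# All quantities in kcal.; values are those quoted in the text.
HF_HBR_FROM_LIQUID_BR2 = 8.66   # heat of formation of HBr, bromine liquid
HEAT_VAPORIZATION_BR2 = 7.34    # per mol. of Br2

# Only half a mol. of Br2 is needed per mol. of HBr.
hf_hbr_from_gaseous_br2 = HF_HBR_FROM_LIQUID_BR2 + HEAT_VAPORIZATION_BR2 / 2

D_HBR_CORRECTED = 86.5          # value derived in the text
D_HBR_USED = 85.8               # value used by Kistiakowsky & Van Artsdalen
D_CH3_H_UNCORRECTED = 100.8

# Assumption: the correction to D(HBr) transfers directly to D(CH3-H).
d_ch3_h = D_CH3_H_UNCORRECTED + (D_HBR_CORRECTED - D_HBR_USED)
print(round(hf_hbr_from_gaseous_br2, 2), round(d_ch3_h, 1))
```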


The heat of formation of the CN radical can also be calculated from the heat of
formation of ICN, D(I-CN) and D(I2) (Herzberg 1939):

                                                   kcal.
C(graphite) + ½N2 + ½I2(gas) → ICN(gas)           − 47.1
ICN(gas) → I(²P3/2) + CN(X ²Σ)                    − 62.5
I(²P3/2) → ½I2(gas)                               + 17.8

C(graphite) + ½N2 → CN(X ²Σ)                      − 91.8


These two calculations from the absorption spectra confirm one another to well
within the limits of experimental error. Presumably the spectra of ClCN and BrCN
would exhibit similar regions of continuous absorption, but these have not been
examined at sufficiently short wave-lengths.

The third method of calculating the heat of formation of CN depends upon an
extrapolation of vibrational levels in the absorption spectrum of HCN (Funke &
Lindholm 1937) according to the method of Rydberg. The extrapolation leads to
41,000 cm.⁻¹ or 117 kcal. for D(H-CN). An indication of the reliability of the
method is provided by the very similar case of acetylene (Funke & Lindholm 1937),
where the value calculated for D(H-C2H) is virtually identical with that obtained
from the continuum onset at 2350 A (Cherton 1942, 1943). (Here the result of the
Rydberg extrapolation is some 25 % below that provided by the Birge-Sponer
method which, as generally recognized, normally gives a figure considerably in
excess of the true value.) Again it is possible to deduce the heat of formation of CN:

                                                   kcal.
C(graphite) + ½H2 + ½N2 → HCN(gas)                − 29.9
H(²S) → ½H2                                       + 51.6
HCN(gas) → H(²S) + CN(X ²Σ)                       −117.2

C(graphite) + ½N2 → CN(X ²Σ)                      − 95.5



The three independent figures thus obtained for the molecular heat of formation
of free CN are in satisfactory agreement in view of the rather large possible errors:

                                        kcal.
continuum onset for CH3CN               −92.5
continuum onset for ICN                 −91.8
Rydberg extrapolation for HCN           −95.5
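
The three figures follow from summing the steps of the respective cycles, which may be verified as below. This is merely a bookkeeping sketch; the step values are those read from the cycles above (heat evolved counted positive, in kcal. at 0° K), and the dictionary keys are illustrative labels.

```python
# Each cycle is a list of heats (kcal., positive = heat evolved) whose sum is
# the heat of the net process C(graphite) + 1/2 N2 -> CN.
cycles = {
    "continuum onset for CH3CN": [-21.4, -105.0, +101.5, -51.6, -15.987],
    "continuum onset for ICN": [-47.1, -62.5, +17.8],
    "Rydberg extrapolation for HCN": [-29.9, +51.6, -117.2],
}

heats_of_formation = {name: sum(steps) for name, steps in cycles.items()}

for name, value in heats_of_formation.items():
    print(f"{name}: {value:.1f} kcal.")

# Spread between the extreme figures, for comparison with the stated errors.
spread = max(heats_of_formation.values()) - min(heats_of_formation.values())
```

The spread of under 4 kcal. is the basis for describing the agreement as satisfactory.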


Of these, the first two would normally be more rightly regarded as lower limits
(algebraically speaking), but this would be absolutely certain only if the CN fluorescence
had been observed right up to the long wave-length limits of the respective
continua of CH3CN and ICN. Until this point has been examined in detail, there is
also a possibility that these figures err somewhat on the high side. In the case of
iodine cyanide, however, Yakovleva (1938) has provided further evidence which
limits this possibility. When ICN was illuminated with the light from a zinc spark
containing the wave-lengths 2100, 2064 and 2026 A no fluorescence was observed.
With an aluminium spark containing the lines 1990, 1935, 1864 and 1854 A, however,
fluorescence was obtained. Here the type of emission was such as to indicate that the
sum of the vibrational and rotational energies of the CN radicals produced was
small, in contrast to the internal energies of the radicals produced by the hydrogen
lamp. Allowing for a certain amount of translational energy, this is strong evidence
that the figure derived from the continuum limit around 2100 A will not be very many
kilocalories above or below the true value.

In the following calculations the value −92.5 kcal. will be adopted for the heat of
formation of CN, less because it is the intermediate of the three figures than because
the heat of formation of methyl cyanide is known with rather greater precision than
is the case for the other two compounds. From this value and the heat of formation
of cyanogen, D(NC-CN) at 0° K may be estimated:

                                                   kcal.
2C(graphite) + N2 → 2CN(X ²Σ)                     −185.0
C2N2(gas) → 2C(graphite) + N2                     + 71.4

C2N2(gas) → 2CN(X ²Σ)                             −113.6

From this, D(NC-CN) is seen to assume the value 113.6 kcal., a very reasonable
figure lying approximately midway between the highly discordant extremes in the
literature. No high accuracy is claimed for this figure. Nevertheless, the foregoing
evidence is scarcely compatible with the extreme values 77 (Kistiakowsky & Gershinowitz
1933) and 146 kcal. (White 1940). In going to press, it is interesting to
note that Glockler (1948a) also rejects these two values from a study of the respective
lattice energies of sodium cyanide and potassium cyanide in conjunction with the
Born-Haber cycle. The values Glockler thus obtains for D(NC-CN) are 117.6 and
119.6 kcal. respectively, in good agreement with the figure calculated here, and
hence with the value derived for the heat of formation of CN.

The reasonable dimension of the figure obtained for D(NC-CN) is also indicated
by a study of the reaction between cyanogen and hydrogen at high temperatures
(Robertson & Pease 1942b), since, as pointed out by Steacie (1946), the process

H + (CN)2 → HCN + CN





must have a low activation energy in order that it may serve as a chain-carrying
step. This could not be the case if D(NC-CN) were much greater than D(H-CN)
(c. 117 kcal.). In harmony with this, Robertson & Pease observed that the equilibrium

H2 + (CN)2 ⇌ 2HCN

lies experimentally far to the right, indicating that D(H-CN) is appreciably greater
than the mean of D(H2) and D(NC-CN). This would put an upper limit of about
130 kcal. on D(NC-CN). These points provide further evidence against White's
high value which Springall (1947) favours. The figure derived in the present paper
is further seen to be reasonable when compared with D(CH3-CN) and D(CH3-CH3),
which latter value may be calculated from the heats of formation of methane and
ethane at 0° K (Prosen, Pitzer & Rossini 1945), D(H2) and the figure 101.5 kcal.
for D(CH3-H) (see previously):



                    kcal.
D(CH3-CH3)           84.3
D(CH3-CN)           105.0
D(NC-CN)            113.6
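
The figure for D(NC-CN) and the upper limit implied by the hydrogen-cyanogen equilibrium can be retraced as follows. The sketch assumes, as the argument in the text does, that the position of the equilibrium is governed by the heat of reaction alone, so that the limiting case is 2D(H-CN) = D(H2) + D(NC-CN); the variable names are illustrative.

```python
# Heats of formation (kcal. at 0 deg K, positive = heat evolved), from the text.
HF_CN = -92.5
HF_C2N2 = -71.4

# Energy absorbed in C2N2 -> 2CN.
d_nc_cn = HF_C2N2 - 2 * HF_CN

# Upper limit from the equilibrium H2 + (CN)2 <-> 2HCN lying far to the right,
# i.e. D(H-CN) exceeding the mean of D(H2) and D(NC-CN).
D_H_CN = 117.0
D_H2 = 103.2
upper_limit_d_nc_cn = 2 * D_H_CN - D_H2

# Comparison of the middle member of the series with the mean of the other two.
D_CH3_CH3 = 84.3
D_CH3_CN = 105.0
mean_of_end_members = (D_CH3_CH3 + d_nc_cn) / 2

print(round(d_nc_cn, 1), round(upper_limit_d_nc_cn, 1), round(mean_of_end_members, 2))
```

With these inputs the derived D(NC-CN) lies well below the upper limit, and D(CH3-CN) exceeds the mean of the two end members by about 6 kcal.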


The middle member has a rather higher dissociation energy than the mean value
of the other two. This is usually observed for similar series. Here it is conditioned
by the heats of formation of the three compounds concerned. Other workers have
argued, in support of a higher value for D(NC-CN), that conjugation effects within
the molecule will greatly strengthen the C-C bond. This is doubtless an important
factor, but Walsh (1948) has drawn attention to another factor acting in the opposite
direction. In the rather similar molecules glyoxal and diacetyl, even though conjugation
of the two carbonyl groups results in the shortening of the central C-C
bond, polarity effects offset a corresponding increase in the bond energy and dissociation
energy of the C-C link, which, in the case of diacetyl at least, is more
easily ruptured than the C-C link in ethane. Similarly in cyanogen, as predicted by
Walsh, the polarity of the C≡N groups is such as to withdraw electrons from the
C-C bond, so reducing the overlapping of bonding electrons and hence the bond
energy. A confirmatory indication is supplied by the stretching force constants of
the C-C bonds. Whereas those for C2H6 and CH3CN, as calculated by Linnett
(1940, 1941), are 4.53 and 5.3 × 10⁵ dyne/cm. respectively, the corresponding value
for C2N2, namely, 5.22 × 10⁵ dyne/cm. (Herzberg 1945), differs little from that for
CH3CN. We are here dealing with dissociation energies and not bond energies, but
even allowing for the difference in reorganization energies of the CH3 and CN radicals,
there is again no indication that D(NC-CN) will be much greater than D(CH3-CN).

These conclusions constitute one point of difference with Springall (1947) who
selects the high value 142 kcal. for D(NC-CN). Goldfinger (1947) also criticizes
Springall on this point. The other major point of difference is the value adopted for
D(CN), which will be discussed later.


The heat of formation of methylene

It would be relevant at this point to bring forward certain evidence regarding the 
heat of formation of methylene, CH2, since this provides an indication of the true



value of D(CO). The latter quantity is related to the heat of sublimation of carbon
(graphite) into ground-state atoms, L1, by the equation

D(CO) = L1 + 85.78 kcal.    (2)

Now, from equations given previously (Long & Norrish 1946c),

L1 = a + b + c + d − 226.1 kcal.

at 25° C, where a, b, c and d are the respective energies required for the processes:

CH4 → CH3 + H  −a,      CH3 → CH2 + H  −b,
CH2 → CH + H   −c,      CH → C(³P) + H  −d.

At 0° K the equation becomes

L1 = a + b + c + d − 222.4 kcal.    (3)

Owing to lack of information concerning the value of b, this method has not so far
been successfully applied to the evaluation of L1.

One method of evaluating the heat of formation of CH2 and hence b is provided
by the investigation of the homogeneous decomposition of methane. Failure to 
distinguish between the homogeneous and heterogeneous processes has led to con- 
troversy in the past. Many investigators have studied the heterogeneous decom- 
position, but Kassel (1932) seems to be the only one to have studied the homogeneous 
process. He carried out his experiments in a quartz vessel at pressures between 1
and 40 cm. in the temperature range 700 to 860° C. The reaction velocity was not 
appreciably affected by increasing the surface area twentyfold. The decomposition 
was found to be unimolecular with an activation energy of 79 ± Okcal., the final 
products being carbon, hydrogen and a trace of oily matter. There was, however, 
an unexplained induction period for some of the experiments. Of the two possible 
initial processes, 

CH4 → CH3 + H,    (4)

and    CH4 → CH2 + H2,    (5)

the former seems to be excluded on energetic grounds, since for a homogeneous
process the activation energy cannot be less than the energy absorbed, which for
process (4) is 101.5 kcal. at 0° K (see the previous section). The initial act must consequently
be process (5), a conclusion which is supported by Barrow, Pearson &
Purcell (1939). On the other hand, there is a fair amount of evidence to show that the
heterogeneous decomposition of methane is mainly concerned with process (4).
Thus Eltenton (1947) in mass-spectrograph experiments with methane observed
CH3 and not CH2; but with a deactivated (as distinct from an activated) surface and
a pressure below 0.3 mm. he also failed to observe CH3, even at 1100° C, an observation
which may in some way be connected with the induction period observed by
Kassel. Eltenton thus connects process (4) with the heterogeneous decomposition.*


* G. C. Eltenton, private communication. 




Kassel's activation energy for process (5), the one with which we are here concerned,
corresponds to 75 kcal. at 0° K, and −59 kcal. for the heat of formation of CH2:

                                        kcal.
CH4 → CH2 + H2                          −75
C(graphite) + 2H2 → CH4                 +15.987

C(graphite) + H2 → CH2                  −59

The reversible reactions

CH2 + CH4 ⇌ CH3 + CH3    (6)

and    CH2 + H2 ⇌ CH3 + H    (7)

have already been used by Wicke (1942, 1945) for deriving the heat of formation of
methylene. Experimental evidence for reaction (6) has been obtained by Rosenblum
(1941). Pearson, Purcell & Saigh (1938) also conclude that it occurs at high
temperatures, though not appreciably at room temperature since methane scarcely
affects the life of CH2. These facts are in harmony with its somewhat endothermic
nature, and imply an activation energy in the range 10 to 20 kcal. The experiments
of Rice & Glasebrook (1934) concerning the decomposition of diazomethane in the
presence of butane also lead to an activation energy of 15 ± 5 kcal. for the very similar
reaction ^ ^ ^ 


The reaction between two methyl radicals may follow the paths indicated below:

CH3 + CH3 → CH2 + CH4,    (8)
          → C2H4 + H2,    (9)
          → C2H5 + H,     (10)
          → C2H6.         (11)

Process (10) is endothermic and less likely than (8) or (9). This is borne out by the
experiments of Paneth, Hofeditz & Wunsch (1935) who examined the products
obtained from methyl radicals, both with helium as a carrier gas and in the absence
of a carrier gas. Process (10) can be eliminated since it would have resulted in the
formation of propane and butane, which was not observed. Further, since the
C2H4 : CH4 ratio was never too great to be explained entirely by process (8) (the
reverse of reaction (6)) followed by the recombination of CH2 to ethylene on the walls,
the indication is that reaction (9) is unimportant. Thus the main gas-phase reaction
appears to be the disproportionation process (8), with process (11) occurring on the
walls. Reactions between two radicals, unless they are endothermic, are usually
found to have low activation energies. However, that of reaction (8) is not negligible
since the decomposition of silver methyl (Semerano & Riccoboni 1941) produces
ethane only and no methane or ethylene at low temperatures, in contrast to higher
temperatures. Furthermore, Paneth et al. (1935) succeeded in increasing the half-life
of CH3 to about 0.1 sec. In accordance with the foregoing, 8 ± 4 kcal. would
appear to be a fair estimate of the activation energy of reaction (8).

Neither reaction (7) nor the back process requires a high activation energy. 
Rosenblum (1938, 1941) finds that methylene is much less stable towards hydrogen 
than towards methane. From this it follows that process (7) has a lower activation




energy than process (6), so that 10 ± 5 kcal. may be regarded as an approximate estimate
for the former. In addition, the temperature dependence observed by Rosenblum
for the formation of saturated hydrocarbons requires a low activation energy
for reaction (7). For the back reaction, which involves an atom and a free radical,
a very low activation energy is to be expected. Accordingly, Trenner, Morikawa &
Taylor (1937) postulate the process CH3 + D → CH2 + HD as a possible rapid step
in the deuterization of methane, the alternative process CH3 + D → CH2D + H being
regarded as unlikely by Gorin, Kauzmann, Walter & Eyring (1939). Steacie (1946)
accepts 5 kcal. as an upper limit for the activation energy of the reverse of reaction
(7), so that 2.5 ± 2.5 kcal. may here be regarded as a safe figure.

The differences between the respective activation energies of the forward and back
processes for the two reactions (6) and (7) lead to −7 and −7.5 kcal. respectively for
the approximate heats of reaction. These conclusions are similar to those of Wicke
(1945), and also receive support from Barrow et al. (1939) who consider both reactions
are nearly thermoneutral. These figures correspond to −77 and −78 kcal. respectively
for the heat of formation of CH2:

                                        kcal.
C(graphite) + 2H2 → CH4                 + 15.987
2CH3 → CH4 + CH2                        +  7
2CH4 → 2CH3 + 2H                        −203.0
2H → H2                                 +103.2

C(graphite) + H2 → CH2                  − 77


C(graphite) + 2H2 → CH4                 + 15.987
CH3 + H → CH2 + H2                      +  7.5
CH4 → CH3 + H                           −101.5

C(graphite) + H2 → CH2                  − 78


Finally, there is evidence from the ultra-violet absorption spectrum of ketene.
This exhibits a continuum over the range 2600 to 3700 A in which diffuse bands are
to be distinguished, this region corresponding to the photolytic decomposition
process CH2CO → CH2 + CO (Norrish, Crone & Saltmarsh 1933; Norrish 1934).
These results have received independent confirmation (Ross & Kistiakowsky 1934).
The quantum yield is nearly unity, but falls off near the long wave-length limit.
The point of onset of the continuum puts an upper limit of 77 kcal. on the energy
required to effect the photo-decomposition. The molecular heat of formation of
gaseous ketene may be calculated from the heat of reaction with dilute caustic soda
(Rice & Greenberg 1934) and thermochemical data for sodium hydroxide and sodium
acetate in solution (Bichowsky & Rossini 1936) to be 16 kcal. at 18° C, corresponding
to 14 kcal. at 0° K. Combining this figure with the heat of formation of carbon
monoxide (Wagman et al. 1945), the figure −91 kcal. is obtained as a lower limit
(algebraically speaking) for the heat of formation of CH2:


                                              kcal.
2C(graphite) + H2 + ½O2 → CH2CO(gas)          +14
CH2CO(gas) → CH2 + CO                         −77
CO → C(graphite) + ½O2                        −27.202

C(graphite) + H2 → CH2                        −91




Owing to the rather large inaccuracies involved, close agreement is not to be
expected between the results from the various methods of calculating the heat of
formation of CH2. All the evidence is compatible with the figure −70 ± 15 kcal., and
there is no indication of a decidedly different value. From this figure, b (equation (3))
is found to be 88 kcal.:

                                        kcal.
CH4 → C(graphite) + 2H2                 − 15.987
CH3 + H → CH4                           +101.5
H2 → 2H                                 −103.2
C(graphite) + H2 → CH2                  − 70

CH3 → CH2 + H                           − 88


The value of d, that is, D(CH), has been calculated by Herzberg (1939) from observations
of band spectra by Shidei (1936) to be 80.0 kcal., a value in excellent accord
with the expectations of Heimer (1932), and also with the general trend of C-H
bond energies as related to bond lengths and force constants (Walsh 1947a, 1948).
The value for c, as pointed out formerly (Long & Norrish 1946c), could as a first
approximation be put equal to d. From theoretical considerations, Voge (1936)
calculates that c is 0.05 eV in excess of d. Accordingly, 81 kcal. may be taken as the
'most probable' value for c, a figure not likely to be in error by many kilocalories.
These values, in conjunction with equations (2) and (3), lead to 128 kcal. for L1 and
214 kcal. for D(CO). The actual values given by the individual methods are as follows:




                                                      L1        D(CO)
                                                    (kcal.)    (kcal.)
(i)    homogeneous decomposition of CH4              117        203
(ii)   heat of reaction of CH2 + CH4 ⇌ 2CH3          135        221
(iii)  heat of reaction of CH2 + H2 ⇌ CH3 + H        136        222
(iv)   photo-decomposition of ketene                ≯149       ≯235


Methods (i), (ii) and (iii), individually as well as collectively, seem to be entirely
incompatible with the high value 256.1 and the low value 159.7 kcal. which have been
proposed for D(CO) (see table 1, p. 74).
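
The passage from a heat of formation of CH2 to b, L1 and D(CO) can be sketched numerically. The code below simply chains the cycle for b with equations (2) and (3), using the figures quoted in the text; the function names are illustrative, and the result for method (iv) is to be read as an upper limit.

```python
# Energies in kcal. at 0 deg K, as quoted in the text.
A = 101.5         # CH4 -> CH3 + H
C = 81.0          # CH2 -> CH + H (theoretically derived)
D = 80.0          # CH -> C(3P) + H
HF_CH4 = 15.987   # heat evolved in forming CH4 from graphite and hydrogen
D_H2 = 103.2

def b_from_hf_ch2(hf_ch2):
    """Energy absorbed in CH3 -> CH2 + H for a given heat of formation of CH2."""
    return HF_CH4 - A + D_H2 - hf_ch2

def l1_from_b(b):
    """Equation (3): heat of sublimation of graphite at 0 deg K."""
    return A + b + C + D - 222.4

def d_co_from_l1(l1):
    """Equation (2)."""
    return l1 + 85.78

hf_ch2_by_method = {"(i)": -59.0, "(ii)": -77.0, "(iii)": -78.0,
                    "(iv)": -91.0, "adopted": -70.0}

results = {}
for method, hf_ch2 in hf_ch2_by_method.items():
    l1 = l1_from_b(b_from_hf_ch2(hf_ch2))
    results[method] = (round(l1), round(d_co_from_l1(l1)))

print(results)
```

Because equations (2) and (3) are linear, an error of a few kilocalories in c or d shifts every entry by the same amount and leaves the comparison between methods unchanged.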

It needs at this point to be mentioned that a recent investigation by an effusion
method (Brewer, Gilles & Jenkins 1948) appears to indicate a relatively low vapour
pressure for graphite and hence support the high value for D(CO), which is in
definite conflict with the foregoing evidence. However, it has been pointed out
elsewhere (Long 1948) that the results of Brewer et al. are also in conflict with other
available evidence concerning the direct measurement of the equilibrium vapour
pressure and triple point of graphite, and that these investigators themselves make
observations concerning the rapid volatilization of graphite which are scarcely
compatible with their own temperature-pressure relation and its implied low
vapour pressure. Because of the importance of all direct evidence, further comments
are called for. Even when the first of the seven independent measurements which
Brewer et al. record is ignored as discordant, the remaining six results are seen to
indicate vapour pressures which, allowing for the slight temperature modulation,
vary among themselves by a factor of approximately 10. Yet, from the results of




varying the area of the effusion hole by a factor of only 2.77, it is concluded that the
accommodation coefficient for graphite is sufficiently large to ensure the attainment
of true equilibrium vapour pressures inside a crucible with an effusion hole of diameter
equal to one-quarter of the internal diameter of the crucible. The absence of
a more thorough investigation of this fundamental point, upon which the validity
of the results entirely depends, is obviously a serious omission, the more so because
unpublished work of Johnston & Marshall referred to by Herzberg (1942) indicates
an 'exceedingly small' accommodation coefficient. Brewer et al. further tacitly
assume that all of the carbon vapour impinging on a cold platinum surface adheres
to it, and that a true Maxwellian distribution for the vapour effusing (as demanded
by the effusion equation employed) is realized under conditions where the mean free
path is long compared with the dimensions of the enclosure. For a number of reasons,
therefore, it would be unwise to place great emphasis on these results before they are
confirmed by independent measurements of the equilibrium vapour pressure of
graphite, preferably over a wider temperature range. (These criticisms do not apply
to the determination of D(C2) also described by Brewer et al.)

Apart from this instance, the indication of the evidence discussed in this section,
namely that neither of the extreme values proposed for D(CO) is permissible,
receives support from the bulk of other direct evidence concerning the equilibrium
vapour pressure of graphite (Long 1948; Long & Norrish 1946b, c). The aforementioned
photolysis of ketene (method (iv)) is also particularly valuable evidence against the
high value, and may be compared with the observation of Faltings, Groth & Harteck
(1938) that ultra-violet radiation of wave-length 1295 A effects the photo-decomposition
of carbon monoxide with a quantum yield of unity, thus placing an upper
limit of 221 kcal. on D(CO), corresponding to 135 kcal. on L1. In this respect, Schmid
& Gerö (1946) have brought forward three independent arguments against an attempt
(Gaydon & Penney 1942) to reconcile the latter evidence with the high value
(256.1 kcal.) for D(CO). Chemically speaking, a value as high as this for the energy
required to rupture a double bond (Long & Walsh 1947) would in any case be
without precedent. Again, the process

CH4 → C⁺ + 4H + e⁻

is observed in electron-impact experiments at 26.7 ± 0.7 eV (Smith 1937), which
places an upper limit on the energy of atomization of methane and hence on L1.
From this and three other processes of like type referred to by Hagstrum (1947),
upper limits for L1 have been calculated, all of which lie in the range 131 to 141 kcal.
None of the evidence discussed in this section gives a sufficiently accurate figure to
be able to decide conclusively between the alternatives 210.75 and 221 kcal. which
have also been suggested for D(CO) (see table 1), although it is rather difficult to
reconcile that from method (i) with the latter alternative. It is emphasized that the
various calculations of L1 and D(CO) by methods (i), (ii), (iii) and (iv) all depend
upon a theoretically derived value for the step CH2 → CH + H, but are otherwise
entirely experimental. An error of several kilocalories here would not alter the general
conclusions, which are in complete accord with the evidence cited previously (Long
& Norrish 1946a, b, c); nor likewise would the eventuality, discussed as a possibility




by Walsh (1947b), that the triplet state of CH2 should turn out to be about 5 kcal.
lower in energy than the singlet state (Voge's calculation of the quantity c referring
to the latter).


The relationship between D(CO), D(CN) and D(N2)


From the derived heat of formation of CN, the value of x in equation (1) may be
evaluated at 178.3 kcal. as follows:

                                              kcal.
CO(X ¹Σ) → C(³P) + O(³P)                      −D(CO)
C(³P) + N(⁴S) → CN(X ²Σ)                      +D(CN)
CN(X ²Σ) → C(graphite) + ½N2                  + 92.5
O(³P) → ½O2                                   + 58.6
C(graphite) + ½O2 → CO(X ¹Σ)                  + 27.202

D(CO) = D(CN) − ½D(N2) + 178.3    (1a)

Equation (1a) is entirely experimental.

The spectrum of the CN radical has been examined by several investigators, but
most recently and in greatest detail by Schmid, Gerö & Zemplén (1938). By following
a series of perturbations, the A ²Π state has been observed up to the 30th vibrational
sublevel. A short extrapolation leads to a convergence limit at 60,500 cm.⁻¹ or
172.8 kcal. above the ground state of CN. The convergence of the upper B ²Σ state
is extrapolated to 65,500 cm.⁻¹, that is, 5000 cm.⁻¹ higher, the stated accuracy in
each case being ± 1000 cm.⁻¹.

Since neither carbon nor nitrogen has an excited level in the neighbourhood of
5000 cm.⁻¹ above the respective ground state, the nearest being C: ¹D at 10,192 cm.⁻¹,
it would follow that the A ²Π state of CN is not derived from ground-state atoms.
Accordingly, Schmid et al. (1938) conclude that the relevant combination of atomic
states is C: ³P + N: ²P. If this interpretation is correct, the energy required for the
process

CN(X ²Σ) → C(³P) + N(⁴S)

is 31,660 cm.⁻¹ or 90.5 kcal., in contradistinction to the much higher value adopted
by Springall (1947). (Employing the newly observed level for the ⁵S state of carbon
(Shenstone 1947), the value obtained from the B ²Σ convergence to C: ⁵S + N: ⁴S
at 65,500 cm.⁻¹ differs from this by only ~100 cm.⁻¹.) Substituting this value for
D(CN) in equation (1a), the latter becomes:

D(CO) = 268.8 kcal. − ½D(N2).    (1b)

The values of D(CO) and D(N2) which are at present receiving support are given
in table 1. For the sake of consistency, the spectroscopic values here quoted have been
recalculated from the predissociation limits at 89,620 ± 60 cm.⁻¹ for CO (Schmid &
Gerö 1935; Gerö 1936) and 97,960 ± 40 cm.⁻¹ for N2 (Büttenbender & Herzberg 1934)
by subtracting the relevant excitation energies and converting to kilocalories. The
conversion factors employed are those compiled by Herzberg (1944) from the review
of Birge (1941). The figures quoted for Valatin are based on the work of Schmid &
Gerö (1937, 1943). In the case of CO, these authors consider that the products of




dissociation for the limit at 89,620 cm.⁻¹ are either C: ⁵S + O: ³P or C: ³P + O: ¹S.
The difference amounts to only a few cm.⁻¹, the figure quoted here being based on
the former assumption. Hagstrum's figures are the electron-impact values, that for
nitrogen being essentially the same as Herzberg's spectroscopic value.


Table 1 



                        D(CO)       D(N2)
                       (kcal.)     (kcal.)
Gaydon (1947)           256.1       225.0
Hagstrum (1947)        ~221        ~171
Herzberg (1939)         210.75      170.1
Valatin (1946a)         159.7       115.1


Of the possible values listed in table 1, only two pairs will provide a reasonable
satisfaction to equation (1b), namely,

D(CO) = 210.75 and D(N2) = 115.1;

D(CO) = 159.7 and D(N2) = 225.0.

(In neither case does the discrepancy with the equation amount to as much as
4 kcal.)
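
The selection of these two pairs can be reproduced by testing every combination of the table 1 values against equation (1b). The sketch below treats Hagstrum's approximate figures as exact and takes 4 kcal. as the acceptance threshold, in line with the remark above; the names are illustrative.

```python
# Values from table 1, in kcal.
D_CO = {"Gaydon": 256.1, "Hagstrum": 221.0, "Herzberg": 210.75, "Valatin": 159.7}
D_N2 = {"Gaydon": 225.0, "Hagstrum": 171.0, "Herzberg": 170.1, "Valatin": 115.1}

def discrepancy(d_co, d_n2):
    """Departure of a pair from equation (1b): D(CO) = 268.8 - D(N2)/2."""
    return d_co - (268.8 - d_n2 / 2)

acceptable_pairs = [
    (d_co, d_n2)
    for d_co in D_CO.values()
    for d_n2 in D_N2.values()
    if abs(discrepancy(d_co, d_n2)) < 4.0
]

for d_co, d_n2 in acceptable_pairs:
    print(d_co, d_n2, round(discrepancy(d_co, d_n2), 2))
```

Every other combination departs from the equation by nearly 10 kcal. or more, so the choice of threshold is not critical.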

This result is surprising and brings certain difficulties with it. Whereas, from the
evidence already discussed, 210.75 kcal. is a permissible value for D(CO), the low
value 115.1 kcal. for D(N2) is difficult to accept on the grounds that it leads to impossibly
low dimensions for the bond energies in many nitrogen compounds, particularly
that for the N-N bond energy in hydrazine. On the other hand, the
alternative pair of values entails the very low figure 73.9 kcal./g.-atom for the heat of
atomization of graphite, a figure at serious variance with the evidence already brought
forward, as well as with direct measurement of the vapour pressure of carbon, which
indicates a much higher value.

This forces us to look once again at the means we have employed in arriving at
these alternative conclusions. In calculating equation (1b) three experimental values
have been adopted, namely those for D(O2), D(CN) and the heat of formation of the
CN radical. Since the last-named value has been derived by three independent
methods which agree to within the limits of experimental error, both among themselves
and with the calculations of Glockler (1948a), and since the value of D(O2)
appears to be satisfactorily established, of the three quantities adopted that for
D(CN) is the most likely to be in error. Although the scientific world has not yet been
presented with an alternative analysis of the CN spectrum comparable in detail
with that given a decade ago by Schmid et al. (1938), it would be useful at this point
to reverse the calculations we have made and deduce from equation (1a) the dimension
of D(CN) corresponding to each of the possible combinations of D(CO) and
D(N2). The results are given in table 2. The possible values of D(CN) obtained from
the lowest known convergence limit at 60,500 cm.⁻¹ by assuming different products
of dissociation are given in table 3.





Table 2. Values of D(CN) calculated from possible combinations
of D(CO) and D(N₂) (in kcal.)

                              D(N₂)
   D(CO)        225.0         170.1         115.1
   256.1      (a) 190.3     (b) 162.8     (c) 135.3
   221        (d) 155       (e) 128       (f) 100
   210.75     (g) 144.9     (h) 117.5     (i)  90.0
   159.7      (j)  93.9     (k)  66.4     (l)  38.9
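
The entries of table 2 can be cross-checked numerically. The sketch below is only an illustration: it assumes, from the regular spacing of the tabulated entries, that the relation has the linear form D(CN) = D(CO) + ½D(N₂) − K, and it fits the constant K to case a. Both the linear form and the value of K are inferences from the table itself, not a quotation of equation (16).

```python
# Cross-check of table 2, assuming D(CN) = D(CO) + D(N2)/2 - K (all in kcal.).
# The linear form and the constant K are inferred from the tabulated entries.

D_CO = [256.1, 221.0, 210.75, 159.7]   # rows of table 2
D_N2 = [225.0, 170.1, 115.1]           # columns of table 2

# Tabulated D(CN) for cases a to l, read row by row.
TABLE_2 = {
    "a": 190.3, "b": 162.8, "c": 135.3,
    "d": 155.0, "e": 128.0, "f": 100.0,
    "g": 144.9, "h": 117.5, "i": 90.0,
    "j": 93.9,  "k": 66.4,  "l": 38.9,
}

# Constant fitted to case a (an assumption made for this sketch).
K = D_CO[0] + D_N2[0] / 2 - TABLE_2["a"]


def d_cn(d_co, d_n2):
    """D(CN) implied by a chosen pair of D(CO) and D(N2)."""
    return d_co + d_n2 / 2 - K


def recompute_table():
    """Recompute all twelve cases in the order a, b, c, ..., l."""
    labels = iter("abcdefghijkl")
    return {next(labels): d_cn(co, n2) for co in D_CO for n2 in D_N2}


if __name__ == "__main__":
    for case, value in recompute_table().items():
        print(f"case {case}: computed {value:6.1f}, tabulated {TABLE_2[case]:6.1f}")
```

If the assumed form is right, every recomputed entry should agree with the tabulated one to within the rounding of the electron-impact figures (a few tenths of a kcal.).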


Table 3. Possible values of D(CN) from the convergence
limit at 60,500 ± 1000 cm.⁻¹

     dissociation products         D(CN)
     carbon        nitrogen       (kcal.)
       ³P             ⁴S           172.8
       ¹D             ⁴S           143.8
       ³P             ²D           117.9
       ¹S             ⁴S           111.0
       ³P             ²P            90.5


The evidence discussed here in the section on the heat of formation of methylene
points unequivocally against cases a, b, c, j, k and l (table 2). A comparison of tables
2 and 3 shows that cases a, k and l are doubly ruled out, the former because the highest
possible value of D(CN) is too low, and the latter two for precisely the reverse reason,
there being no likelihood that CN dissociates into products which are more highly
excited than those suggested by Schmid et al. (1938). It would thus seem, quite
independently of the correct dimension of D(CN), that the values for D(CO) and
D(N₂) proposed by Gaydon are not mutually compatible, nor likewise those favoured
by Valatin. Among the main grounds of support for case a have been theoretical
ones based on the Non-crossing Rule (Gaydon & Penney 1945). But a rigid
application of this rule involves an identification of the potential curves of the unperturbed
electronic states of the molecules concerned with those of the eigenvalues provided
by the classical two-centre model. As soon as the relative motions of the nuclei are
considered, this identification is no longer permissible (Valatin 1946b). It is therefore
not surprising that the Non-crossing Rule has led to incorrect conclusions, especially
as other diatomic molecules are known which do not conform to it.

Of the remaining six cases, only g, h and i provide values for D(CN) which are
close to any of the possible values given in table 3. Cases i and f are unsatisfactory
because, as already mentioned, the low value for D(N₂) does not give a reasonable
figure for the N–N bond energy in hydrazine. To get round this difficulty it would
be necessary to assume that the hydrazine molecule were derived from excited
nitrogen atoms. It is not possible to make an immediate decision between
possibilities g and h. The spectroscopic value 170.1 kcal. for D(N₂) is that which receives
support from electron-impact experiments. Furthermore, the alternative figure
225 kcal. implies a high value for D(NO) which Flory & Johnston (1946) have
criticized, mainly because it is barely to be reconciled with the results of experiments on
the photo-decomposition of nitric oxide. Nor is it compatible with the appearance




potential of N⁺ from nitric oxide (Hagstrum 1948). On the other hand, considerations
concerning the relative energies of nitrogen bonds have led certain investigators to
prefer 225 kcal. for D(N₂) (Skinner 1945; Glockler 1948b). But in this connexion it
should be noted that in his semi-empirical relationships Glockler compares other
properties of carbon bonds with bond energies calculated from the ³P state of carbon.

In both cases g and h, however, there is a difficulty with respect to the CN
spectrum. If D(CN) is 117.9 kcal., which would follow from case h, the products of
dissociation at the convergence limit at 60,500 cm.⁻¹ must be C: ³P + N: ²D. This
identification of products was tentatively suggested in the appendix of an earlier
paper (Long & Norrish 1946c), and supported on the additional grounds that the
upper convergence at 65,500 cm.⁻¹ could be assigned the products C: ⁵S + N: ⁴S
provided the excitation energy of C: ⁵S were about 24,200 cm.⁻¹ or 69.2 kcal. This
value for the C: ⁵S excitation energy, against which there were at the time no
published experimental data, seemed to provide a confirmation of the thermochemical
estimate of the energy level of the tetravalent state of carbon, namely 66 ± 10 kcal.
The recent observation of the ⁵S level by Shenstone (1947) at 96.4 kcal., however,
apart from appearing to prohibit the identification of this thermochemical estimate
with the energy of the C: ⁵S–³P transition, creates a real difficulty when 117.9 kcal.
is considered for D(CN), since there is no combination of atomic terms which fits the
upper convergence of CN at 65,500 ± 1000 cm.⁻¹, or lies even near to it. This argument
applies with even greater force to the possibility that D(CN) is 143.8 kcal. (case g).

The reason for this seeming incompatibility is not clear, but, in the event that
D(CN) has the value 117.9 kcal. (case h), the possibility arises that the upper B ²Σ
state of CN is derived from a hybridized tetravalent carbon atom with an internal
energy around 69 kcal. above the ground state, this maintaining its identity in CN
up to a high vibrational level, in spite of the fact that it does not correspond to a single
atomic state. In this case the difficulty caused by the apparent convergence at
65,500 cm.⁻¹ would be removed. In any eventuality, some revision of the dissociation
scheme of CN due to Schmid et al. seems to be called for, even though table 2 supports
the contention of these investigators that CN does not dissociate into ground-state
atoms at 172.8 kcal.

Cases d and e both presuppose an error of some 10 kcal. or more in the heat of
formation derived for the CN radical, which is more than is likely. It thus appears,
quite apart from the additional objections of Valatin (1948c), that the electron-impact
value 221 kcal. for D(CO) is less probable than the spectroscopic value
210.75 kcal., but it is not possible in our present state of knowledge to decide
definitely against it.


Conclusions


The calculation of the heat of formation of free CN by a number of independent
methods provides a relationship between
D(CO), D(CN) and D(N₂), as well as an unambiguous value for D(NC–CN).

The three spectroscopic values for the dissociation energies of CO, CN and N₂
favoured by Valatin (1946a) are neither mutually compatible nor individually
acceptable.




The spectroscopic values for D(CO) and D(N₂) preferred by Gaydon (1947) by
reason of the Non-crossing Rule are likewise mutually incompatible.

The most probable value for D(CO) is 210.75 kcal., corresponding to 125.0 kcal.
for the heat of atomization of graphite at 0° K. This value is supported by independent
lines of evidence concerning the heat of formation of methylene and its bearing on
the energy involved in the stepwise dissociation of methane. The evidence examined
here is decidedly incompatible with the high value 256.1 kcal. and the low value
159.7 kcal. for D(CO). The electron-impact value 221 kcal. appears as a less likely
alternative to the spectroscopic value 210.75 kcal.

Of the three values proposed for D(N₂), 115.1 kcal. appears to be definitely too
low to be a possible figure. The relationship between D(CO), D(CN) and D(N₂)
cannot decide between 170.1 and 225.0 kcal. until D(CN) is definitely fixed.

From the convergence of CN at 60,500 cm.⁻¹, D(CN) is either 117.9 or 143.8 kcal.,
according to whether D(N₂) is 170.1 or 225.0 kcal.

The author’s grateful acknowledgments are due to Professor R. G. W. Norrish, 
Dr A. D. Walsh and Dr G. C. Eltenton for helpful correspondence regarding 
certain of the points discussed in this paper. 

Appendix 

Towards the unequivocal fixing of these important quantities, in particular the 
heats of atomization of carbon and nitrogen, it would be useful to close the paper by 
enumerating some of the ways by which the problems might be profitably approached : 

(i) a precise determination of the limiting wave-length capable of effecting the 
photo-decomposition of carbon monoxide; 

(ii) further direct measurement of the vapour pressure of graphite, including the
determination of the maximum temperatures to which it is possible to heat a graphite
rod electrically under varying pressures;

(iii) a calculation of the lattice energy of diamond (this would presumably provide
the heat of atomization to carbon atoms in the tetravalent state);

(iv) the application of a modified Trouton's Rule to graphite;

(v) a determination of the heat of formation of diazomethane which, in
conjunction with the energy absorbed in the photo-decomposition (Kirkbride & Norrish
1933), would provide an upper limit for the heat of atomization of graphite, as in the
similar case of ketene;

(vi) further photochemical studies of simple cyanides, preferably in conjunction
with a further analytical study of the CN spectrum;

(vii) combined photochemical and thermochemical studies of a number of simple
molecules such as C₂H₂, C₂Cl₂, HN₃, NH₃, N₂H₄ and the recently discovered N₂F₂;

(viii) further studies of the homogeneous thermal decomposition of methane
and ammonia;

(ix) determinations of the heats of reaction of processes involving CH₃, CH₂,
CH, NH₂, NH and CN radicals.

Note added 23 December 1948. Since the completion of this paper, other articles 
have appeared which have a close bearing on its contents. A revision of an earlier 




theoretical treatment of the stepwise dissociation of methane in the light of more
recent experimental data has been given by Voge (1948), who concludes that the
energies required for the successive removal of hydrogen atoms are a = 101, b = 90,
c = 80 and d = 80 kcal., values not involving a priori assumptions concerning the
heat of atomization of carbon. In no case do these figures differ by more than 1 or
2 kcal. from the values adopted here. This provides from a theoretical angle excellent
support for the magnitude of b (~88 kcal.) derived entirely from experimental data
in the present paper. (The alternative figures of Walsh (1947b) involving a triplet
ground state for CH₂, namely a = 101, b = 86, c = 85 and d = 80 kcal., do not differ
greatly from those of Voge.)

Quite apart from his own considerations, Voge also cites data concerning the
appearance potential of CH₂⁺ from methane (Smith 1937) and the ionization energy
of CH₂ (Langer & Hipple 1946), from which the heat of formation of CH₂ at room
temperature may be calculated to be −70 kcal., as follows:

                                         kcal.
CH₄ → CH₂⁺ + H₂ + e⁻                     −362
CH₂⁺ + e⁻ → CH₂                          +274
C(graphite) + 2H₂ → CH₄                  +17.9
C(graphite) + H₂ → CH₂                   −70

This corresponds to −71 kcal. at 0° K and is further experimental evidence in
support of the figure 70 kcal. already derived in a number of independent ways.
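
The arithmetic of this cycle is simple enough to verify directly. The short sketch below merely sums the three heats of reaction quoted above, using the sign convention of the text; the names `STEPS` and `heat_of_formation_ch2` are introduced here for illustration only.

```python
# Heats of reaction in kcal., with the sign convention of the text.
# Adding the three steps cancels CH4, CH2+ and the electron, leaving
# C(graphite) + H2 -> CH2.
STEPS = [
    ("CH4 -> CH2+ + H2 + e-", -362.0),
    ("CH2+ + e- -> CH2", +274.0),
    ("C(graphite) + 2 H2 -> CH4", +17.9),
]


def heat_of_formation_ch2(steps=STEPS):
    """Sum the heats of the individual steps of the cycle."""
    return sum(heat for _, heat in steps)


if __name__ == "__main__":
    print(f"C(graphite) + H2 -> CH2: {heat_of_formation_ch2():.1f} kcal.")
```

The sum is −70.1 kcal., which rounds to the figure of −70 kcal. given in the text.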

In direct conflict with the foregoing, a paper by Gerö (1948) on the stepwise
dissociation of methane and a number of articles by Valatin (1948a, b, d) on allied
topics extend the former contention of Schmid & Gerö (1937), already referred to
here, that (with modern conversion factors) D(CO) = 159.7 and L₁ = 73.9 kcal.
Bearing in mind that a and d are experimentally fixed, this would mean that in the
dissociation scheme of methane the sum of b and c is more than 50 kcal. less than
that adopted both in the present paper and in the paper of Voge (1948). However,
by making the arbitrary and unnatural assumptions (i) that even in the CH radical
carbon always remains in the tetravalent state, and (ii) that only ⁵S carbon
atoms and not atoms in the ground state participate in the equilibrium present
in carbon vapour, the values 100.76, 97.36, 97.02 and 97.02 kcal. are obtained for
the respective energies required to remove successive hydrogen atoms in methane
(the second, third and fourth of these quantities being ultimately defined in a different
manner from b, c and d as employed in this paper). The close agreement of the first
figure with the experimental value of D(CH₃–H) is obtained only at the expense
of further tacitly assuming (iii) that the C–H bond energies in CH₃ and CH₂ groups
within saturated organic molecules are identical with those in free CH₃ and free CH₂
respectively. This last assumption may involve serious errors and is in itself
misleading, since it means, for example, that there remains no evidence to support the
contention that the process

CH(⁴Σ) → C(⁵S) + H

absorbs 97.02 kcal., as is asserted. Again, it also involves identifying bond energies
with dissociation energies, for example the C–C bond energy in ethane with



D(CH₃–CH₃), which is not permissible (Long & Norrish 1946c). With regard to
assumption (i), it is hard to see why the carbon in CH, which involves only a
single bonding electron from the carbon atom, should not revert to the divalent
state, there being no kinetic, spectroscopic or energetic evidence to suggest that
it remains tetravalent. Assumption (ii), in order to explain the discrepancy with
observed equilibrium vapour-pressure measurements, involves the further
improbable assumption that the carbon atoms are in equilibrium with a hypothetical
excited singlet state of the C₂ molecule, which cannot revert to the triplet ground
state and is regarded as possessing a quadruple bond (!) whose energy is estimated
to be 185 kcal. by extrapolation. However, these assumptions concerning strongly
metastable atoms and molecules meet with serious objections, since the atomic
lines of the ⁵S–³P transition have been observed in emission (Shenstone 1947) and
the molecular lines emitted by carbon vapour inside a graphite tube below 3000° K
are the ordinary triplet-triplet Swan bands (Brewer et al. 1948). Without these
admittedly ingenious if extravagant assumptions which the Budapest school have
introduced to defend a not unequivocal interpretation of the CO spectrum, there
appears to be little if any independent evidence to support their interpretation.
These assumptions alone are in any case not sufficient to reconcile the interpretation
with all known facts, including the experimental data discussed in the present paper.


References 

Badger, R. M. & Woo, S.-C. 1931 J. Amer. Chem. Soc. 53, 2572.
Barrow, R. F., Pearson, T. G. & Purcell, R. H. 1939 Trans. Faraday Soc. 35, 880.
Berthelot, M. 1875 Ann. Chim. (Phys.), (v), 5, 433.
Berthelot, M. 1881 Ann. Chim. (Phys.), (v), 23, 262.
Bichowsky, F. R. & Rossini, F. D. 1936 The thermochemistry of the chemical substances.
New York: Reinhold.
Birge, R. T. 1941 Rep. Progr. Phys. 8, 90.
Brewer, L., Gilles, P. W. & Jenkins, F. A. 1948 J. Chem. Phys. 16, 797.
Büttenbender, G. & Herzberg, G. 1934 Ann. Phys., Lpz., (v), 21, 677.
Cherton, R. 1942 Bull. Soc. Sci. Liège, 11, 203.
Cherton, R. 1943 Bull. Soc. Chim. Belg. 52, 26.
Cutler, J. A. 1948 J. Chem. Phys. 16, 136.
Eltenton, G. C. 1947 J. Chem. Phys. 15, 455.
Faltings, K., Groth, W. & Harteck, P. 1938 Z. phys. Chem. B, 41, 15.
Flory, P. J. & Johnston, H. L. 1946 J. Chem. Phys. 14, 212.
Funke, G. W. & Lindholm, E. 1937 Z. Phys. 106, 518.
Gaydon, A. G. 1947 Dissociation energies and spectra of diatomic molecules. London: Chapman
and Hall.
Gaydon, A. G. 1948 Nature, 161, 731.
Gaydon, A. G. & Penney, W. G. 1942 Nature, 150, 406.
Gaydon, A. G. & Penney, W. G. 1945 Proc. Roy. Soc. A, 183, 374.
Gerö, L. 1936 Z. Phys. 100, 374.
Gerö, L. 1948 J. Chem. Phys. 16, 1011.
Giauque, W. F. 1931 J. Amer. Chem. Soc. 53, 507.
Gillespie, L. J. & Fraser, L. H. D. 1936 J. Amer. Chem. Soc. 58, 2260.
Glockler, G. 1948a J. Chem. Phys. 16, 600.
Glockler, G. 1948b J. Chem. Phys. 16, 602.
Goldfinger, P. 1947 Bull. Soc. Chim. Belg. 56, 282.
Gordon, A. R. 1937 J. Chem. Phys. 5, 30.
Gordon, A. R. & Barnes, C. 1933 J. Chem. Phys. 1, 692.




Gorin, E., Kauzmann, W., Walter, J. & Eyring, H. 1939 J. Chem. Phys. 7, 633.
Hagstrum, H. D. 1947 Phys. Rev. (ii), 72, 947.
Hagstrum, H. D. 1948 J. Chem. Phys. 16, 848.
Heim, G. 1933 Bull. Soc. Chim. Belg. 42, 467.
Heimer, T. 1932 Z. Phys. 78, 771.
Herzberg, G. 1939 Molecular spectra and molecular structure. I. Diatomic molecules. New
York: Prentice-Hall.
Herzberg, G. 1942 J. Chem. Phys. 10, 306.
Herzberg, G. 1944 Atomic spectra and atomic structure (2nd ed.). New York: Dover.
Herzberg, G. 1945 Infrared and Raman spectra of polyatomic molecules. New York: Van
Nostrand.
Herzberg, G. & Scheibe, G. 1930 Z. phys. Chem. B, 7, 390.
Hogness, T. R. & Ts'ai, L.-S. 1932 J. Amer. Chem. Soc. 54, 123.
Kassel, L. S. 1932 J. Amer. Chem. Soc. 54, 3949.
Ketelaar, J. A. A. & Kruyer, S. 1943 Rec. Trav. chim. Pays-Bas, 62, 550.
Kharasch, M. S. 1929 Bur. Stand. J. Res., Wash., 2, 359.
Kirkbride, F. W. & Norrish, R. G. W. 1933 J. Chem. Soc. p. 119.
Kistiakowsky, G. B. & Gershinowitz, H. 1933 J. Chem. Phys. 1, 432.
Kistiakowsky, G. B. & Van Artsdalen, E. R. 1944 J. Chem. Phys. 12, 469.
Langer, A. & Hipple, J. A. 1946 Phys. Rev. (ii), 69, 691.
Lemoult, P. 1909 C.R. Acad. Sci., Paris, 148, 1602.
Linnett, J. W. 1940 J. Chem. Phys. 8, 91.
Linnett, J. W. 1941 Trans. Faraday Soc. 37, 469.
Long, L. H. 1948 J. Chem. Phys. 16, 1087.
Long, L. H. & Norrish, R. G. W. 1946a Nature, 157, 486.
Long, L. H. & Norrish, R. G. W. 1946b Nature, 158, 237.
Long, L. H. & Norrish, R. G. W. 1946c Proc. Roy. Soc. A, 187, 337.
Long, L. H. & Walsh, A. D. 1947 Trans. Faraday Soc. 43, 342.
Mooney, R. B. & Reid, H. G. 1931 Nature, 128, 271.
Mooney, R. B. & Reid, H. G. 1932 Proc. Roy. Soc. Edinb. 52, 152.
Neuimin, H. H. & Terenin, A. N. 1936 Acta physicochim. U.R.S.S. 5, 465.
Norrish, R. G. W. 1934 Trans. Faraday Soc. 30, 103.
Norrish, R. G. W., Crone, H. G. & Saltmarsh, O. 1933 J. Chem. Soc. p. 1533.
Paneth, F. A., Hofeditz, W. & Wunsch, A. 1935 J. Chem. Soc. p. 372.
Pearson, T. G., Purcell, R. H. & Saigh, G. S. 1938 J. Chem. Soc. p. 409.
Prosen, E. J., Pitzer, K. S. & Rossini, F. D. 1945 Bur. Stand. J. Res., Wash., 34, 403.
Rice, F. O. & Glasebrook, A. L. 1934 J. Amer. Chem. Soc. 56, 2381.
Rice, F. O. & Greenberg, J. 1934 J. Amer. Chem. Soc. 56, 2268.
Robertson, N. C. & Pease, R. N. 1942a J. Chem. Phys. 10, 490.
Robertson, N. C. & Pease, R. N. 1942b J. Amer. Chem. Soc. 64, 1880.
Rosenblum, C. 1938 J. Amer. Chem. Soc. 60, 2819.
Rosenblum, C. 1941 J. Amer. Chem. Soc. 63, 3322.
Ross, W. F. & Kistiakowsky, G. B. 1934 J. Amer. Chem. Soc. 56, 1112.
Rossini, F. D., Wagman, D. D., Evans, W. H., Blau, E. J. & Levine, S. 1947 Selected values
of chemical thermodynamic properties. Washington: National Bureau of Standards.
Schmid, R. F. & Gerö, L. 1935 Z. Phys. 93, 656.
Schmid, R. F. & Gerö, L. 1937 Z. phys. Chem. B, 36, 105.
Schmid, R. F. & Gerö, L. 1943 Matematikai és Természettudományi Értesítő, 62, 416.
Schmid, R. F. & Gerö, L. 1946 Proc. Phys. Soc. 58, 701.
Schmid, R. F., Gerö, L. & Zemplén, J. 1938 Proc. Phys. Soc. 50, 283.
Semerano, G. & Riccoboni, L. 1941 Z. phys. Chem. A, 189, 203.
Shenstone, A. G. 1947 Phys. Rev. (ii), 72, 411.
Shidei, T. 1936 Jap. J. Phys. 11, 23.
Skinner, H. A. 1945 Trans. Faraday Soc. 41, 645.
Smith, L. G. 1937 Phys. Rev. (ii), 51, 263.
Springall, H. D. 1947 Trans. Faraday Soc. 43, 177.
Steacie, E. W. R. 1946 Atomic and free radical reactions. New York: Reinhold.
Stevenson, D. P. 1939 J. Chem. Phys. 7, 171.
Terenin, A. N. & Neuimin, H. H. 1934 Nature, 134, 265.




Terenin, A. N. & Neuimin, H. H. 1935 J. Chem. Phys. 3, 436.
Thompson, H. W. 1941 Trans. Faraday Soc. 37, 344.
Thomsen, J. 1905 Z. phys. Chem. 52, 343.
Trenner, N. R., Morikawa, K. & Taylor, H. S. 1937 J. Chem. Phys. 5, 203.
Valatin, J. G. 1946a J. Chem. Phys. 14, 568.
Valatin, J. G. 1946b Proc. Phys. Soc. 58, 695.
Valatin, J. G. 1948a Rev. sci., Paris, 86, 135.
Valatin, J. G. 1948b J. chim. phys. 45, 123.
Valatin, J. G. 1948c Phys. Rev. (ii), 74, 350.
Valatin, J. G. 1948d J. Chem. Phys. 16, 1018.
Voge, H. H. 1936 J. Chem. Phys. 4, 581.
Voge, H. H. 1948 J. Chem. Phys. 16, 984.
Wagman, D. D., Kilpatrick, J. E., Taylor, W. J., Pitzer, K. S. & Rossini, F. D. 1945 Bur.
Stand. J. Res., Wash., 34, 143.
Walsh, A. D. 1947a Trans. Faraday Soc. 43, 60.
Walsh, A. D. 1947b Faraday Soc. Discussions, 2, 18.
Walsh, A. D. 1948 J. Chem. Soc. p. 398.
White, J. U. 1940 J. Chem. Phys. 8, 502.
Wicke, E. 1942 Ergebn. exakt. Naturw. 20, 1.
Wicke, E. 1945 Die Chemie, 58, 16.
Yakovleva, A. V. 1938 Acta physicochim. U.R.S.S. 9, 665.


The behaviour of waves on tidal streams 

By N. F. Barber, The Admiralty Research Laboratory, Teddington, Middx.
(Communicated by G. E. R. Deacon, F.R.S. – Received 4 November 1948)

The paper discusses the manner in which waves change their characteristics when they pass
through regions where the water has a streaming motion. The treatment applies to tidal
streams, the velocity of which depends both on time and position. Some experimental
evidence is provided in support of the theory.


Introduction 

Since early in 1945 wave records from the coast of Cornwall have been submitted
to frequency analysis for the purpose of studying the generation and propagation
of waves and swell. Some results have been reported by Barber & Ursell (1948).
The evidence indicates that in a storm, trains of waves are generated of a variety
of wave-lengths up to a maximum wave-length depending on the wind strength
and that each wave train advances across the ocean with a speed approximately
equal to gT/4π, which is the group velocity indicated by theory for waves whose
period is T. The separation between the longest and shortest waves travelling in
the same direction must therefore increase with the distance travelled from the
generating area, and because of this dispersion the first wave trains to arrive at
a distant recording station will exhibit a long natural period. Wave trains of
shorter period arrive later, and it is to be expected that the swell arriving at the
coast will show a period which decreases continuously with time.
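
The dispersion described above can be illustrated with a few lines of arithmetic. The sketch below evaluates the deep-water group velocity gT/4π and the corresponding travel time; the value of g and the distance of 2000 km are assumptions chosen for illustration and are not taken from the observations.

```python
import math

G = 9.81  # acceleration due to gravity in m/s^2 (assumed value)


def group_velocity(period_s):
    """Deep-water group velocity gT/(4*pi) in m/s for a wave period in seconds."""
    return G * period_s / (4.0 * math.pi)


def travel_time_hours(distance_km, period_s):
    """Time in hours for a wave train of the given period to cover the distance."""
    return distance_km * 1000.0 / group_velocity(period_s) / 3600.0


if __name__ == "__main__":
    distance_km = 2000.0  # hypothetical distance from the generating area
    for period in (24.0, 20.0, 16.0, 14.0):
        hours = travel_time_hours(distance_km, period)
        print(f"T = {period:4.1f} sec.: group velocity "
              f"{group_velocity(period):5.2f} m/s, arrival after {hours:5.1f} hr.")
```

The longest periods arrive first, so that the period recorded at a fixed station falls steadily as the swell from a single storm passes.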



The frequency spectra of waves recorded at Pendeen and Perranporth illustrate
this behaviour, but they show that the period of the swell arriving at the coast
does not vary exactly in accordance with the simple theory. The swell from a
distant storm is observed to have a period which fluctuates by as much as ± 1 sec.
in cycles of 12½ hr. about the smooth curve drawn through the observations to
represent the general trend towards shorter periods. This fluctuation is evident in
the curves of figure 1, which are based on observations made about the time of
spring tides.



Figure 1. The maximum and minimum periods limiting the 24 to 14 sec. frequency band in
the wave spectra of 14 to 19 May 1945 at Perranporth.


The fluctuation in period has been attributed to the effect of tidal streams 
through which the swell had to travel in the last 200 miles of its journey to the 
coast. The present aim is to discuss this effect in more detail. 

Theory of the steady state 

Unna (1942) has discussed the steady state of a wave train entering streams
whose velocity does not change with time. He shows that where the stream has
a velocity u the waves may be expected to exhibit a new wave-length λ and
velocity c given by the rule

$$\lambda_0/c_0 = \lambda/(c+u),$$

where the zero subscript denotes the values of these quantities for that part of
the wave train which is on slack water. The equality is obtained by assuming that
an equal number of wave crests per unit time must pass any fixed observer. The
argument presumes that it is physically possible to have a train of waves, extending
from slack water to moving water, whose waves retain their identity, no new ones
appearing and none disappearing. No description of such a train has yet been



given in terms of a velocity potential, and this may suggest that such a train cannot
exist; there may, for instance, be some reflexion of the wave train. But if the
existence of the wave train is assumed, and if the wave-length and wave velocity
of any part are related by the usual equation, the changes in velocity or length
may be predicted; in deep water the rule

$$\lambda = 2\pi c^2/g$$

enables the first equation to be written in the form

$$0 = (c/c_0)^2 - c/c_0 - u/c_0,$$

which is equivalent to formulae due to Unna. The equation describes two types of
wave train:

(1) If c/c₀ is greater than ½ the wave train is one which may extend into slack
water; it cannot extend into an opposing stream whose velocity is greater than
¼c₀. Before this point the waves steepen and break.

(2) If c/c₀ is less than ½ the wave train is one which rides on an opposing stream
whose velocity is greater than the group velocity, ½c, of the waves. The train
cannot extend into slack water; before it arrives there the waves steepen and
break. This wave train might be generated as the wake of a boat moving up
stream.
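
The two types of wave train correspond to the two roots of the quadratic in c/c₀. As a minimal sketch (the function name `steady_state_speeds` is introduced here for illustration), the roots may be evaluated for any stream velocity u, reckoned negative for an opposing stream:

```python
import math


def steady_state_speeds(u_over_c0):
    """Roots c/c0 of (c/c0)^2 - c/c0 - u/c0 = 0 for a given u/c0.

    Returns (larger root, smaller root), or None when the opposing stream
    exceeds c0/4 and no real root, hence no steady wave train, exists.
    """
    discriminant = 1.0 + 4.0 * u_over_c0
    if discriminant < 0.0:
        return None
    root = math.sqrt(discriminant)
    return (0.5 * (1.0 + root), 0.5 * (1.0 - root))


if __name__ == "__main__":
    for u in (0.0, -0.1, -0.2, -0.25, -0.3):
        print(f"u/c0 = {u:5.2f}: c/c0 = {steady_state_speeds(u)}")
```

On slack water (u = 0) the larger root gives c = c₀, as it should; the two roots merge at c/c₀ = ½ when the opposing stream reaches ¼c₀, and beyond that no real solution remains. Since the deep-water rule makes λ proportional to c², the ratio of wave-lengths is the square of the ratio of velocities.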

The arguments have been extended by Johnson (1947) to the two-dimensional 
problem of waves crossing obliquely the boundary between two currents. He infers 
that when the waves approach the boundary obliquely they may fail to cross it, 
their energy being dissipated by reflexion or in breaking. 

The non-steady problem

It will be appreciated that the tidal variations of period observed at Pendeen 
and Perranporth cannot appropriately be dealt with by steady-state theory. The 
velocities of the tidal streams change considerably during the time that the swell 
takes to cross the area of streams lying between the deep sea and the coast. It is 
desirable to develop some treatment suitable for waves on streams whose velocity 
changes with time. 

Unna (1941), discussing the behaviour of short waves riding on swell, treats the 
short waves as if they expanded and contracted with the water mass on which 
they move. The short waves then attain their greatest length as the trough of 
swell passes them and attain their smallest wave-length when they are overtaken 
by the crest. This somewhat intuitive treatment can be justified by reducing the 
problem to the steady state. A uniform velocity impressed on the system will 
bring the swell to rest and it may be regarded as slight variations in space, but not 
in time, of a rapid stream on which the short waves move. This leads to the 
formulae given by Unna. 

The idea that waves expand or contract with the water mass can be justified, 
however, by another means which is found to provide a solution to the general 
problem. This treatment is given below, and its conclusions will be seen to be in 
fairly good agreement with experimental observations. 




Kinematics of waves on streams

The classical theory of water waves shows that if the height of the waves is
sufficiently small the system can always be regarded as due to the superposition
of a sufficient number of elementary wave trains, each consisting of long parallel
crests equally spaced and moving with a speed appropriate to the wave-length.
In the resulting interference pattern the elementary waves lose their identity, and
the more complicated phenomena of group motion, in which waves appear and
disappear as they travel in groups, take their place.

In discussing waves on streaming water two assumptions will be made. By
analogy with the theory of waves on slack water it will be assumed that a com- 
plicated wave motion on streaming water can be looked upon as being the super- 
position of a sufficient number of elementary trains which could have an indepen- 
dent existence and in which the wave crests would retain their existence as they 
progressed through the streaming water. Whether such elementary wave trains 
are possible on streaming water is not known, but the analogy with classical wave 
theory makes it plausible. 

It will also be assumed that in each of these elementary trains of waves the
length of the wave and its velocity relative to the water are related by the equation
of classical theory

$$c^2 = \frac{g\lambda}{2\pi}\tanh\frac{2\pi h}{\lambda}, \tag{1}$$

where c = wave velocity relative to the water mass, λ = wave-length, g = acceleration
due to gravity, h = water depth.

This equation is strictly applicable only to the waves of an elementary train in
slack water, but it seems likely that it may apply to an elementary train on
streaming water provided:

(a) that the changes in stream velocity are small during a very large number of
wave periods;

(b) that the stream velocity is very nearly constant over a large number of
wave-lengths;

(c) that the streaming velocity is sensibly uniform over a depth equal to half
a wave-length.
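
Equation (1) is easily evaluated numerically. The sketch below, which assumes metric units and a conventional value of g, computes the wave velocity for a given wave-length and depth, and may be used to confirm the two familiar limits: c² = gλ/2π in deep water and c² = gh in shallow water.

```python
import math

G = 9.81  # acceleration due to gravity in m/s^2 (assumed value)


def phase_speed(wavelength, depth, g=G):
    """Wave velocity c from equation (1): c^2 = (g*lambda/2pi) tanh(2pi*h/lambda)."""
    k = 2.0 * math.pi / wavelength  # wave number, introduced for brevity
    return math.sqrt(g / k * math.tanh(k * depth))


if __name__ == "__main__":
    wavelength = 100.0  # metres, illustrative
    for depth in (1.0, 5.0, 20.0, 50.0, 1000.0):
        print(f"h = {depth:6.1f} m: c = {phase_speed(wavelength, depth):6.3f} m/s")
```

For a depth greater than about half a wave-length the hyperbolic tangent differs from unity by less than one per cent, which is the sense in which the water may be called deep.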

With these assumptions it is possible to consider the one-dimensional case
illustrated in figure 2. The velocity u of the stream is prescribed at each point x
as a function of the time t. An elementary train of waves is present on the surface,
and the length λ and velocity c of the waves will in general be functions of both
the position x and time t.


Figure 2. Kinematics of waves on streams (one-dimensional problem).





Considering a wave crest which is at position x at time t, its velocity relative to
the water is c and relative to the fixed system of co-ordinates is (c + u). The Nth
wave in succession from this one will be at a distance Nλ from it, and the velocity
of this wave will differ slightly from the velocity of the first one, the difference being

$$N\lambda\,\partial(c+u)/\partial x.$$

This relative velocity may also be expressed as the time rate of change of the
distance between the two crests. The value of λ depends both upon x and t, and
since the waves move forward a distance (c + u) δt in a brief interval δt, the rate of
change of Nλ with time is

$$N\{\partial\lambda/\partial t + (c+u)\,\partial\lambda/\partial x\}.$$

The two expressions for relative velocity may be equated to give the relation

$$\frac{1}{\lambda}\left[\frac{\partial\lambda}{\partial t} + (c+u)\frac{\partial\lambda}{\partial x}\right] = \frac{\partial(c+u)}{\partial x}. \tag{2}$$

Waves on water which is deep or of constant depth

The formulae (1) and (2) may now be combined, but the precise form of
equation (1) is not involved at this stage, and it is only necessary to observe that
the velocity c has been assumed to be a function of λ, g and h and not a function,
for instance, of the space and time derivatives of these quantities.

In the first instance it will be assumed that the waves are in water whose
depth is uniform and constant, or else that the water in which the waves are
travelling is deeper than a wave-length. The equation (1) then shows that we may
write

$$\frac{\partial c}{\partial x} = \frac{dc}{d\lambda}\frac{\partial\lambda}{\partial x}.$$

This relation when substituted in equation (2) gives

$$\frac{1}{\lambda}\left[\frac{\partial\lambda}{\partial t} + \left(u + c - \lambda\frac{dc}{d\lambda}\right)\frac{\partial\lambda}{\partial x}\right] = \frac{\partial u}{\partial x}. \tag{3}$$

The right-hand side of this equation measures the rate of expansion of the
water surface which is produced by the streaming motion. The left-hand side is
the fractional rate of increase of wave-length with time; the expression does not
refer to a given set of waves, but to those waves which happen to be in the vicinity
of an observer who moves relatively to the water with a velocity

$$c - \lambda\,dc/d\lambda.$$


This expression is the usual definition of the group velocity of the waves in 
classical theory. In this problem, therefore, as in many others, the group velocity 
enters into the equation with a peculiar significance. It appears that if an observer 
follows a given wave group through moving water, the average length of the waves 
in the group expands or contracts at the same rate as the general surface of the water 
upon which the group is moving. It will be appreciated that this result determines 





the whole behaviour of the waves, for the velocity and period may be inferred when 
the wave-length has been found. 
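
The group velocity $c-\lambda\,dc/d\lambda$ can be checked numerically. The sketch below is illustrative only: it assumes that equation (1), which is not reproduced in this part of the paper, is the classical relation $c^2=(g\lambda/2\pi)\tanh(2\pi h/\lambda)$, and the function names are hypothetical.

```python
import math

def phase_velocity(wavelength, depth, g=9.81):
    """Phase velocity c from the classical dispersion relation (assumed form of eq. (1))."""
    k = 2.0 * math.pi / wavelength
    return math.sqrt(g / k * math.tanh(k * depth))

def group_velocity(wavelength, depth, g=9.81, rel_step=1e-6):
    """G = c - lambda * dc/dlambda, with dc/dlambda from a central difference."""
    d = wavelength * rel_step
    dc_dlam = (phase_velocity(wavelength + d, depth, g)
               - phase_velocity(wavelength - d, depth, g)) / (2.0 * d)
    return phase_velocity(wavelength, depth, g) - wavelength * dc_dlam

if __name__ == "__main__":
    lam = 500.0  # metres, an arbitrary illustrative wave-length
    for h in (5000.0, 100.0, 10.0):
        c = phase_velocity(lam, h)
        G = group_velocity(lam, h)
        print(f"h = {h:7.1f} m   c = {c:6.2f} m/s   G = {G:6.2f} m/s   G/c = {G / c:.3f}")
```

In deep water the ratio G/c tends to one half, and in very shallow water it tends to unity, which are the familiar limits of the classical theory.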

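It is convenient to use a special symbol to denote the time rate of change of
some characteristic of the waves in a particular group, and using the notation

$$\frac{D}{Dt}=\frac{\partial}{\partial t}+\left(u+c-\lambda\frac{dc}{d\lambda}\right)\frac{\partial}{\partial x}, \tag{4}$$

the equation (3) may be written as

$$\frac{1}{\lambda}\frac{D\lambda}{Dt}=\frac{\partial u}{\partial x}. \tag{5}$$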
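
As a small numerical illustration of the statement that the wave-length of a group stretches at the same fractional rate as the water surface, the sketch below integrates $(1/\lambda)\,D\lambda/Dt=\partial u/\partial x$ for a constant rate of surface expansion. The constant value of $\partial u/\partial x$ (called `strain_rate` here) and the function name are assumptions made for the example only.

```python
import math

def integrate_wavelength(lam0, strain_rate, duration, steps=100_000):
    """Euler integration of (1/lambda) D(lambda)/Dt = du/dx for constant du/dx."""
    dt = duration / steps
    lam = lam0
    for _ in range(steps):
        lam += lam * strain_rate * dt
    return lam

if __name__ == "__main__":
    lam0 = 500.0            # metres
    strain_rate = 1.0e-5    # per second, hypothetical rate of surface expansion
    duration = 3.0 * 3600   # seconds spent by the group on this water
    lam = integrate_wavelength(lam0, strain_rate, duration)
    print(lam, lam0 * math.exp(strain_rate * duration))
```

For a constant rate of expansion the result agrees with the exponential law $\lambda=\lambda_0\exp(t\,\partial u/\partial x)$.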


The general case

It is clear that in tidal regions the water depth is a function both of place and 
time, and the effective value of gravity to be used in equation (1) may also vary; 
the waves are riding on a mass of water whose vertical acceleration must modify 
the effective force of gravity on the waves. It can be shown that these two effects 
rarely appear simultaneously; the changes in gravity are of importance in
discussing the behaviour of wind waves riding on long swell in water whose depth
is comparable with the length of the swell, and the changes in depth are of
importance in discussing the behaviour of waves on tides. For the sake of generality,
changes in both h and g will be discussed. 

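The wave equation (1) is a relation between the quantities c, $\lambda$, g and h, and if
the wave period is denoted by T, where

$$T=\lambda/c,$$

it may be shown that the following relations hold between the partial differential
coefficients:

$$\left(\frac{\partial c}{\partial h}\right)_{\lambda g}=\frac{1}{\lambda}\left(\frac{\partial\lambda}{\partial h}\right)_{Tg}\left\{c-\lambda\left(\frac{\partial c}{\partial\lambda}\right)_{gh}\right\},\qquad \left(\frac{\partial c}{\partial g}\right)_{\lambda h}=\frac{1}{\lambda}\left(\frac{\partial\lambda}{\partial g}\right)_{Th}\left\{c-\lambda\left(\frac{\partial c}{\partial\lambda}\right)_{gh}\right\}, \tag{6}$$

where the subscripts denote the quantities held constant during the differentiation.
The quantity $c-\lambda(\partial c/\partial\lambda)_{gh}$ in these expressions is recognizable as the group
velocity and will be denoted by G.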

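Substitution may now be made in the kinematic relation (2). The quantity
$\partial c/\partial x$ on the right-hand side of this equation may be expanded as

$$\frac{\partial c}{\partial x}=\left(\frac{\partial c}{\partial\lambda}\right)_{gh}\frac{\partial\lambda}{\partial x}+\left(\frac{\partial c}{\partial h}\right)_{\lambda g}\frac{\partial h}{\partial x}+\left(\frac{\partial c}{\partial g}\right)_{\lambda h}\frac{\partial g}{\partial x},$$

and using the equations (6) the kinematic equation becomes

$$\frac{1}{\lambda}\frac{D\lambda}{Dt}=\frac{\partial u}{\partial x}+\frac{G}{\lambda}\left(\frac{\partial\lambda}{\partial h}\right)_{Tg}\frac{\partial h}{\partial x}+\frac{G}{\lambda}\left(\frac{\partial\lambda}{\partial g}\right)_{Th}\frac{\partial g}{\partial x}. \tag{7}$$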


This expression for the fractional rate of increase of wave-length in a wave
group is similar to the equation (5) derived previously, but it includes two terms
which show the effects of changes in depth and in effective gravity. This may be
regarded as the fundamental equation governing the change of character of the
waves of a group as the group advances through streaming water. From the
wave-length the period or velocity or group velocity may be inferred, but it is
possible to obtain the changes in these quantities explicitly. Thus if M denotes one
of the quantities c, T or G it can be shown that



A reasonable physical interpretation may be given to equations (7) and (8).
From equation (7) it will be seen that the time derivatives of g and h do not affect
the wave-length, and so long as the waves remain on a mass of water which is all
behaving in a very similar way the changes in wave-length are produced solely
by the contraction or expansion of the water surface; changes in g or h only affect
the wave-length in so far as the group experiences them as a result of moving with
velocity G on to new masses of water where g or h are different. When moving on
to new water, the changes in wave-length occasioned by changes in g or h take
place as if the period of the waves remained constant. Thus if waves are present
on the surface of water in a tank which suddenly begins to accelerate upwards or
downwards the length of the waves will not change, but the period will alter
appropriately to the new value of g; when waves pass into shoaling water which
has no streaming motion the period remains constant, but the wave-length alters
appropriately to the new depth of water. The other wave characters experience
changes which may be inferred from these changes in wave-length or period.
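
The case of waves passing into shoaling water at constant period can be illustrated as follows. The sketch assumes, as an illustration only, that equation (1) is the classical relation $\lambda=(gT^2/2\pi)\tanh(2\pi h/\lambda)$, and solves it for the wave-length at a given period and depth; the function name and the depths used are hypothetical.

```python
import math

def wavelength_for_period(period, depth, g=9.81, tol=1e-10):
    """Solve lambda = (g T^2 / 2 pi) tanh(2 pi h / lambda) by damped fixed-point iteration."""
    lam_deep = g * period ** 2 / (2.0 * math.pi)
    lam = lam_deep
    for _ in range(10_000):
        new = lam_deep * math.tanh(2.0 * math.pi * depth / lam)
        # Averaging with the previous value keeps the iteration stable in shallow water.
        new = 0.5 * (lam + new)
        if abs(new - lam) < tol:
            return new
        lam = new
    raise RuntimeError("dispersion relation did not converge")

if __name__ == "__main__":
    for h in (5000.0, 100.0, 50.0, 20.0):
        print(f"h = {h:6.0f} m   wave-length = {wavelength_for_period(18.0, h):7.1f} m")
```

With the period held at 18 sec. the computed wave-length falls steadily as the depth is reduced, which is the behaviour described above for shoaling water without streaming motion.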

Changes in the apparent period 

A wave characteristic which is not included in the general treatment of the 
previous section is the apparent period of the waves as recorded by a stationary 
observer. This will be written as 

$$T^{*}=\lambda/(c+u), \tag{9}$$

and it is this apparent period which is measured in the frequency spectra of waves
recorded by an instrument fixed upon the sea bed.
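
A short numerical illustration of the apparent period may help. The sketch below evaluates $T^{*}=\lambda/(c+u)$ for an 18 sec. wave on streams of one knot; the deep-water values $c=gT/2\pi$ and $\lambda=cT$ are assumptions made for the example, and the names are hypothetical.

```python
import math

KNOT = 0.5144  # metres per second in one knot (approximate)

def apparent_period(wavelength, phase_velocity, stream_velocity):
    """T* = lambda / (c + u), as recorded by a stationary observer."""
    return wavelength / (phase_velocity + stream_velocity)

if __name__ == "__main__":
    g, period = 9.81, 18.0
    c = g * period / (2.0 * math.pi)   # deep-water phase velocity (assumption)
    lam = c * period
    for u_knots in (-1.0, 0.0, 1.0):
        t_star = apparent_period(lam, c, u_knots * KNOT)
        print(f"u = {u_knots:+.0f} knot   T* = {t_star:.2f} s")
```

A following stream shortens the apparent period and an opposing stream lengthens it, while in still water the apparent period equals the true period.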

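The differentiation of equation (9) gives

$$\frac{1}{T^{*}}\frac{DT^{*}}{Dt}=\frac{1}{\lambda}\frac{D\lambda}{Dt}-\frac{1}{(c+u)}\left(\frac{Dc}{Dt}+\frac{Du}{Dt}\right),$$

where the operator $D/Dt$ has the significance previously given to it in (4). The
quantities $D\lambda/Dt$, $Dc/Dt$ and $Du/Dt$ may be obtained from equations (7), (8) and
(4), and upon substitution of these quantities the equation governing the apparent
period appears as

$$\frac{1}{T^{*}}\frac{DT^{*}}{Dt}=-\frac{1}{(u+c)}\left[\frac{\partial u}{\partial t}+\left(\frac{\partial c}{\partial h}\right)_{\lambda g}\frac{\partial h}{\partial t}+\left(\frac{\partial c}{\partial g}\right)_{\lambda h}\frac{\partial g}{\partial t}\right]. \tag{10}$$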




This equation shows that the changes in the apparent period depend only upon 
the time derivatives of u, g or h; if, therefore, a wave group passes through an 
area of sea in which the streaming motion is everywhere constant with time, the 
apparent period of the waves in the group remains unchanged throughout its 
progress. The equation shows how the apparent period will change if the waves 
pass through a tidal area in which u, g and h vary with time. 

When wave groups complete their passage through a tidal area in a small 
fraction of a tidal cycle the change in apparent period will be proportionately 
small. Comparing two wave-measuring stations screened from the ocean by 
different widths of continental shelf, it may be inferred that the corrections which 
must be made to the measured periods in order to obtain the true wave periods 
in the open sea will be smaller for the station with the narrower continental shelf, 
assuming that the stream velocities and tidal ranges are similar in the two cases.
The proposed wave measuring stations at Casablanca and Wellington, N.Z, are 
likely therefore to provide a simpler picture of the state of the sea than do the 
stations in Cornwall. 
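
The dependence of the change in apparent period on the time of transit can be sketched numerically. The example below keeps only the stream term of equation (10), taken here in the form $(1/T^{*})\,DT^{*}/Dt=-(\partial u/\partial t)/(u+c)$, which is the form obtained by differentiating equation (9). It further assumes a stream that is uniform in space and sinusoidal in time; the phase velocity, stream amplitude and tidal period are hypothetical values chosen only for illustration.

```python
import math

TIDAL_PERIOD = 12.42 * 3600.0  # seconds, semi-diurnal tide (assumed)

def period_change(transit_time, start_time, c=25.0, u_max=0.5, steps=20_000):
    """Fractional change in T* from the stream term alone, for a spatially uniform stream."""
    omega = 2.0 * math.pi / TIDAL_PERIOD
    dt = transit_time / steps
    log_ratio = 0.0
    for i in range(steps):
        t = start_time + (i + 0.5) * dt              # midpoint rule
        u = u_max * math.sin(omega * t)
        du_dt = u_max * omega * math.cos(omega * t)
        log_ratio -= du_dt / (c + u) * dt
    return math.exp(log_ratio) - 1.0

if __name__ == "__main__":
    for hours in (1.0, 3.0):
        change = period_change(hours * 3600.0, 0.0)
        print(f"transit {hours:.0f} hr   fractional change in T* = {change:+.4f}")
```

Under these assumptions a group that crosses the tidal area in one hour suffers a smaller change than one that takes three hours, in line with the argument about the width of the continental shelf.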


A practical instance of the tidal variation in apparent period

(a) Swell travelling from the Southern Ocean

The equations (6), (7) and (8) derived in the previous sections make it possible 
to deduce what changes will take place in waves when they cross an area of sea 
in which the stream velocity and the depth of water are known as functions of 
time and position. In any practical case it is unlikely that the stream velocity 
and depth can be expressed as analytic functions of x and t, and a graphical 
method is more suitable for the integration of the equations. This method has been 
used to estimate the total variation to be expected in the apparent period of 
a wave train having a period of 18 sec. in the open sea, which arrives at
Perranporth, Cornwall, after being generated in the region of the Falkland Islands. The
fluctuations observed experimentally have been shown in figure 1.

Figure 3 is a refraction diagram for such waves as they cross the continental
shelf, and the full line represents the path of a wave group arriving at Perranporth.
Figure 4 shows the velocity of the tidal streams at various positions on this path
throughout the tidal cycle, the value used being the component of the stream
velocity parallel to the line of the path; these values have been estimated from the
Admiralty Atlas of tidal streams. Most of the curves of figure 4 are approximately
sinusoidal, and irregularities are probably due to errors in interpolating from the
Atlas. They are more complicated in the region of Perranporth, for the tidal
streams on the north Cornish coast appear to show a double maximum flood
stream. Figure 5 shows the mean depth of water at points along the path, values
of the tidal range in depth and the time interval by which high water at various
points along the path precedes high water at Dover; this information is obtained
from the Admiralty Atlas of tides, and high water at Dover is used as a reference
time so as to conform with usual practice.




The data of figures 4 and 5 were used in the integration of equation (10) to
obtain the total change in apparent period which the waves would show at
Perranporth; only the first two terms on the right-hand side of this equation were
considered. The values of the quantity $\partial u/\partial t$ were obtained for points at intervals of
10 miles along the path by taking differences between successive hourly values
of the stream velocities in figure 4. These values were written in array upon a
diagram whose axes were the distance from Perranporth and the time relative to
high water at Dover. On this diagram straight lines were drawn to represent the
progress of a wave group arriving at Perranporth at various hours before and after
the time of high water at Dover, the gradient of the lines being appropriate to a
uniform velocity of the group equal to 32 knots. Some approximation is involved
here, since the different depths of water along the path lead to theoretical values
of the group velocity ranging from 33.5 to 30.0 knots, and the tidal streams
themselves are of the order of a knot and affect the progress of the group over ground.
These differences mean that the position of a wave group at hourly intervals
would be somewhat different from the positions obtained by the assumption of
a constant value for the group velocity, and the values of $\partial u/\partial t$ at these positions
would differ in consequence; it was considered, however, that assumption of a
constant group velocity of 32 knots would not lead to any serious error. From
the diagram it was possible to infer the values of $\partial u/\partial t$ which the groups would
experience at equal intervals during their progress and to obtain, by addition, the
integrated value of the first term in equation (10). The changes in apparent period,
due to the term $\partial u/\partial t$, which the various groups would exhibit on arrival at
Perranporth are shown by the broken line b in figure 6. It seems likely that much
of the erratic nature of this curve is due to errors in interpolation in reading from
the Atlas. The large fluctuation at 2 hr. after high water at Dover may be real,
however, since it is associated with the double flood stream near the north Cornish
coast.

Figure 3. Refraction diagram for waves crossing the continental shelf from the direction
215°. Depth contours in fathoms. Broken lines show every 100th wave crest. Full
line shows the path of a group arriving at Perranporth.

Figure 4. Velocity of tidal stream at various points upon the path in
figure 3. The velocity plotted is the component of stream velocity along the path.
(Legend on the figure: graphs against time of the component of velocity of tidal
stream (at springs) in the direction of wave progress from Falkland Islands, past
Land's End to Perranporth. The curves are based on the Admiralty charts of tidal
streams issued for hourly intervals relative to high water at Dover. The horizontal
lines are at intervals of 1 knot on the vertical scale. The times at which the streams
are zero are indicated by small circles. Time axis: hours before and after high
water at Dover.)

Figure 5. Mean depth of water at various points along the path shown in figure 3. a, mean
depth of water; b, range of tide at springs; c, hours by which high tide precedes the time
of high tide at Dover.

Figure 6. Estimated values of apparent period of waves originally of 18 sec. period which
travel across the continental shelf in a direction 036° as in figure 3, and arrive at
Perranporth at various tidal times. a, total value; b, effect of tidal stream velocities; c, effect
of changing water depth in tides.

The integral of the second term in equation (10) was obtained somewhat
differently. The maximum value of the quantity $\partial h/\partial t$ at various positions on the path
was obtained from the tidal ranges shown in figure 5 on the assumption that the
rise and fall of water-level was sinusoidal with tidal period, and the values of the
coefficient $\frac{1}{c}\left(\frac{\partial c}{\partial h}\right)_{\lambda g}$ were evaluated for the various positions on the path using
the mean water depths shown in figure 5 and assuming a value of 18 sec. for the
wave period. The quantities were then compounded in a vector diagram, the
amplitudes of the vector elements being the maximum values of the quantity

$$\frac{1}{c}\left(\frac{\partial c}{\partial h}\right)_{\lambda g}\frac{\partial h}{\partial t}$$

obtaining at the various positions on the path, and the angles of the vector elements
being the hour angle by which a wave group arriving at Perranporth at the time
of high water at Dover would pass each of the positions prior to the time of
maximum $\partial h/\partial t$ at that position. The summation of these elements in the vector
diagram showed that the greatest increase in apparent period due to this cause
would occur in waves arriving at Perranporth 2½ hr. before high water at Dover,
and it would amount to an increase of 0.28 sec.; for the remainder of the tidal
cycle it is indicated by the sinusoidal broken curve c in figure 6.
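
The compounding of sinusoidal contributions in a vector diagram is equivalent to adding complex phasors. The sketch below shows the arithmetic only; the amplitudes and hour angles are hypothetical and are not the values read from figures 4 and 5.

```python
import cmath
import math

def combine_phasors(amplitudes, phase_hours, tidal_period_hours=12.42):
    """Sum contributions a_k cos(omega (t - p_k)); return amplitude and lag (hours) of the total."""
    omega = 2.0 * math.pi / tidal_period_hours
    total = sum(a * cmath.exp(1j * omega * p) for a, p in zip(amplitudes, phase_hours))
    amplitude = abs(total)
    lag_hours = cmath.phase(total) / omega
    return amplitude, lag_hours

if __name__ == "__main__":
    amplitudes = [0.05, 0.08, 0.12]   # hypothetical contributions from three positions
    phase_hours = [0.0, 1.5, 3.0]     # hypothetical hour angles
    amp, lag = combine_phasors(amplitudes, phase_hours)
    print(f"resultant amplitude = {amp:.3f}, occurring {lag:.2f} hr after the reference time")
```

The resultant is again sinusoidal with tidal period, which is why the combined effect of changing depth can be drawn as a single sinusoidal curve.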

The sum of the two calculated changes is shown as the full line a in figure 6.
Disregarding certain irregularities in the curve it appears that the overall fluctuation
in apparent period should be about 2-3 sec., and the greatest period should be
shown by waves arriving at Perranporth about 3 hr. before the time of high water
at Dover. This is in fair agreement with the fluctuations in the apparent period
evident in figure 1.

(b) Swell travelling from the west

Calculations similar to those above have been made to find the change in 
apparent period that might be expected in swell arriving at Perranporth after 
having crossed the continental shelf from the west. The calculated curves are 
shown in figure 7; these suggest that the greatest increase in period would be 
about 0.6 sec. and would be shown by swell arriving at Perranporth about half an
hour before the time of high water at Dover. The swell generated in storms in the 
North Atlantic does not usually show such regular and well-defined fluctuations 
in period as does swell coming from the Southern Ocean, and it can only be said 
that the calculated changes shown in figure 7 are not in violent disagreement with 
the observations. 



Figure 7. Estimated values of apparent period of waves originally of 18 sec. period which
travel across the continental shelf in a direction 090° and arrive at Perranporth at
various tidal times. a, total value; b, effect of tidal streams; c, effect of changing water
depth in tides.

Conclusions

Fluctuations in tidal cycles of the period of waves recorded by a stationary 
instrument can satisfactorily be attributed to the action of tidal streams. In deep 
water the average length of waves appears to expand or contract at the same rate 
as the general surface of water on which they are moving. The change in period 
will be proportionately small when the tidal streams are weak or where the waves 
complete their passage through the tidal area in a small fraction of a tidal cycle. 

The author wishes to express his indebtedness to P. J. H. Unna for a number 
of suggestions on this and on allied subjects. The data of figure 1 are based on 
frequency analyses made by J. Darbyshire. The paper is published by permission
of the Chief of the Royal Naval Scientific Service. 


References

Barber, N. F. & Ursell, F. 1948 Phil. Trans. A, 240, 527.
Johnson, J. W. 1947 Trans. Amer. Geophys. Un. 28, 867.
Unna, P. J. H. 1941 Nature, 148, 226.
Unna, P. J. H. 1942 Nature, 149, 219.
Unna, P. J. H. 1947 Nature, 159, 239.



Kinetic theory of diffusion in gases and liquids
I. Diffusion and the Brownian motion

By L. M. Yang, Department of Mathematical Physics, University of Edinburgh

(Communicated by M. Born, F.R.S.; Received 25 November 1948;
Revised 10 February 1949)

In the present paper the phenomenon of diffusion is examined in the light of the theory of
the Brownian motion. The coefficients of self-diffusion, ordinary diffusion and thermal
diffusion are expressed in terms of the first and second moments of certain transition
probabilities in the theory of the Brownian motion. It is then found possible in gases of
low or moderate density where a fairly well-defined free path exists to follow the future course
of a given molecule statistically to as many free flights as required provided the velocity
distribution of the molecules in the medium is known. This consideration on the one hand
leads to a rigorous expression for the coefficient of self-diffusion directly calculated from
a Maxwellian distribution, and on the other serves to clarify the relation between the old
free-path theory of gaseous diffusion and the rigorous theory of gaseous diffusion and between
self-diffusion and mutual diffusion. Further, an approximate theory of diffusion in liquids
corresponding to the old free-path theory in gases is suggested.


1. Introduction

The kinetic theory of diffusion in gases has been approached in two different ways in 
the past. The first was by means of the free-path theory which, though unable to 
give exact quantitative results owing to the assumption that molecules after collision 
are distributed according to the Maxwellian velocity distribution in non-uniform 
gases, provides a simple, vivid picture of the complex molecular transport
phenomena in gases (Jeans 1925). The second one, due to Chapman and Enskog, was to
determine the deviation from the Maxwellian velocity distribution in a non-uniform gas
by solving Maxwell and Boltzmann's collision equation (Chapman & Cowling 1939).
This has been successful in most respects for gases at low density, but the generaliza- 
tion for the condensed phase leads to almost insurmountable difficulties. In the 
present paper the phenomenon of diffusion is examined in the light of the theory of 
Brownian motion (Chandrasekhar 1943). It is found that the coefficient of self- 
diffusion in a simple gas or a gas mixture can be rigorously calculated by treating 
the gas molecules as Brownian particles. Consistent with the assumption of a con- 
stant coefficient of diffusion it is shown in the theory of the Brownian motion that 
the mean square displacement is proportional to the time interval provided that the 
time interval is small, so that the mean square displacement is macroscopically small
but at the same time large enough for the particle to suffer a large number of collisions
with the surrounding molecules. The constant ratio is obviously a physical property
of the medium in which the particle moves. The relation of this constant ratio to
molecular data, which is left untouched in the theory of Brownian motion, can be
obtained only if one can follow the motion of the particle within the time interval
described above. By taking advantage of the simplified features of a gas, it has been
found possible to formulate rigorously the probability of a series of successive flights

and to follow statistically the future course of a given molecule for as many successive 
flights as one requires. Thereby the above-mentioned ratio is explicitly expressed 
in terms of molecular data and is shown to approach a plateau value as the number 
of free flights occurring in the interval becomes large. The present method of cal- 
culating the coefficient of self-diffusion has the advantage that it requires only the 
equilibrium velocity distribution and is easily extended to apply to mixtures of 
more than two components. The same method applied to a non-uniform gas brings 
out the essentially different nature of self-diffusion in uniform fluids and mutual 
diffusion in non-uniform fluids, reveals the nature of approximation inherent in the 
free-path theory, and enhances the necessity for obtaining exactly the local true 
velocity distribution, consistent with the existing non-uniform state of the fluid. 

In spite of the different nature of self-diffusion and mutual diffusion mentioned 
above, the latter can always be expressed approximately in terms of the former; the 
old free-path theory is just a simple example here. A simple explanation of the 
phenomena of thermal diffusion is here obtained. It also suggests that in the liquid 
region a corresponding approximate theory is possible, provided that the coefficient 
of self-diffusion in a uniform liquid mixture is known. The strict theory of diffusion 
in liquids, however, will require finding the true local velocity distribution. This
will be undertaken in part II. 

2. Diffusion and transition probability

The transport of molecular properties in a non-uniform fluid proceeds in two
different ways: the first is by means of the migration of molecules from one place to
another carrying with them the local properties (the densities, the mass velocity
and the temperature); this may be called the kinetic part of the transport process;
the second way, called the potential part of the transport process, is due to the action
of intermolecular forces. Diffusion is decidedly the simplest of all transport processes,
as it involves only the transport of number densities. The process of diffusion
in a mixture can be described in all cases by means of a transition probability
$\psi_i(\mathbf{x},\Delta\mathbf{x},t,\Delta t)$ defined as the probability that a molecule of the ith kind in the
mixture at $\mathbf{x}$ and at time t will suffer a displacement $\Delta\mathbf{x}$ in time $\Delta t$. $\psi_i$ depends in
general on $\mathbf{x}$ and t through the local parameters $\lambda_j$ ($= n_i, \mathbf{u}, T, E_i$) and their space
gradients of all order, where $n_i$ is the number density of the ith kind of molecules,
T the temperature, $\mathbf{u}$ the mass motion velocity and $E_i$ the potential energy of the
ith kind of molecule in the external force field. The time interval $\Delta t$ is supposed to
be small, so that the mean square displacement is small by macroscopical standards 
but at the same time large compared to the interval between individual collisions 
in the case of gaseous diffusion, and for diffusion in liquids it may conveniently be 
identified with the mean life of a molecule in a given site. In both cases the velocity 
of the molecule at the end of the time interval is no longer correlated with that at 
the beginning of the interval, and the displacements in successive intervals are 
therefore independent of each other. 

Concerning the local parameters it can be said that these are the least number of 
local macroscopic variables adequate for a complete specification of the macroscopic 



state of a liquid mixture in the vicinity of a given point inside the fluid. Seeing that
a drop of liquid mixture is but an ensemble of several kinds of molecules, besides
their intrinsic properties such as the molecular masses and the potential functions
between pairs of molecules, one can further specify the average number of molecules
per unit volume of each kind ($n_i$), their average momentum ($\mathbf{u}$), average kinetic
energy (T) and the potential energy of each kind of molecules in the external force
field ($E_i$).

For the purpose of calculating the coefficient of diffusion it will be seen in the
following that only the first and second moments ($\overline{\Delta\mathbf{x}}_i$, $\overline{(\Delta\mathbf{x})^2}_i$) of the transition
probability are required. In the case of self-diffusion it is well known (Einstein
1905) that the coefficient of self-diffusion $D_i$ of the ith kind of molecule in an
equilibrium mixture can be expressed as

$$D_i=\frac{1}{6}\,\frac{\overline{(\Delta\mathbf{x})^2}_i}{\Delta t}, \tag{1}$$

where

$$\overline{(\Delta\mathbf{x})^2}_i=\int\psi_i(\Delta\mathbf{x},\Delta t)\,(\Delta\mathbf{x})^2\,d(\Delta\mathbf{x}), \tag{2}$$

the integration being extended over all values of $\Delta\mathbf{x}$. $\psi_i$ in the integrand depends
only on $\Delta\mathbf{x}$, $\Delta t$ and not on $\mathbf{x}$ and t, as the $\lambda_j$'s are constant in a uniform fluid. For
mutual diffusion in general one considers a 'normal' non-uniform fluid in which the
$\lambda_j$'s vary smoothly in space. (Hereafter we limit ourselves to a binary mixture.)
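
The Einstein relation can be illustrated by a small sampling experiment. The sketch below assumes the three-dimensional form $D=\overline{(\Delta\mathbf{x})^2}/(6\,\Delta t)$ and an isotropic Gaussian transition probability, neither of which is taken from molecular data; the function name and parameter values are hypothetical.

```python
import random

def estimate_self_diffusion(d_true, dt, samples=20_000, seed=0):
    """Estimate D from <|dx|^2> / (6 dt) for Gaussian displacements in three dimensions."""
    rng = random.Random(seed)
    sigma = (2.0 * d_true * dt) ** 0.5   # standard deviation of each component
    total = 0.0
    for _ in range(samples):
        dx, dy, dz = (rng.gauss(0.0, sigma) for _ in range(3))
        total += dx * dx + dy * dy + dz * dz
    return total / samples / (6.0 * dt)

if __name__ == "__main__":
    print(estimate_self_diffusion(d_true=1.5e-5, dt=1.0e-3))
```

The estimate recovers the diffusion coefficient used to generate the displacements to within the sampling error, which is the sense in which the second moment of the transition probability fixes the coefficient of self-diffusion.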

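Let $n_1^{(i)}(\mathbf{x},t)\,d\mathbf{x}$ be the probability of finding the ith molecule of the first kind at
t in $\mathbf{x}$, $d\mathbf{x}$. Then one can express the probability distribution of a given molecule at
$t+\Delta t$ in terms of that at t and $\psi_1(\mathbf{x},\Delta\mathbf{x},t,\Delta t)$, viz.

$$n_1^{(i)}(\mathbf{x},t+\Delta t)=\int n_1^{(i)}(\mathbf{x}-\Delta\mathbf{x},t)\,\psi_1(\mathbf{x}-\Delta\mathbf{x},\Delta\mathbf{x},t,\Delta t)\,d(\Delta\mathbf{x}), \tag{3}$$

the integration being extended over the whole volume occupied by the fluid.
Equation (3) can be expanded on both sides on account of our assumption made
concerning $\Delta t$,

$$\Delta t\,\frac{\partial}{\partial t}n_1^{(i)}(\mathbf{x},t)+O(\Delta t)^2=-\frac{\partial}{\partial\mathbf{x}}\cdot\left[n_1^{(i)}(\mathbf{x},t)\,\overline{\Delta\mathbf{x}}_1\right]+\frac{1}{2}\frac{\partial^2}{\partial\mathbf{x}\,\partial\mathbf{x}}:\left[n_1^{(i)}(\mathbf{x},t)\,\overline{\Delta\mathbf{x}\,\Delta\mathbf{x}}_1\right]+O(\Delta\mathbf{x}\,\Delta\mathbf{x}\,\Delta\mathbf{x}), \tag{4}$$

where

$$\overline{\Delta\mathbf{x}}_1=\int\psi_1(\mathbf{x},\Delta\mathbf{x},t,\Delta t)\,(\Delta\mathbf{x})\,d(\Delta\mathbf{x}),\qquad \overline{\Delta\mathbf{x}\,\Delta\mathbf{x}}_1=\int\psi_1(\mathbf{x},\Delta\mathbf{x},t,\Delta t)\,(\Delta\mathbf{x}\,\Delta\mathbf{x})\,d(\Delta\mathbf{x}). \tag{5}$$

Assuming that both $\overline{\Delta\mathbf{x}}_1$ and $\overline{\Delta\mathbf{x}\,\Delta\mathbf{x}}_1$ are of the order of $\Delta t$ and neglecting terms
of higher order than $\Delta t$, one obtains from equation (4)

$$\frac{\partial}{\partial t}n_1^{(i)}(\mathbf{x},t)=-\frac{\partial}{\partial\mathbf{x}}\cdot\left[n_1^{(i)}\boldsymbol{\beta}_1\right]+\frac{\partial^2}{\partial\mathbf{x}\,\partial\mathbf{x}}:\left[n_1^{(i)}\mathbf{a}_1\right], \tag{6}$$

where*

$$\boldsymbol{\beta}_1\,\Delta t=\overline{\Delta\mathbf{x}}_1,\qquad 2\mathbf{a}_1\,\Delta t=\overline{\Delta\mathbf{x}\,\Delta\mathbf{x}}_1. \tag{7}$$

* The a in clarendon type represents a tensor quantity.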
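
The passage from the integral relation (3) to drift and dispersion coefficients can be illustrated on a one-dimensional lattice. In the sketch below the transition probability is a hypothetical three-point kernel that does not depend on position or time; repeated application of the discrete analogue of (3) makes the mean displacement grow by the first moment of the kernel at each step and the variance by the kernel variance, which agrees with $2a\,\Delta t$ to the first order in $\Delta t$.

```python
def step_distribution(density, kernel):
    """One application of the lattice analogue of eq. (3): n(x, t+dt) = sum_k n(x-k, t) psi(k)."""
    out = {}
    for x, n in density.items():
        for k, p in kernel.items():
            out[x + k] = out.get(x + k, 0.0) + n * p
    return out

def moments(density):
    """Mean and variance of a normalized lattice distribution."""
    mean = sum(x * n for x, n in density.items())
    var = sum((x - mean) ** 2 * n for x, n in density.items())
    return mean, var

if __name__ == "__main__":
    kernel = {-1: 0.2, 0: 0.5, 1: 0.3}   # hypothetical transition probability
    density = {0: 1.0}                   # molecule known to start at the origin
    for _ in range(50):
        density = step_distribution(density, kernel)
    print(moments(density))
```

For this kernel the first moment is 0.1 and the variance 0.49 per step, so that after 50 steps the mean is 5 and the variance 24.5, both proportional to the elapsed time.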



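Since $\psi_1$ and hence $\boldsymbol{\beta}_1$, $\mathbf{a}_1$ depends on the space gradients of $\lambda_j$ at $\mathbf{x}$, t which are assumed
to be small, we may expand $\boldsymbol{\beta}_1$, $\mathbf{a}_1$ with respect to the gradients, i.e.

$$\boldsymbol{\beta}_1=\ldots,\qquad \mathbf{a}_1=\tfrac{1}{3}\alpha_1\mathbf{1}+\ldots, \tag{8}$$

where $\mathbf{1}$ stands for the unit tensor. As terms of higher order than the second in the
gradients on the right-hand side of (6) will be neglected, we require for $\boldsymbol{\beta}_1$ terms
involving at most the first-order gradients and for $\mathbf{a}_1$ only the first term in (8),
viz. $\tfrac{1}{3}\alpha_1\mathbf{1}$. Equation (6) can then be written in the form

$$\frac{\partial}{\partial t}n_1^{(i)}(\mathbf{x},t)=-\frac{\partial}{\partial\mathbf{x}}\cdot\left[n_1^{(i)}(\mathbf{x},t)\,\boldsymbol{\beta}_1\right]+\frac{1}{3}\frac{\partial}{\partial\mathbf{x}}\cdot\frac{\partial}{\partial\mathbf{x}}\left[n_1^{(i)}(\mathbf{x},t)\,\alpha_1\right]. \tag{9}$$

It follows from the equation of continuity for $n_1^{(i)}(\mathbf{x},t)$ that the local flow
$n_1^{(i)}(\mathbf{x},t)\,\mathbf{u}_1^{(i)}(\mathbf{x},t)$ of the probability density $n_1^{(i)}(\mathbf{x},t)$ is

$$n_1^{(i)}\mathbf{u}_1^{(i)}=n_1^{(i)}\boldsymbol{\beta}_1-\frac{1}{3}\frac{\partial}{\partial\mathbf{x}}\left[n_1^{(i)}\alpha_1\right]. \tag{10}$$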

Up to the present, only the probability of finding a particular molecule at a given
position and time is considered. To relate it with the local partial density $n_1(\mathbf{x},t)$
of molecules of the first kind, one can imagine that at some given instant when the
mixture is already in its normal state, the $\lambda_j$'s are known throughout the fluid, and
in particular that the positions of all the molecules are exactly known; the transition
probability of each molecule is found by treating the rest as a continuous medium,
with the $\lambda_j$'s known everywhere. At any later instant t, the probability of finding the
volume element $\mathbf{x}$, $d\mathbf{x}$ occupied by any molecule of the first kind must be equal to
the sum of the probabilities that it is occupied by each of the molecules of the
first kind, i.e.

$$n_1(\mathbf{x},t)=\sum_{i=1}^{N_1}n_1^{(i)}(\mathbf{x},t). \tag{11}$$

Besides, $\mathbf{a}_1$ and $\boldsymbol{\beta}_1$ depend only on $\mathbf{x}$, t and are independent of which molecule of
the first kind is chosen for consideration. The total local flow of molecules of the
first kind is

$$n_1\boldsymbol{\beta}_1-\frac{1}{3}\frac{\partial}{\partial\mathbf{x}}\left[n_1\alpha_1\right], \tag{12}$$

of which a part is due to the local mass velocity $\mathbf{u}$ and the rest is the interdiffusion
velocity $\bar{\mathbf{u}}_1$. Hence

$$\bar{\mathbf{u}}_1=\boldsymbol{\beta}'_1-\frac{1}{3}\frac{\partial\alpha_1}{\partial\mathbf{x}}-\frac{\alpha_1}{3n_1}\frac{\partial n_1}{\partial\mathbf{x}}, \tag{13}$$

where

$$\boldsymbol{\beta}'_1=\boldsymbol{\beta}_1-\mathbf{u}. \tag{14}$$

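Proceeding in the same way, one obtains for molecules of the second kind

$$\bar{\mathbf{u}}_2=\boldsymbol{\beta}'_2-\frac{1}{3}\frac{\partial\alpha_2}{\partial\mathbf{x}}-\frac{\alpha_2}{3n_2}\frac{\partial n_2}{\partial\mathbf{x}}, \tag{15}$$

where $\boldsymbol{\beta}'_2$, $\alpha_2$ and $n_2$ have analogous meanings.

The mutual diffusion velocity $(\bar{\mathbf{u}}_1-\bar{\mathbf{u}}_2)$ follows from (13) and (15):

$$\bar{\mathbf{u}}_1-\bar{\mathbf{u}}_2=(\boldsymbol{\beta}'_1-\boldsymbol{\beta}'_2)-\frac{1}{3}\frac{\partial}{\partial\mathbf{x}}(\alpha_1-\alpha_2)-\frac{1}{3}\left(\frac{\alpha_1}{n_1}\frac{\partial n_1}{\partial\mathbf{x}}-\frac{\alpha_2}{n_2}\frac{\partial n_2}{\partial\mathbf{x}}\right). \tag{16}$$

From the above equation one can regard the total mutual diffusion velocity
$(\bar{\mathbf{u}}_1-\bar{\mathbf{u}}_2)$ as consisting of three parts: the first, $(\boldsymbol{\beta}'_1-\boldsymbol{\beta}'_2)$, is the difference of the rates
of drift of the two kinds of molecules; the second, $-\frac{1}{3}\frac{\partial}{\partial\mathbf{x}}(\alpha_1-\alpha_2)$, is the gradient of
the difference of the rates of dispersion; and the third, $-\frac{1}{3}\left(\frac{\alpha_1}{n_1}\frac{\partial n_1}{\partial\mathbf{x}}-\frac{\alpha_2}{n_2}\frac{\partial n_2}{\partial\mathbf{x}}\right)$, is the
contribution from self-diffusion in the presence of density gradients. As the evaluation
of the $\boldsymbol{\beta}'$'s requires the real velocity distribution in a non-uniform fluid, the most
that one can expect, without knowing the deviation from the Maxwellian
distribution, is to determine the second and third parts in (16), and this is just what the old
free-path theory was intended to do, though the persistence of motion has never
been properly taken care of. This will be discussed in more detail in § 3.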

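In the following we give the formal expressions of the coefficients of ordinary
diffusion $D_0$ and thermal diffusion $D_T$ in terms of $\boldsymbol{\beta}'_1$, $\boldsymbol{\beta}'_2$, $\alpha_1$ and $\alpha_2$. As $\boldsymbol{\beta}'_1$ and $\boldsymbol{\beta}'_2$ are
linear in the gradients, so are $\partial\alpha_1/\partial\mathbf{x}$ and $\partial\alpha_2/\partial\mathbf{x}$, and the only components which are
of interest in the present calculation are in the directions of $\partial n_1/\partial\mathbf{x}$, $\partial n_2/\partial\mathbf{x}$ and $\partial T/\partial\mathbf{x}$;
one can write

$$\boldsymbol{\beta}'_1=\beta_1^{(1)}\frac{\partial n_1}{\partial\mathbf{x}}+\beta_1^{(2)}\frac{\partial n_2}{\partial\mathbf{x}}+\beta_1^{(3)}\frac{\partial T}{\partial\mathbf{x}},\qquad \frac{\partial\alpha_1}{\partial\mathbf{x}}=\frac{\partial\alpha_1}{\partial n_1}\frac{\partial n_1}{\partial\mathbf{x}}+\frac{\partial\alpha_1}{\partial n_2}\frac{\partial n_2}{\partial\mathbf{x}}+\frac{\partial\alpha_1}{\partial T}\frac{\partial T}{\partial\mathbf{x}}, \tag{17}$$

with similar expressions for $\boldsymbol{\beta}'_2$ and $\partial\alpha_2/\partial\mathbf{x}$. That the $\beta^{(i)}$'s (i = 1, 2, 3) are simply
scalars is required by the fact that they can only be functions of $n_1$, $n_2$ and T.

In order to select the part of the diffusion velocity due to the presence of a pressure
gradient, one transforms the gradients $\partial n_1/\partial\mathbf{x}$, $\partial n_2/\partial\mathbf{x}$ and $\partial T/\partial\mathbf{x}$ to $\partial c_1/\partial\mathbf{x}$, $\partial p/\partial\mathbf{x}$
and $\partial T/\partial\mathbf{x}$, where $c_1=n_1/n$ and $n=n_1+n_2$, while $p=p_1+p_2$ is the total pressure:

$$\frac{\partial c_1}{\partial\mathbf{x}}=\frac{n_1n_2}{n^2}\left(\frac{1}{n_1}\frac{\partial n_1}{\partial\mathbf{x}}-\frac{1}{n_2}\frac{\partial n_2}{\partial\mathbf{x}}\right),\qquad \frac{\partial p}{\partial\mathbf{x}}=\frac{\partial p}{\partial n_1}\frac{\partial n_1}{\partial\mathbf{x}}+\frac{\partial p}{\partial n_2}\frac{\partial n_2}{\partial\mathbf{x}}+\frac{\partial p}{\partial T}\frac{\partial T}{\partial\mathbf{x}}. \tag{18}$$

Hence one can express $\partial n_1/\partial\mathbf{x}$ and $\partial n_2/\partial\mathbf{x}$ in terms of $\partial c_1/\partial\mathbf{x}$, $\partial p/\partial\mathbf{x}$ and $\partial T/\partial\mathbf{x}$: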


where 


9% 

9x 

dn^ 

9x 




“9»29x 


A 

^9% 






(19) 


(20) 


In the experimental observation of diffusion in liquids or gases the pressure is
usually kept constant, thus one can take $\partial p/\partial\mathbf{x}=0$ in (19). Substituting (19) in (17),
one has


Pi = 



A I %> I^T_ 



( 21 ) 


Soil _ 9(^1, ^2) 9ci rda^ 2ia^ dp~[ dT 

9x " Sip gx'^LsT 


with similar expressions for $\boldsymbol{\beta}'_2$ and $\partial\alpha_2/\partial\mathbf{x}$.



Equation (16), after inserting the formal expressions of $\boldsymbol{\beta}'_1$, $\boldsymbol{\beta}'_2$, $\partial\alpha_1/\partial\mathbf{x}$ and $\partial\alpha_2/\partial\mathbf{x}$,
becomes

(ui-ui) = (j 3 ( 2 ) 1 9(ai-ga.y) a^dp\]dCi 

^p[ 9«2 ^ 9«.i 3 0(%,W2) 3 \?Ii9w2 %9%/| 9x 

+ ((/»?>-«>)- [%(^>-/!?>) +%(;8f 

Comparing with the usual definition of the coefficients of ordinary diffusion Dq 
and thermal diffusion 


one obtains finally 

n = (i gK-o^2.y) , 1 ^ ^ ^ \ _ r. ^(u _ ^(1)^ ^ (2) _ ^(2)s ] 

(3 0(%,W2) 3\%9n2 n^dnj 0«2 ^ ^^J/’ 


(23) 


Dy — 




^ dT ^ 9 


(24) 


(ai-aj) 


T—p 


2p 


(25) 


In a dilute gas $p=nkT$, one has


(27) 


3. Statistics of the motion of gas molecules 

In the present section it will be shown that one can actually follow the path of 
a single molecule statistically, assuming the real velocity distribution of the medium, 
as a function of x and t, to be known. Advantage will be taken of the particularly 
simple feature in gases of low or moderate density, where the assumptions of mole- 
cular chaos and binary encounters are justified. 

Strictly speaking one can define rigorously the free path only for rigid spheres. 
For any general force law the total cross-section calculated classically is always 
infinite owing to the weak interaction of the distant molecules, while in the quantum 
theory it is finite. For the present purpose we may cut off the interaction beyond 
a reasonable distance with negligible error, and it can then be shown that the final
results $\alpha_1$, $\alpha_2$, etc., are practically independent of the range of interaction which we
have chosen at the beginning, so long as it is not too short (see § 4).

Complicated as the motion of a gas molecule is, it consists of only two kinds of 
events: it either travels freely or suifers a collision, with a certain probability of 
being deflected into a particular range of velocity after collision* Two probability 
functions which govern these two kinds of event will be defined first (in fact, it is 
necessary to determine only one probability, because one of them can be obtained 
from the other): 

(1) - probability that a molecule at x of velocity at time t will 
have suffered a collision in t, dt 

(2) t) = probability that a molecule at x of velocity at time 

t will have been deflected into in t, dt 

It will be shown in appendix A that both $\mu$ and $\chi$ can be calculated for any force
law with finite range.

It is obvious from the definitions of $\mu$ and $\chi$ that $\mu$ is obtainable from $\chi$, thus

$$\mu(\xi, \mathbf{x}, t) = \int \chi(\xi \to \xi', \mathbf{x}, t)\,d\xi'. \tag{28}$$

Both $\mu$ and $\chi$ depend only on the instantaneous velocity of the selected molecule
at time $t$ and not on its history; it does not matter how long the molecule has travelled
with that velocity before reaching $\mathbf{x}$ at $t$. In a gas in equilibrium $\mu$ and $\chi$ are
independent of $\mathbf{x}$ and $t$; $\mu$ is a function of $|\xi|$ only and $\chi$ of $|\xi|$, $|\xi'|$ and $\theta$ only,
$\theta$ being the angle between $\xi$ and $\xi'$.

In dealing with a series of successive flights, i.e. motions between encounters, we
shall denote by $t_i$, $\mathbf{x}^{(i)}$ and $\xi^{(i)}$ the time, position and velocity vectors at the beginning
of the $i$th flight and by $\tau_{ij} = |t_i - t_j|$ the time interval between the $i$th and $j$th
encounters, where the $i$th encounter occurs at the beginning of the $i$th flight.

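Consider first the probability that a molecule at $\mathbf{x}^{(1)}$ with velocity $\xi^{(1)}$ at time $t_1$ will
travel freely for $\tau_{12}$ and suffer a collision in $\tau_{12}$, $d\tau_{12}$. Denoting this probability by
$W(\xi^{(1)}, \mathbf{x}^{(1)}, t_1, \tau_{12})\,d\tau_{12}$, one has

$$W(\xi^{(1)}, \mathbf{x}^{(1)}, t_1, \tau_{12})\,d\tau_{12} = \left\{1 - \int_0^{\tau_{12}} W(\xi^{(1)}, \mathbf{x}^{(1)}, t_1, \tau)\,d\tau\right\}\mu(\xi^{(1)}, \mathbf{x}^{(2)}, t_2)\,d\tau_{12}, \tag{29}$$

where the first factor on the right-hand side gives the probability that it will not
collide before $t_2 = t_1 + \tau_{12}$ and the second factor the probability that it will collide
in $\tau_{12}$, $d\tau_{12}$.

Dividing both sides of the above equation by $\mu(\xi^{(1)}, \mathbf{x}^{(2)}, t_2)$ and differentiating with
respect to $\tau_{12}$, one obtains

$$\frac{\partial}{\partial \tau_{12}}\left\{\frac{W(\xi^{(1)}, \mathbf{x}^{(1)}, t_1, \tau_{12})}{\mu(\xi^{(1)}, \mathbf{x}^{(2)}, t_2)}\right\} = -W(\xi^{(1)}, \mathbf{x}^{(1)}, t_1, \tau_{12}).$$

Hence after integration one has

$$W(\xi^{(1)}, \mathbf{x}^{(1)}, t_1, \tau_{12}) = \mu(\xi^{(1)}, \mathbf{x}^{(2)}, t_2)\exp\left[-\int_0^{\tau_{12}} \mu(\xi^{(1)}, \mathbf{x}^{(1)} + \xi^{(1)}\tau, t_1 + \tau)\,d\tau\right], \tag{30}$$

using the initial condition

$$W(\xi^{(1)}, \mathbf{x}^{(1)}, t_1, 0) = \mu(\xi^{(1)}, \mathbf{x}^{(1)}, t_1).$$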




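One notices here that the probability that a molecule of velocity $\xi^{(1)}$ at $t_1$ will
travel freely for an interval not less than $\tau_{12}$ without collision is according to (29)

$$1 - \int_0^{\tau_{12}} W(\xi^{(1)}, \mathbf{x}^{(1)}, t_1, \tau)\,d\tau = \exp\left[-\int_0^{\tau_{12}} \mu(\xi^{(1)}, \mathbf{x}^{(1)} + \xi^{(1)}\tau, t_1 + \tau)\,d\tau\right]. \tag{31}$$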
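The relation between the flight-time density and the no-collision probability can be checked numerically. The sketch below assumes a purely hypothetical collision frequency that grows linearly along the flight path (`mu_along_path` and its coefficients are illustrative, not taken from the paper), builds the density of the form (30), and compares the two sides of the no-collision relation by trapezoidal integration.

```python
import math

def mu_along_path(tau):
    # Hypothetical collision frequency met by the molecule after flying for a time tau.
    return 2.0 + 0.5 * tau

def integrated_mu(tau):
    # Closed-form integral of mu_along_path from 0 to tau.
    return 2.0 * tau + 0.25 * tau * tau

def flight_density(tau):
    # Density of free-flight times: mu at the end point times the survival factor.
    return mu_along_path(tau) * math.exp(-integrated_mu(tau))

def trapezoid(func, a, b, steps=20000):
    # Plain trapezoidal rule on a uniform grid.
    h = (b - a) / steps
    total = 0.5 * (func(a) + func(b))
    for k in range(1, steps):
        total += func(a + k * h)
    return total * h

tau_12 = 0.8  # assumed flight time, arbitrary units
no_collision_lhs = 1.0 - trapezoid(flight_density, 0.0, tau_12)
no_collision_rhs = math.exp(-integrated_mu(tau_12))
print(no_collision_lhs, no_collision_rhs)
```

The same check works for any positive `mu_along_path` provided `integrated_mu` is its exact integral.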

Next the statistics of a complete flight is considered. (By a complete flight is 
meant the flight of a molecule between two successive collisions. The correct definition 
of the mean free path should be based on this concept. One sees clearly at this point 
the difference between Maxwell’s free path which is in accord with the present 
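definition and Tait's free path which is not.) Letting $F_1(\xi^{(1)}, \mathbf{x}^{(1)}, t_1, \tau_{12})\,d\xi^{(1)}\,d\mathbf{x}^{(1)}\,d\tau_{12}$
be the probability of finding a molecule at $\mathbf{x}^{(1)}$, $d\mathbf{x}^{(1)}$ at time $t_1$ with its velocity in
$\xi^{(1)}$, $d\xi^{(1)}$ which travels freely for $\tau_{12}$ and suffers a collision in $\tau_{12}$, $d\tau_{12}$, one has simply

$$F_1(\xi^{(1)}, \mathbf{x}^{(1)}, t_1, \tau_{12}) = f(\xi^{(1)}, \mathbf{x}^{(1)}, t_1)\,W(\xi^{(1)}, \mathbf{x}^{(1)}, t_1, \tau_{12}), \tag{32}$$

where $f(\xi^{(1)}, \mathbf{x}^{(1)}, t_1)$ is the density in phase space of a single molecule at $\mathbf{x}$, $t$.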

It should be noticed that among the molecules to be found in $\mathbf{x}^{(1)}$, $d\mathbf{x}^{(1)}$ at $t_1$ with
velocity in $\xi^{(1)}$, $d\xi^{(1)}$ some are just being deflected into $\xi^{(1)}$, $d\xi^{(1)}$, but by far the large
majority of them have already acquired a velocity in $\xi^{(1)}$, $d\xi^{(1)}$ before passing merely
through $\mathbf{x}^{(1)}$, $d\mathbf{x}^{(1)}$ at $t_1$. It would be incorrect to identify the average flight of the
above set of molecules after $t_1$ with the free path of a single molecule, and this is just
Tait's free path $l_T$ in the direction of $\xi^{(1)}$,

which for equilibrium state becomes

$$l_T = \overline{\left(\frac{|\xi|}{\mu(\xi)}\right)},$$

where the bar denotes the average with respect to the Maxwellian velocity distribution.

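In order to determine the real free path one has to select from the above set of
molecules those which are just starting their new free paths in $dt_1$. This can be
done by making use of the function $\chi$. The probability of finding a molecule in
$\mathbf{x}^{(1)}$, $d\mathbf{x}^{(1)}$ at time $t_1$ with velocity in $\xi^{(0)}$, $d\xi^{(0)}$ which is deflected into $\xi^{(1)}$, $d\xi^{(1)}$ in $dt_1$
is $f(\xi^{(0)}, \mathbf{x}^{(1)}, t_1)\,\chi(\xi^{(0)} \to \xi^{(1)}, \mathbf{x}^{(1)}, t_1)\,d\xi^{(0)}\,d\mathbf{x}^{(1)}\,d\xi^{(1)}\,dt_1$. Hence the probability that a molecule
with any velocity in $\mathbf{x}^{(1)}$, $d\mathbf{x}^{(1)}$ at time $t_1$ is deflected into $\xi^{(1)}$, $d\xi^{(1)}$ in $dt_1$ is

$$\left\{\int f(\xi^{(0)}, \mathbf{x}^{(1)}, t_1)\,\chi(\xi^{(0)} \to \xi^{(1)}, \mathbf{x}^{(1)}, t_1)\,d\xi^{(0)}\right\} d\mathbf{x}^{(1)}\,d\xi^{(1)}\,dt_1.$$

It can be shown that in gases in equilibrium the above probability reduces to

$$f(\xi^{(1)})\,\mu(\xi^{(1)})\,d\mathbf{x}^{(1)}\,d\xi^{(1)}\,dt_1,$$

and in gases which are not in equilibrium it reduces to

$$\left\{\mu(\xi^{(1)}, \mathbf{x}^{(1)}, t_1) + \frac{\partial}{\partial t_1} + \xi^{(1)} \cdot \frac{\partial}{\partial \mathbf{x}^{(1)}}\right\} f(\xi^{(1)}, \mathbf{x}^{(1)}, t_1)\,d\mathbf{x}^{(1)}\,d\xi^{(1)}\,dt_1.$$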

That this is correct in the equilibrium state can be inferred from the principle of
detailed balancing. For, according to this principle, to every type of collision there
exists the inverse kind occurring with equal frequency, exactly undoing the effect
of the first. We therefore conclude that the probability of finding a molecule with any
velocity at time $t_1$ in $\mathbf{x}^{(1)}$, $d\mathbf{x}^{(1)}$ which is deflected into $\xi^{(1)}$, $d\xi^{(1)}$ in $dt_1$ is equal to the
probability of finding a molecule in $\mathbf{x}^{(1)}$, $d\mathbf{x}^{(1)}$ at time $t_1$ with velocity in $\xi^{(1)}$, $d\xi^{(1)}$
which is deflected into a motion with any velocity in $t_1$, $dt_1$, i.e.

$$\int f(\xi^{(0)})\,\chi(\xi^{(0)} \to \xi^{(1)})\,d\xi^{(0)} = f(\xi^{(1)})\,\mu(\xi^{(1)}). \tag{33}$$
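A discrete toy model makes the detailed-balancing argument concrete. The sketch below is only an illustration under stated assumptions: three hypothetical velocity states, an invented equilibrium occupation `f`, and a symmetric flux matrix `K` (the symmetry is how detailed balancing is imposed). It then checks the discrete analogue of the equilibrium relation, gain into each state equals `f` times the total collision frequency of that state.

```python
# Toy model with three discrete velocity states; all numbers are illustrative.
f = [0.5, 0.3, 0.2]  # assumed equilibrium occupation of each state

# Symmetric collision flux K[i][j] = f[i] * chi[i][j]; the symmetry K[i][j] == K[j][i]
# encodes detailed balancing (every collision type has an equally frequent inverse).
K = [
    [0.00, 0.06, 0.04],
    [0.06, 0.00, 0.03],
    [0.04, 0.03, 0.00],
]

states = range(3)
chi = [[K[i][j] / f[i] for j in states] for i in states]  # deflexion probabilities
mu = [sum(chi[i]) for i in states]                        # total collision frequency per state

# Gain: molecules of any state deflected into state j per unit time.
gain = [sum(f[i] * chi[i][j] for i in states) for j in states]
# Loss: molecules in state j deflected into any other state per unit time.
loss = [f[j] * mu[j] for j in states]

print(gain)
print(loss)
```

If `K` is made asymmetric the two lists generally differ, which corresponds to a distribution that is no longer stationary under collisions.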


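For the non-equilibrium state the distribution function $f(\xi^{(1)}, \mathbf{x}^{(1)}, t_1)$ changes with
time so that the set of molecules in $\mathbf{x}^{(1)}$, $d\mathbf{x}^{(1)}$ with velocity in $\xi^{(1)}$, $d\xi^{(1)}$ increases in time
$dt_1$ by the amount

$$\left\{\frac{\partial}{\partial t_1} + \xi^{(1)} \cdot \frac{\partial}{\partial \mathbf{x}^{(1)}}\right\} f(\xi^{(1)}, \mathbf{x}^{(1)}, t_1)\,d\xi^{(1)}\,d\mathbf{x}^{(1)}\,dt_1,$$

which is just the extra term given above.

An alternative proof which may clarify the situation still further is as follows:

Let $P_1(\xi^{(1)}, \mathbf{x}^{(1)}, t_1, \tau_{12})\,d\mathbf{x}^{(1)}\,d\xi^{(1)}\,dt_1\,d\tau_{12}$ be the probability of finding a molecule
with any velocity in $\mathbf{x}^{(1)}$, $d\mathbf{x}^{(1)}$ at time $t_1$ which is deflected into $\xi^{(1)}$, $d\xi^{(1)}$ in $dt_1$, travels
freely for $\tau_{12}$ and collides in $\tau_{12}$, $d\tau_{12}$. Then it is seen that the number of the set of
molecules $F_1(\xi^{(1)}, \mathbf{x}^{(1)}, t_1, \tau_{12})\,d\mathbf{x}^{(1)}\,d\xi^{(1)}\,d\tau_{12}$ must be equal to the integrated sum for all
values of $t_0$ of the number of molecules starting in $\mathbf{x}^{(0)}$, $d\mathbf{x}^{(0)}$ ($\mathbf{x}^{(0)} = \mathbf{x}^{(1)} - \xi^{(1)}\tau_{01}$) with
velocity in $\xi^{(1)}$, $d\xi^{(1)}$, passing through $\mathbf{x}^{(1)}$, $d\mathbf{x}^{(1)}$ in $dt_1$, and arriving at $\mathbf{x}^{(2)}$, $d\mathbf{x}^{(2)}$ in
$t_2$, $dt_2$. (The subscript 0 is used to indicate events prior to the instant $t_1$.) Therefore

$$d\mathbf{x}^{(0)}\,d\xi^{(1)}\,d\tau_{02}\int P_1(\xi^{(1)}, \mathbf{x}^{(0)}, t_0, \tau_{02})\,dt_0 = F_1(\xi^{(1)}, \mathbf{x}^{(1)}, t_1, \tau_{12})\,d\tau_{12}\,d\mathbf{x}^{(1)}\,d\xi^{(1)},$$

where the integration is to be carried out over all $t_0$. If the time intervals are
counted from $t_2$, one can replace $dt_0$, $d\tau_{02}$ and $d\tau_{12}$ by $d\tau_{02}$, $dt_2$ and $dt_2$ respectively.
Thus

$$d\mathbf{x}^{(0)}\,d\xi^{(1)}\,dt_2\int_{\tau_{12}}^{\infty} P_1(\xi^{(1)}, \mathbf{x}^{(2)} - \xi^{(1)}\tau_{02}, t_2 - \tau_{02}, \tau_{02})\,d\tau_{02} = F_1(\xi^{(1)}, \mathbf{x}^{(2)} - \xi^{(1)}\tau_{12}, t_2 - \tau_{12}, \tau_{12})\,d\xi^{(1)}\,d\mathbf{x}^{(1)}\,dt_2.$$

The volume elements $d\mathbf{x}^{(0)}$, $d\mathbf{x}^{(1)}$ can be chosen equal. After differentiating both sides
with respect to $\tau_{12}$, one obtains

$$-P_1(\xi^{(1)}, \mathbf{x}^{(2)} - \xi^{(1)}\tau_{12}, t_2 - \tau_{12}, \tau_{12}) = \frac{\partial}{\partial \tau_{12}} F_1(\xi^{(1)}, \mathbf{x}^{(2)} - \xi^{(1)}\tau_{12}, t_2 - \tau_{12}, \tau_{12})$$

$$= \frac{\partial}{\partial \tau_{12}}\left\{f(\xi^{(1)}, \mathbf{x}^{(2)} - \xi^{(1)}\tau_{12}, t_2 - \tau_{12})\,\mu(\xi^{(1)}, \mathbf{x}^{(2)}, t_2)\exp\left[-\int_0^{\tau_{12}} \mu(\xi^{(1)}, \mathbf{x}^{(2)} - \xi^{(1)}\tau, t_2 - \tau)\,d\tau\right]\right\}.$$

Hence

$$P_1(\xi^{(1)}, \mathbf{x}^{(1)}, t_1, \tau_{12}) = \left\{\mu(\xi^{(1)}, \mathbf{x}^{(1)}, t_1) + \frac{\partial}{\partial t_1} + \xi^{(1)} \cdot \frac{\partial}{\partial \mathbf{x}^{(1)}}\right\} f(\xi^{(1)}, \mathbf{x}^{(1)}, t_1)$$

$$\times\ \mu(\xi^{(1)}, \mathbf{x}^{(2)}, t_2)\exp\left[-\int_0^{\tau_{12}} \mu(\xi^{(1)}, \mathbf{x}^{(1)} + \xi^{(1)}\tau, t_1 + \tau)\,d\tau\right]. \tag{34}$$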




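By definition of $\chi$ and with the help of (34), one has

$$\int f(\xi^{(0)}, \mathbf{x}^{(1)}, t_1)\,\chi(\xi^{(0)} \to \xi^{(1)}, \mathbf{x}^{(1)}, t_1)\,d\xi^{(0)} = \int_0^{\infty} P_1(\xi^{(1)}, \mathbf{x}^{(1)}, t_1, \tau_{12})\,d\tau_{12}$$

$$= \left\{\mu(\xi^{(1)}, \mathbf{x}^{(1)}, t_1) + \frac{\partial}{\partial t_1} + \xi^{(1)} \cdot \frac{\partial}{\partial \mathbf{x}^{(1)}}\right\} f(\xi^{(1)}, \mathbf{x}^{(1)}, t_1), \tag{35}$$

which is the relation to be proved.

It should be added that equation (35) is merely another form of Maxwell and
Boltzmann's equation. For using (28) one has

$$\left\{\frac{\partial}{\partial t_1} + \xi^{(1)} \cdot \frac{\partial}{\partial \mathbf{x}^{(1)}}\right\} f(\xi^{(1)}, \mathbf{x}^{(1)}, t_1) = \int\left\{f(\xi^{(0)}, \mathbf{x}^{(1)}, t_1)\,\chi(\xi^{(0)} \to \xi^{(1)}, \mathbf{x}^{(1)}, t_1) - f(\xi^{(1)}, \mathbf{x}^{(1)}, t_1)\,\chi(\xi^{(1)} \to \xi^{(0)}, \mathbf{x}^{(1)}, t_1)\right\} d\xi^{(0)}. \tag{36}$$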

Having obtained $P_1(\xi^{(1)}, \mathbf{x}^{(1)}, t_1, \tau_{12})$ in (34), one can define the real free path $l_M$
for the direction $\xi^{(1)}$ as

IJ Pi(?(W,X«, «„ri2) 

ijif = Yj. . 

For the equilibrium state it becomes the usual Maxwellian free path

$$l_M = \frac{\overline{|\xi|}}{\overline{\mu(\xi)}},$$

where the bars denote the average with respect to the Maxwellian velocity distribution.
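The difference between the two free paths can be made quantitative for rigid spheres. The sketch below is an illustration under stated assumptions: it uses the rigid-sphere collision-frequency function $E(y)$ in the form quoted in appendix C, takes Tait's path as the Maxwellian average of speed over collision frequency and Maxwell's path as the ratio of the two averages (the usual textbook definitions, assumed here), and expresses both in units of $1/(\pi n \sigma^2)$. The helper names and the cut-off `Y_MAX` are choices made for this example.

```python
import math

def collision_function(y):
    # E(y) = exp(-y^2) + (2y + 1/y) * integral_0^y exp(-x^2) dx, with the limit E(0) = 2.
    if y == 0.0:
        return 2.0
    integral = 0.5 * math.sqrt(math.pi) * math.erf(y)
    return math.exp(-y * y) + (2.0 * y + 1.0 / y) * integral

def simpson(func, a, b, steps=4000):
    # Composite Simpson rule; steps must be even.
    h = (b - a) / steps
    total = func(a) + func(b)
    for k in range(1, steps):
        total += (4 if k % 2 else 2) * func(a + k * h)
    return total * h / 3.0

def maxwell_weight(y):
    # Normalised Maxwellian distribution of the reduced speed y = |xi| / (2kT/m)^(1/2).
    return 4.0 / math.sqrt(math.pi) * y * y * math.exp(-y * y)

Y_MAX = 8.0  # assumed cut-off; the Maxwellian tail beyond it is negligible

mean_speed = simpson(lambda y: maxwell_weight(y) * y, 0.0, Y_MAX)
mean_frequency = simpson(lambda y: maxwell_weight(y) * collision_function(y), 0.0, Y_MAX)
mean_ratio = simpson(lambda y: maxwell_weight(y) * y / collision_function(y), 0.0, Y_MAX)

# Free paths in units of 1 / (pi * n * sigma^2); the collision frequency itself is
# n * sigma^2 * (2 pi kT / m)^(1/2) * E(y), which supplies the factor sqrt(pi).
maxwell_path = math.sqrt(math.pi) * mean_speed / mean_frequency
tait_path = math.sqrt(math.pi) * mean_ratio

print(maxwell_path, tait_path)
```

Tait's value comes out a few per cent below Maxwell's, because fast molecules, which collide more often, are weighted differently in the two averages.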

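The extension from a single flight to a series of successive flights is immediate.
For two steps one defines

$$P_2(\xi^{(1)}, \mathbf{x}^{(1)}, t_1, \tau_{12};\ \xi^{(2)}, \mathbf{x}^{(2)}, t_2, \tau_{23})\,d\mathbf{x}^{(1)}\,d\xi^{(1)}\,dt_1\,d\xi^{(2)}\,dt_2\,d\tau_{23}$$

as the probability of finding at $t_1$ in $\mathbf{x}^{(1)}$, $d\mathbf{x}^{(1)}$ a molecule with any velocity which is deflected into
$\xi^{(1)}$, $d\xi^{(1)}$ in $t_1$, $dt_1$, travels freely for an interval $\tau_{12}$, is deflected again into $\xi^{(2)}$, $d\xi^{(2)}$
in $t_2$, $dt_2$, travels freely for another interval $\tau_{23}$ and finally collides in $\tau_{23}$, $d\tau_{23}$. Then
one can write

$$P_2(\xi^{(1)}, \mathbf{x}^{(1)}, t_1, \tau_{12};\ \xi^{(2)}, \mathbf{x}^{(2)}, t_2, \tau_{23})\,d\mathbf{x}^{(1)}\,d\xi^{(1)}\,dt_1\,d\xi^{(2)}\,dt_2\,d\tau_{23}$$

$$= \left\{\left(\mu(\xi^{(1)}, \mathbf{x}^{(1)}, t_1) + \frac{\partial}{\partial t_1} + \xi^{(1)} \cdot \frac{\partial}{\partial \mathbf{x}^{(1)}}\right) f(\xi^{(1)}, \mathbf{x}^{(1)}, t_1)\,d\mathbf{x}^{(1)}\,d\xi^{(1)}\,dt_1\right\}$$

$$\times \left\{\exp\left[-\int_0^{\tau_{12}} \mu(\xi^{(1)}, \mathbf{x}^{(1)} + \xi^{(1)}\tau, t_1 + \tau)\,d\tau\right]\chi(\xi^{(1)} \to \xi^{(2)}, \mathbf{x}^{(2)}, t_2)\,d\xi^{(2)}\,dt_2\right\}$$

$$\times \left\{\exp\left[-\int_0^{\tau_{23}} \mu(\xi^{(2)}, \mathbf{x}^{(2)} + \xi^{(2)}\tau, t_2 + \tau)\,d\tau\right]\mu(\xi^{(2)}, \mathbf{x}^{(3)}, t_3)\,d\tau_{23}\right\}, \tag{37}$$

where the first factor gives the probability of finding a molecule in $\mathbf{x}^{(1)}$, $d\mathbf{x}^{(1)}$ with velocity
in $\xi^{(1)}$, $d\xi^{(1)}$ starting its free path in $dt_1$, the second factor gives the probability
that it will travel freely for $\tau_{12}$ and be deflected into $\xi^{(2)}$, $d\xi^{(2)}$ in $dt_2$, and the last factor
gives the probability that it will travel again for $\tau_{23}$ and suffer a collision in $d\tau_{23}$.
By making use of the properties of $\chi$, (28) and (36), it can be shown that

$$\iint P_2(12)\,d\xi^{(2)}\,d\tau_{23} = P_1(1), \qquad \iint P_2(12)\,d\xi^{(1)}\,d\tau_{12} = P_1(2),$$

where $P_1(2)$ and $P_2(12)$ are used to denote

$$P_1(\xi^{(2)}, \mathbf{x}^{(2)}, t_2, \tau_{23}) \quad\text{and}\quad P_2(\xi^{(1)}, \mathbf{x}^{(1)}, t_1, \tau_{12};\ \xi^{(2)}, \mathbf{x}^{(2)}, t_2, \tau_{23})$$

respectively.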

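Similarly, the probability function of $N$ successive flights can be written down as

$$P_N(\xi^{(1)}, \mathbf{x}^{(1)}, t_1, \tau_{12};\ \ldots;\ \xi^{(N)}, \mathbf{x}^{(N)}, t_N, \tau_{N,N+1})\,d\mathbf{x}^{(1)}\,d\xi^{(1)}\,dt_1 \cdots d\xi^{(N)}\,dt_N\,d\tau_{N,N+1}$$

$$= P_N(1, 2, \ldots, N)\,d\mathbf{x}^{(1)}\,d\xi^{(1)}\,dt_1 \cdots d\xi^{(N)}\,dt_N\,d\tau_{N,N+1}$$

$$= \left\{\left(\mu(\xi^{(1)}, \mathbf{x}^{(1)}, t_1) + \frac{\partial}{\partial t_1} + \xi^{(1)} \cdot \frac{\partial}{\partial \mathbf{x}^{(1)}}\right) f(\xi^{(1)}, \mathbf{x}^{(1)}, t_1)\,d\mathbf{x}^{(1)}\,d\xi^{(1)}\,dt_1\right\}$$

$$\times \prod_{i=1}^{N-1}\left\{\exp\left[-\int_0^{\tau_{i,i+1}} \mu(\xi^{(i)}, \mathbf{x}^{(i)} + \xi^{(i)}\tau, t_i + \tau)\,d\tau\right]\chi(\xi^{(i)} \to \xi^{(i+1)}, \mathbf{x}^{(i+1)}, t_{i+1})\,d\xi^{(i+1)}\,dt_{i+1}\right\}$$

$$\times\ \left(-\frac{\partial}{\partial \tau_{N,N+1}}\right)\left\{\exp\left[-\int_0^{\tau_{N,N+1}} \mu(\xi^{(N)}, \mathbf{x}^{(N)} + \xi^{(N)}\tau, t_N + \tau)\,d\tau\right]\right\} d\tau_{N,N+1}. \tag{38}$$

As expected the following relations are automatically satisfied by $P_N(1, 2, \ldots, N)$:

$$\iint P_N(1, 2, \ldots, N)\,d\xi^{(N)}\,d\tau_{N,N+1} = P_{N-1}(1, 2, \ldots, N-1),$$

$$\iint P_N(1, 2, \ldots, N)\,d\xi^{(1)}\,d\tau_{12} = P_{N-1}(2, 3, \ldots, N),$$

or more generally

$$\int\cdots\int P_N(1, 2, \ldots, N)\,d\xi^{(1)}\,d\tau_{12} \cdots d\xi^{(r)}\,d\tau_{r,r+1}\ d\xi^{(s)}\,d\tau_{s,s+1} \cdots d\xi^{(N)}\,d\tau_{N,N+1} = P_{s-r-1}(r+1, \ldots, s-1) \quad (s - r \geq 2).$$

For the equilibrium state, $P_N$ reduces to

$$P_N(1, 2, \ldots, N)\,d\mathbf{x}^{(1)}\,d\xi^{(1)}\,dt_1 \cdots d\xi^{(N)}\,dt_N\,d\tau_{N,N+1} = f(\xi^{(1)})\,\mu(\xi^{(1)})\,d\mathbf{x}^{(1)}\,d\xi^{(1)}\,dt_1$$

$$\times \prod_{i=1}^{N-1}\left\{\exp\left[-\mu(\xi^{(i)})\,\tau_{i,i+1}\right]\chi(\xi^{(i)} \to \xi^{(i+1)})\,d\xi^{(i+1)}\,dt_{i+1}\right\} \times \left(-\frac{\partial}{\partial \tau_{N,N+1}}\right)\left\{\exp\left[-\mu(\xi^{(N)})\,\tau_{N,N+1}\right]\right\} d\tau_{N,N+1}. \tag{39}$$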

4. Self-diffusion in gases

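Having found $P_N(1, 2, \ldots, N)$ in (39) for a gas in equilibrium, one can calculate
the mean time of flight $\Delta t$ and mean square displacement $\overline{(\Delta x)^2}$ for any number $N$
of successive flights. To conform with the condition imposed on $\Delta t$ in the last section,
we choose a large number of flights so that $\overline{(\Delta x)^2}/\Delta t$ approaches a plateau value.
It can be shown that

$$\Delta t = \left\langle \sum_{i=1}^{N} \tau_{i,i+1} \right\rangle = N\langle \tau_{12} \rangle = \frac{N}{\bar{\mu}}, \tag{40}$$

where

$$\bar{\mu} = \frac{1}{n}\int f(1)\,\mu(1)\,d\xi^{(1)}$$

and $n$ is the number density; the angular bracket $\langle\ \rangle$ is used to denote the average
with respect to $P_N$.

To prove (40), one has to calculate, using (39) and (33),

$$\langle \tau_{i,i+1} \rangle = \frac{\int\cdots\int P_i(1, 2, \ldots, i)\,\tau_{i,i+1}\,d\xi^{(1)}\,d\tau_{12} \cdots}{\int\cdots\int P_i(1, 2, \ldots, i)\,d\xi^{(1)}\,d\tau_{12} \cdots} = \frac{1}{n\bar{\mu}}\int\cdots\int f(1)\,\frac{\chi(12) \cdots \chi(i-1, i)}{\mu(2) \cdots \mu(i)}\,d\xi^{(1)} \cdots d\xi^{(i)} = \frac{1}{\bar{\mu}},$$

where

$$\chi(12) = \chi(\xi^{(1)} \to \xi^{(2)}).$$

Let $\mathbf{l}^{(i)} = \xi^{(i)}\tau_{i,i+1}$; then the mean displacement $\overline{\Delta x} = \left\langle \sum_{i=1}^{N} \mathbf{l}^{(i)} \right\rangle$ obviously
vanishes by symmetry. The mean-square displacement $\overline{(\Delta x)^2} = \left\langle \left(\sum_{i=1}^{N} \mathbf{l}^{(i)}\right)^2 \right\rangle$ will next
be shown to be

$$\overline{(\Delta x)^2} = \frac{2N}{n\bar{\mu}}\left\{\int f(1)\,\frac{(\xi^{(1)})^2}{\mu(1)}\,d\xi^{(1)} + \frac{N-1}{N}\iint f(1)\,\frac{\chi(12)}{\mu(1)\mu(2)}\,\xi^{(1)} \cdot \xi^{(2)}\,d\xi^{(1)}\,d\xi^{(2)} + \frac{N-2}{N}\iiint f(1)\,\frac{\chi(12)\chi(23)}{\mu(1)\mu(2)\mu(3)}\,\xi^{(1)} \cdot \xi^{(3)}\,d\xi^{(1)}\,d\xi^{(2)}\,d\xi^{(3)} + \ldots\right\}. \tag{41}$$

We have identified $\left\langle \sum \mathbf{l}^{(i)} \right\rangle$ and $\left\langle \left(\sum \mathbf{l}^{(i)}\right)^2 \right\rangle$ obtained from $P_N$ with $\overline{\Delta x}$ and
$\overline{(\Delta x)^2}$ from $\psi$ without comment. In fact they are really slightly different; while in
the definition of $P_N$ the first flight definitely starts just after the last collision, the
function $\psi$ only specifies the probability of a displacement from a given initial
position which may be any intermediate stage in a free flight. However, such difference
becomes negligible when one considers what happens in a large number of free
flights.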

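The meaning of the successive terms, apart from the constant coefficients
$(N-1)/N$, $(N-2)/N$, etc., in the bracket, in (41) is this: the first term inside the bracket
is the main contribution from each individual step, the second term results from the
correlation between two successive steps, the third term from the correlation
between two steps separated by one flight, etc. Physically it is clear that such correlation
decreases rapidly for each step, and the above series is therefore a rapidly
convergent one.

To prove (41), one writes

$$\left\langle \left(\sum_{i=1}^{N} \mathbf{l}^{(i)}\right)^2 \right\rangle = \sum_{i=1}^{N} \langle (\mathbf{l}^{(i)})^2 \rangle + 2\sum_{i=1}^{N-1} \langle \mathbf{l}^{(i)} \cdot \mathbf{l}^{(i+1)} \rangle + 2\sum_{i=1}^{N-2} \langle \mathbf{l}^{(i)} \cdot \mathbf{l}^{(i+2)} \rangle + \ldots. \tag{42}$$

So, using (39) and (33), one has

$$\langle (\mathbf{l}^{(i)})^2 \rangle = \frac{\int\cdots\int P_i(1, 2, \ldots, i)\,(\mathbf{l}^{(i)})^2\,d\xi^{(1)}\,d\tau_{12} \cdots}{\int\cdots\int P_i(1, 2, \ldots, i)\,d\xi^{(1)}\,d\tau_{12} \cdots} = \frac{2}{n\bar{\mu}}\int f(1)\,\frac{(\xi^{(1)})^2}{\mu(1)}\,d\xi^{(1)},$$

$$\langle \mathbf{l}^{(i)} \cdot \mathbf{l}^{(i+k)} \rangle = \frac{\int\cdots\int P_{i+k}(1, 2, \ldots, i+k)\,(\mathbf{l}^{(i)} \cdot \mathbf{l}^{(i+k)})\,d\xi^{(1)}\,d\tau_{12} \cdots}{\int\cdots\int P_{i+k}(1, 2, \ldots, i+k)\,d\xi^{(1)}\,d\tau_{12} \cdots}$$

$$= \frac{1}{n\bar{\mu}}\int\cdots\int f(i)\,\frac{\chi(i, i+1) \cdots \chi(i+k-1, i+k)}{\mu(i) \cdots \mu(i+k)}\,\xi^{(i)} \cdot \xi^{(i+k)}\,d\xi^{(i)} \cdots d\xi^{(i+k)}$$

$$= \frac{1}{n\bar{\mu}}\int\cdots\int f(1)\,\frac{\chi(12) \cdots \chi(k, k+1)}{\mu(1) \cdots \mu(k+1)}\,\xi^{(1)} \cdot \xi^{(k+1)}\,d\xi^{(1)} \cdots d\xi^{(k+1)}. \tag{43}$$

The above expression will give for $k = 1, 2, \ldots$ the second, third, etc., terms in
(42). Equation (41) follows therefore from (42) and (43). From (40) and (41), it is
seen that for large $N$ the above ratio of the mean square displacement to the time
of travel approaches a plateau value, and the coefficient of self-diffusion is therefore

$$D_s = \frac{1}{6}\frac{\overline{(\Delta x)^2}}{\Delta t} = \frac{1}{3}\int \frac{f(1)}{n}\frac{(\xi^{(1)})^2}{\mu(1)}\,d\xi^{(1)} + \frac{1}{3}\iint \frac{f(1)}{n}\frac{\chi(12)}{\mu(1)\mu(2)}\,\xi^{(1)} \cdot \xi^{(2)}\,d\xi^{(1)}\,d\xi^{(2)}$$

$$+ \frac{1}{3}\iiint \frac{f(1)}{n}\frac{\chi(12)\chi(23)}{\mu(1)\mu(2)\mu(3)}\,\xi^{(1)} \cdot \xi^{(3)}\,d\xi^{(1)}\,d\xi^{(2)}\,d\xi^{(3)} + \ldots. \tag{44}$$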

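The above calculation refers to pure self-diffusion in an equilibrium gas consisting
of only one kind of molecules. The extension to an equilibrium gas mixture is immediate.
Consider a binary gas mixture in equilibrium. Let $n_1$ and $n_2$ be the number
densities of the two kinds of molecules, $f_1(i)$ ($= f_1(\xi_1^{(i)})$) and $f_2(i)$ ($= f_2(\xi_2^{(i)})$) the
densities in their molecular phase spaces, and $\mu_1(i)$ ($= \mu_1(\xi_1^{(i)})$) and $\mu_2(i)$ ($= \mu_2(\xi_2^{(i)})$)
the total collision frequencies of a molecule of the first kind with velocity $\xi_1^{(i)}$ and of
a molecule of the second kind with velocity $\xi_2^{(i)}$ respectively. One has

$$\mu_1(i) = \mu_{11}(i) + \mu_{12}(i), \tag{45}$$

where $\mu_{11}(i)$ represents collision between molecules of the first kind and $\mu_{12}(i)$
between molecules of the first kind and second kind. Using the same type of subscript,
we have for the deflexion probability $\chi$

$$\chi_1(i, i+1) = \chi_{11}(i, i+1) + \chi_{12}(i, i+1). \tag{46}$$

We can then write, for the coefficient of self-diffusion of the first kind of molecules

$$D_{s1} = \frac{1}{3}\int \frac{f_1(1)}{n_1}\frac{(\xi_1^{(1)})^2}{\mu_1(1)}\,d\xi_1^{(1)} + \frac{1}{3}\iint \frac{f_1(1)}{n_1}\frac{\chi_1(12)}{\mu_1(1)\mu_1(2)}\,\xi_1^{(1)} \cdot \xi_1^{(2)}\,d\xi_1^{(1)}\,d\xi_1^{(2)}$$

$$+ \frac{1}{3}\iiint \frac{f_1(1)}{n_1}\frac{\chi_1(12)\chi_1(23)}{\mu_1(1)\mu_1(2)\mu_1(3)}\,\xi_1^{(1)} \cdot \xi_1^{(3)}\,d\xi_1^{(1)}\,d\xi_1^{(2)}\,d\xi_1^{(3)} + \ldots. \tag{47}$$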




It is interesting to note that while in the usual gas theory the coefficient of self-diffusion
is obtained from the coefficient of ordinary diffusion in a gas mixture by
making the two different types of molecules identical, a rather indirect way to achieve
the purpose, the present theory enables it to be calculated directly from a Maxwellian
distribution. In particular, while in the usual gas theory to calculate the coefficient
of self-diffusion in a binary mixture one has to consider a ternary mixture which is
much more complicated, the present theory is able to deal with a gas mixture in
quite a similar way, as for a simple gas.

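In order to see the meaning of the various terms in (44) more clearly we shall
rewrite it as follows:

$$\overline{(\Delta x)^2} = \left\langle \left(\sum_{i=1}^{N} \mathbf{l}^{(i)}\right)^2 \right\rangle = \sum_{i=1}^{N} \langle (\mathbf{l}^{(i)})^2 \rangle + \sum_{i=1}^{N-1} \langle 2\,\mathbf{l}^{(i)} \cdot \mathbf{l}^{(i+1)} \rangle + \sum_{i=1}^{N-2} \langle 2\,\mathbf{l}^{(i)} \cdot \mathbf{l}^{(i+2)} \rangle + \ldots$$

$$= N\langle (\mathbf{l}^{(1)})^2 \rangle + (N-1)\langle 2\,\mathbf{l}^{(1)} \cdot \mathbf{l}^{(2)} \rangle + (N-2)\langle 2\,\mathbf{l}^{(1)} \cdot \mathbf{l}^{(3)} \rangle + \ldots,$$

$$D_s = \frac{1}{6}\frac{\overline{(\Delta x)^2}}{\Delta t} = \frac{\langle (\mathbf{l}^{(1)})^2 \rangle}{6\langle \tau_{12} \rangle}\left[1 + \frac{\langle 2\,\mathbf{l}^{(1)} \cdot \mathbf{l}^{(2)} \rangle}{\langle (\mathbf{l}^{(1)})^2 \rangle} + \frac{\langle 2\,\mathbf{l}^{(1)} \cdot \mathbf{l}^{(3)} \rangle}{\langle (\mathbf{l}^{(1)})^2 \rangle} + \ldots\right]. \tag{48}$$

Equation (48) shows clearly that the various terms of the series merely take into
account the effect of the persistence of motion on the rate of diffusion in a rigorous
way.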
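The approach of the ratio of mean square displacement to travel time towards a plateau can be illustrated with a small calculation. The sketch below assumes, purely for illustration, that the correlation between flights falls off geometrically, with the ratio of $\langle 2\,\mathbf{l}^{(1)} \cdot \mathbf{l}^{(1+k)} \rangle$ to $\langle (\mathbf{l}^{(1)})^2 \rangle$ equal to $p^k$; the value of `persistence`, the unit step length and the unit mean flight time are hypothetical choices, not results of the paper.

```python
def mean_square_displacement(n_flights, step_sq, correlations):
    # N<l^2> + sum over k of (N - k) * <2 l(1).l(1+k)>;
    # correlations[k - 1] holds <l(1).l(1+k)> for k = 1, 2, ...
    total = n_flights * step_sq
    for k, corr in enumerate(correlations, start=1):
        if k >= n_flights:
            break  # no pairs of flights are separated by k or more steps
        total += (n_flights - k) * 2.0 * corr
    return total

def diffusion_estimate(n_flights, mean_time, step_sq, correlations):
    # One sixth of the mean square displacement divided by the total time N<tau>.
    msd = mean_square_displacement(n_flights, step_sq, correlations)
    return msd / (6.0 * n_flights * mean_time)

persistence = 0.4   # assumed geometric ratio p (illustrative)
step_sq = 1.0       # <l^2> in arbitrary units
mean_time = 1.0     # <tau_12> in arbitrary units
correlations = [0.5 * step_sq * persistence ** k for k in range(1, 61)]

d_one = diffusion_estimate(1, mean_time, step_sq, correlations)
d_large = diffusion_estimate(10000, mean_time, step_sq, correlations)
d_plateau = step_sq / (6.0 * mean_time * (1.0 - persistence))  # summed geometric series

print(d_one, d_large, d_plateau)
```

With a single flight only the first term survives; for many flights the estimate settles on the summed series, which is the plateau value referred to in the text.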

To see that the value of $D_s$ is not appreciably changed by choosing different ranges
of interaction for any general force law between molecules when the chosen range is 
not too small, we rewrite (48) in the form 



<21(«.1®)> <21W.1®>> 


(49) 


Increasing the range of interaction will tend to reduce the length of free path and
mean life of free flight but leave their ratio, the average speed, almost unchanged.
This is approximately the factor $\langle (\mathbf{l}^{(1)})^2 \rangle^{1/2} / \langle \tau_{12} \rangle$ in (49). As to the series inside the bracket
in (49), its sum is almost constant for a sufficiently long range of interaction, though
the rate of convergence does depend on this choice of range of interaction. The exact
nature of the convergence cannot be discussed until $\mu$ and $\chi$ have been calculated
for some particular force law, but qualitatively one can see that as the range of
interaction is increased, the first term tends to decrease, but then a small angular
deflexion between successive flights becomes more probable, hence the second term
increases, etc. A convenient limit for the range of interaction can be set up as follows:
it must be larger than two to three times the molecular diameter found, for example,
from measurements of the second virial coefficient in gases, but it must not extend
up to a distance comparable with the average distance between the molecules corresponding
to the density of the gas considered. Formula (44) is valid so long as these
two limits do not come close to each other, which is always so in gases. Choosing
a high limit to the range of interaction will give very slightly more accurate results,
but this is entirely offset by the slowness of the convergence of the series. Numerical
calculations of the coefficient of self-diffusion for rigid sphere gas molecules are shown
in appendix C.


5. Ordinary and thermal diffusion in gas mixtures

It has been found possible in § 2 to express the mutual diffusion velocity in non-uniform
fluids in terms of the first and second moments of certain transition probabilities.
It has also been shown in § 3 that in non-equilibrium gases one can follow
the future course of a chosen molecule provided the velocity distribution of the
medium is known. The essential point is that the statistics of the motion of gas
molecules is based on the knowledge of the velocity distribution throughout the
medium. The mutual diffusion velocity according to (22) with $\beta$'s and $\alpha$'s calculated
from $P_N$ (38) should be, and as shown in appendix B, is indeed equal to that calculated
directly from the local velocity distribution, i.e.

(50)

where $f_1^{(1)}$ and $f_2^{(1)}$ are the deviations from the Maxwellian velocity distribution
due to the presence of the first-order gradients of densities, temperature, etc., of
molecules of the first and second kind respectively.

Nevertheless, the present calculation shows the intrinsically different natures of 
self-diffusion on the one hand and mutual diffusion on the other; the former is simply 
a kind of Brownian motion, while the latter is entirely caused by the non-uniformity 
of the physical parameters. Though in non-uniform gas mixtures self-diffusion still
goes on, it has no effect on the diffusion velocity which can be pictured as set up by 
forces arising from the local gradients of densities, temperature, etc. In order to 
obtain an exact value of the mutual diffusion velocity it is indispensable to consider 
the local deviation from Maxwellian velocity distribution consistent with the local 
gradients of the physical parameters. 

When the $\beta$'s are neglected in (22), the remaining part is seen to correspond to
the free-path theory. Though the present theory which takes account of the persistence
of successive flights rigorously is much more refined than the old free-path
theory, they are both based on the same assumption that molecules which collide
during each element of time have a distribution after collision of the Maxwellian
type. Mutual diffusion, though entirely different from self-diffusion in nature, can
thus be approximately expressed in terms of self-diffusion. Equation (22) becomes
when the $\beta$'s are neglected



a^dnA\ 

Z\\dxdn^^dx ^ 0x dTj \n^ dx~n^dxji‘ 


(51)


With the aid of this equation both ordinary and thermal diffusion receive simple
explanations. The $\alpha_1$ and $\alpha_2$ appearing in (51) measure the tendency to diffuse for
the two kinds of molecules. Mutual diffusion can be seen from (51) to arise in two ways.
First, the number of molecules diffusing in one direction is not equal to that in the
opposite direction, and secondly, the tendency for diffusion varies in different ways
with $n_1$, $n_2$ and $T$ for the two kinds of molecules. Equations (26) and (27) become,
when the $\beta$'s are neglected,


I>o = 

Dy = 



(52) 


Thus $D_0$ consists of contributions from both of the two processes described above
under constant temperature and pressure, while $D_T$ arises only from the second
process under constant pressure. It may be mentioned here that the old free-path
theory does not explain thermal diffusion because there only the first effect is taken
into account. Meyer's diffusion coefficient (cf. Jeans 1925) corresponds to


D„ = 



(53) 


Fürth's explanation of thermal diffusion (Fürth 1942) corresponds to the second
equation in (52).

It should be mentioned that the neglect of the $\beta$'s is no more justified than the
whole procedure of the old free-path theory. For gas mixtures in which one component
is comparatively rare, equation (52) would give good results. This suggests that a
corresponding approximate theory exists in the liquid region which can be obtained 
without any knowledge of the deviation from Maxwellian velocity distribution, 
provided the coefficient of self-diffusion in an equilibrium liquid mixture is known. 
This is now under investigation. 


The author wishes to express his thanks to Professor M. Born for his constant
interest and encouragement, and to Dr H. S. Green for many helpful suggestions 
and discussions and for reading the manuscript. 


Appendix A. Calculation of $\mu$ and $\chi$ for special models

As the collision frequency $\mu$ and the deflexion probability $\chi$ are independent of
the frame of reference chosen, we shall assume a frame of reference moving with the
local mass motion velocity $\mathbf{u}$. From (45) and (46), it is seen that each of $\mu_1(1)$ and
$\chi_1(12)$ consists of two parts. It is obviously sufficient to calculate $\mu_{12}(1)$ and $\chi_{12}(12)$.
We shall denote by $\mathbf{v}_1^{(1)}$ and $\mathbf{v}_1^{(2)}$ the velocities of a molecule of the first kind before and
after a collision respectively and $\mathbf{v}_2^{(1)}$ and $\mathbf{v}_2^{(2)}$ those of a molecule of the second kind.
Further, each of $\mu_{12}(1)$ and $\chi_{12}(12)$ can be expanded

$$\mu_{12}(1) = \mu_{12}^{0}(1) + \mu_{12}^{1}(1) + \ldots, \qquad \chi_{12}(12) = \chi_{12}^{0}(12) + \chi_{12}^{1}(12) + \ldots,$$

where the first term arises from collisions with those of the second kind of molecules
in the medium with the Maxwellian distribution, and the second term from those
deviating from the Maxwellian distribution due to the presence of the gradients
$\partial\lambda_i/\partial\mathbf{x}$. As $\mu_{12}^{1}(1)$ and $\chi_{12}^{1}(12)$ are not required in the present paper, only $\mu_{12}^{0}(1)$ and
$\chi_{12}^{0}(12)$ will now be calculated.



Rigid sphere model

The collision frequency of rigid spherical molecules calculated on the assumption
of binary encounter and molecular chaos is given in most text-books on the kinetic
theory of gases (cf. Chapman & Cowling 1939). The result is

= (Al) 

where 

<^ia = i(o-i + <r2),, ya = (A2) 


$\sigma_1$ and $\sigma_2$ are the molecular diameters of the two kinds of molecules and $m_1$ and
$m_2$ the molecular masses.

To calculate $\chi_{12}^{0}(12)$, we proceed at first in a similar manner as in the calculation
of $\mu_{12}^{0}(1)$, that is, by counting the probability of the occurrence of a specific type of
collision but then integrating over all possible types of collision for which the chosen
molecule is deflected into a specific velocity range $\mathbf{v}_1^{(2)}$, $d\mathbf{v}_1^{(2)}$ after collision. Thus one
obtains

$$\chi_{12}^{0}(12)\,dt\,d\mathbf{v}_1^{(2)} = dt\iiint_R f_2^{0}(\mathbf{v}_2^{(1)})\,g\,b\,db\,d\epsilon\,d\mathbf{v}_2^{(1)}, \tag{A 3}$$

where $f_2^{0}(\mathbf{v}_2^{(1)})$ is the Maxwellian velocity distribution of the second type of molecules
normalized to its number density, $\mathbf{g}$ ($= \mathbf{v}_2^{(1)} - \mathbf{v}_1^{(1)}$) is the velocity of a molecule in the
medium relative to the chosen molecule, $b\,db\,d\epsilon$ is an elementary cross-section or
'target area' in polar co-ordinates on a plane passing through the chosen molecule
and perpendicular to $\mathbf{g}$, $b$ being the impact parameter and $\epsilon$ the polar angle. The
integration is to be carried out over a region $R$ defined by

$$R:\quad \mathbf{v}_1^{(2)} < \mathbf{v}_1^{(2)}(\mathbf{v}_2^{(1)}, b, \epsilon) < \mathbf{v}_1^{(2)} + d\mathbf{v}_1^{(2)}.$$

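For binary encounter we have in general

$$\mathbf{p} = \frac{2m_2}{m_0}\,(\mathbf{g} \cdot \boldsymbol{\kappa})\,\boldsymbol{\kappa}, \tag{A 4}$$

where $\mathbf{p}$ ($= \mathbf{v}_1^{(2)} - \mathbf{v}_1^{(1)}$) is the velocity of the chosen molecule after collision relative
to its velocity before collision, $\boldsymbol{\kappa}$ is the unit vector in the direction
of $\mathbf{p}$, and $m_0 = m_1 + m_2$. For rigid spheres we have in particular

$$b = \sigma_{12}\sin\psi, \qquad g\,b\,db\,d\epsilon = \mathbf{g} \cdot \boldsymbol{\kappa}\,\sigma_{12}^2\,d\Omega_\kappa,$$

where $d\Omega_\kappa = \sin\psi\,d\psi\,d\epsilon$ is an elementary solid angle in the direction of $\boldsymbol{\kappa}$ and $\psi$ is
the angle between $\mathbf{g}$ and $\boldsymbol{\kappa}$. Equation (A 3) then becomes

$$\chi_{12}^{0}(12)\,d\mathbf{v}_1^{(2)} = \iint_R f_2^{0}(\mathbf{g} + \mathbf{v}_1^{(1)})\,\sigma_{12}^2\,\mathbf{g} \cdot \boldsymbol{\kappa}\,d\Omega_\kappa\,d\mathbf{g},$$

where the relative velocity $\mathbf{g}$ has replaced $\mathbf{v}_2^{(1)}$ as integration variable. If $\boldsymbol{\kappa}$ is chosen
as the new $x$-axis the region $R$ can be described by two conditions: first, $\mathbf{p}$ must lie
inside $d\Omega_\kappa$ and secondly $g_1 = \mathbf{g} \cdot \boldsymbol{\kappa}$ must lie within the limits

$$\frac{m_0}{2m_2}\,p \leq g_1 < \frac{m_0}{2m_2}\,(p + dp).$$

Hence

$$\chi_{12}^{0}(12)\,d\mathbf{v}_1^{(2)} = \sigma_{12}^2\,g_1\,dg_1\,d\Omega_\kappa\int_{-\infty}^{+\infty}\int_{-\infty}^{+\infty} f_2^{0}(\mathbf{g} + \mathbf{v}_1^{(1)})\,dg_2\,dg_3,$$

where $g_2$ and $g_3$ are the two components of $\mathbf{g}$ in the plane perpendicular to $\boldsymbol{\kappa}$. After
performing integration over $g_2$ and $g_3$, and using $d\mathbf{v}_1^{(2)} = p^2\,dp\,d\Omega_\kappa$, one obtains

$$\chi_{12}^{0}(12) = n_2\,\sigma_{12}^2\left(\frac{m_2}{2\pi kT}\right)^{1/2}\left(\frac{m_0}{2m_2}\right)^2\frac{1}{|\mathbf{v}_1^{(2)} - \mathbf{v}_1^{(1)}|}\exp\left\{-\frac{m_2}{2kT}\left[\frac{m_0}{2m_2}\,|\mathbf{v}_1^{(2)} - \mathbf{v}_1^{(1)}| + \mathbf{v}_1^{(1)} \cdot \boldsymbol{\kappa}\right]^2\right\}. \tag{A 5}$$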

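It may be verified that

$$\int \chi_{12}^{0}(12)\,d\mathbf{v}_1^{(2)} = \mu_{12}^{0}(1), \qquad \int f_1^{0}(1)\,\chi_{12}^{0}(12)\,d\mathbf{v}_1^{(1)} = f_1^{0}(2)\,\mu_{12}^{0}(2),$$

as one would expect.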

General force law 

It has been mentioned in § 3 that for the general force law the classical cross-section
diverges and the collision frequency $\mu$ also diverges due to the weak interaction of
the distant molecules. Fortunately, we do not need to consider such weak interaction.
Variation of $r_0$ will only result in the redistribution of the contributions from
the successive terms in our final formula for the coefficient of self-diffusion (44).

Let $\phi(r)$ be the potential energy between two molecules expressed as a function
of the distance $r$ between their centres of gravity. We can take $\phi(r) = 0$ for $r > r_0$.
The expression for $\mu_{12}^{0}(1)$ takes the same form as (A 1) with $\sigma_{12}$ replaced by $r_0$. It is
the function $\chi$ which takes the actual force law inside the distance $r_0$ into account.
We shall indicate here how $\chi_{12}^{0}(12)$ for the general force law can be calculated. As
before, one has

$$\chi_{12}^{0}(12)\,d\mathbf{v}_1^{(2)} = \iiint_R f_2^{0}(\mathbf{v}_2^{(1)})\,g\,b\,db\,d\epsilon\,d\mathbf{v}_2^{(1)}, \tag{A 6}$$

$$R:\quad p \leq \frac{2m_2}{m_0}\,\mathbf{g} \cdot \boldsymbol{\kappa} < p + dp.$$


The differential equation of the orbit of relative motion in polar co-ordinates is 


dB bn mr)\ y 

dr~rLr Mg^J J ’ 


(A 7) 


where $M = m_1 m_2/(m_1 + m_2)$ is the reduced mass, $r$ the radius vector and $\theta$ the corresponding
polar angle. (The polar axis extends from the centre of the chosen molecule along
the direction of $-\mathbf{g}$.)

At the moment of nearest approach, the polar co-ordinates of the incoming molecule
being $R$ and $\Theta$, one has $dr/d\theta = 0$. Hence

$$\frac{2\phi(R)}{Mg^2} + \frac{b^2}{R^2} = 1, \tag{A 8}$$

from which $R$ as a function of $b$ and $g$ can in principle be found. At the moment when
the incoming molecule first enters the sphere of influence, its polar co-ordinates
are $r_0$ and $\theta_0 = \sin^{-1}(b/r_0)$. From (A 7), regarding $\theta$ as a function of $r$ and
integrating between $R$ and $r_0$, one has

(A 9)

from which $\Theta$ as a function of $g$ and $b$ can in principle be found. Equation (A 6)
becomes then


R 

/*+<» r+oo r 
J — 00 J — CO J . 


sm.'^lrA^dei^ 




mo 

2m2^ 


gHg,t) 

% 

sin^ 

dtp- 


dgidg^dgs, (A 10) 


where ijr inside the integral is expressed = cos“^ 


-1& 


Appendix B. Calculation of $\alpha_i$ and $\beta_i$ for non-uniform gas mixtures


The assumption that the physical parameters $\lambda_j$ vary only smoothly will be made
in the calculation. One can always express $\lambda_j$ in the vicinity of a given point $\mathbf{x}$ by
means of Taylor's expansion. For the present purpose we shall neglect the presence
of a velocity gradient and assume a constant mass-motion velocity $\mathbf{u}$ in the region
in which $\psi(\Delta\mathbf{x}, \Delta t)$ is appreciable. The velocity distribution functions of the two kinds
of molecules at $\mathbf{x}$, $t$ normalized to their respective local number densities $n_1(\mathbf{x}, t)$
and $n_2(\mathbf{x}, t)$ are denoted by $f_1(\xi_1, \mathbf{x}, t)$ and $f_2(\xi_2, \mathbf{x}, t)$. The following abbreviations
will be used hereafter:


A(2)3=A(??U®),«3), /t,(2)3 = /t(5?),xW<3), %(2) = »,(X<»U2).1 

Ai(23)4 = Df = 

ax®“ t9x(W0Aj, 9«i“ 




(Bl) 


Further, if the pressure is taken to be constant during diffusion and it is supposed
that no external force field is present, one finds

(B 2)

where $\mathbf{v}_2^{(i)}$ ($= \xi_2^{(i)} - \mathbf{u}$) is the peculiar velocity of a molecule of the second kind on its
$i$th flight relative to the local mass motion velocity $\mathbf{u}$.

Both $\mu$ and $\chi$ at any later stage of the motion of the molecule can be expressed
in terms of their values at the starting-point:


(B3) 



By means of (B 3), $P_N(1, 2, \ldots, N)$ in (39) can be expressed in terms of the values
of quantities at the starting-point $\mathbf{x}^{(1)}$, $t_1$, neglecting gradients of the second and
higher order. The mean time of flight, mean displacement and mean square displacement
relative to a frame moving with $\mathbf{u}$ can be found by simple calculations.
The mean time of flight is found to depend on the direction of the velocity in this
flight, but as the difference can at most be linear in the gradients, it will not affect
our final result for $\alpha_i$, $\beta_i$ which is obtained by using the average time of flight for all
directions in each flight. Further, though such times averaged over all directions
are different for individual flights, the difference can again be ignored for the same
reason.

As it is understood in the following that the chosen molecule is of the first kind and
quantities pertaining to any later stage of its journey are always expressed in terms
of those at the starting-point $\mathbf{x}^{(1)}$, $t_1$, we shall drop subscripts attached to $f$, $\mu$ and $\chi$
and write $f(1)$ for $f_1(1)_1$, $\mu(i)$ for $\mu_1(i)_1$ and $\chi(i, i+1)$ for $\chi_1(i, i+1)_1$. With this
simplification one writes, for the mean time of the $i$th flight over all directions,


where 

and the total time of flight for N successive flights is 

The successive mean displacements relative to u are 

JJ-Pi(l)rrfdva>dr, 


<ia)> = <v<»ri,> = 


<l(2)> = <vfr,3> = 


JJPi{l)dvtt>dri3 

JJp2{12) yfh^dyfd-vfUTy^dTz 
JJp2(12) dy^HyfdTy^dr^ 


where 


(B4) 


(B5) 


(B6) 


Pi(l) = (Ml) + -Di") /(I) (exp [- J^"“(M1) +^i2J5iV(l)) -fTiaPMl)) 

ra I 


neglecting terms involving the second or higher powers of 9Aj/0x. 



With the help of (B 7), equation (B 6) becomes 


Hence the total mean displacement (Ax) is 


J vS«dv?> 


The mean square displacement when quantities involving the gradients are
neglected is found to be the same as in the equilibrium state. The required expressions
for $\alpha_i$ and $\beta_i$ follow from (B 6), (40) and (B 8) for large $N$:

- tf™ llr)'^”+ JJ® ■■■) • 

With the help of (B 2), can be reduced to the form 


K-^J/'(i)W+i§|5K«.)=»;+ 


3?ii0xW 




(B 10) 


Substituting (B 10) in (13), an identity is obtained which proves the statement
made at the beginning of § 5.


Appendix C. Evaluation of the coefficient of self-diffusion in gases

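To evaluate the coefficient of self-diffusion in a simple gas as given in (44), we need
$\mu$ and $\chi$ which are given in (A 1) and (A 5) of appendix A for the rigid spherical model.
For a simple gas $\mu_{12}^{0}(1)$ and $\chi_{12}^{0}(12)$ reduce to the form

$$\mu(1) = n\sigma^2\left(\frac{2\pi kT}{m}\right)^{1/2} E(y_1), \qquad \chi(12) = \frac{n\sigma^2}{\pi^{1/2}}\,\frac{m}{2kT}\,\frac{1}{p_{12}}\exp\left[-\frac{(y_2^2 - y_1 y_2\cos\theta_{12})^2}{p_{12}^2}\right], \tag{C 1}$$

where

$$p_{12}^2 = y_1^2 + y_2^2 - 2y_1 y_2\cos\theta_{12}, \qquad E(y_1) = e^{-y_1^2} + \left(2y_1 + \frac{1}{y_1}\right)\int_0^{y_1} e^{-x^2}\,dx.$$

$\theta_{12}$ is the angle between $\mathbf{y}_1$ and $\mathbf{y}_2$; $n$, $m$ and $\sigma$ are the number density, the molecular
mass and diameter respectively.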





Substituting (C 1) in (44) one obtains

$$D_s = \frac{4}{3\pi\sigma^2 n}\left(\frac{2kT}{m}\right)^{1/2}(C_0 + C_1 + C_2 + \ldots), \tag{C 2}$$

where

$$C_0 = \int_0^{\infty} \frac{e^{-y_1^2}\,y_1^4\,dy_1}{E(y_1)},$$

Cl = 2 


oE{yj)Il{y^) 

00 rn p g_(y.+3,;+j,j) / 


( 2 / 12 / 2 )® sin ^12 COS 6»i2d0i2%i 052/2. 


"® 0 j 0 J 0 J 0 j 0 JSl{y^) E{y,) E{yf) pP 




sin ^21 sin (923 Gosd^i,dd^-i^de^dy-i^dy^d/y^, 

\PnP23 / 


The term $C_0$ has been evaluated accurately by means of the table of functions
provided by P. Rosenberg to be 0.2093. To estimate the values of the rest we make the
following approximation: the velocity spread of the Maxwellian velocity distribution
is replaced inside the integral by a distribution of constant average speed with random
orientation, i.e. ®^P ~ 2^ ^ replaced by — ^ | ® - iA>) dv^'), 


/2feT'\t TT — 2 

where v® = 21 -^ j , or e"J'* is replaced by j 5(1 2 / 1 1®— 2 / 1 ) with ^i = ^. 

By introducing such $\delta$-functions, $C_1$ and $C_2$ can be worked out after short calculations
to be


where 


Up to the third term the value of $D_s$ is


® \ m / ' mrcr^ \ ^ J 


This estimation serves, on the one hand, to show the rapid convergence of the
series of constants $C_0$, $C_1$, ... and on the other to compare with the expression given
by Chapman and Pidduck,

i>, = (l-019)^(^*= 0-779^(— )*. (C6) 

^ ^ ' Sna^ \7Tm) nncr^ \m I 
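The quoted value of the leading constant can be checked by direct quadrature. The sketch below is a minimal check under stated assumptions: it takes $C_0$ to be the integral of $y^4 e^{-y^2}/E(y)$ from zero to infinity, with $E(y)$ the rigid-sphere function defined in this appendix, and it takes the prefactor of the series to be $4/(3\pi n\sigma^2)\,(2kT/m)^{1/2}$; both readings are inferred from the first term of (44) rather than copied from a legible formula. The cut-off `Y_MAX` and the helper names are choices made for this example.

```python
import math

def collision_function(y):
    # E(y) = exp(-y^2) + (2y + 1/y) * integral_0^y exp(-x^2) dx, with the limit E(0) = 2.
    if y == 0.0:
        return 2.0
    integral = 0.5 * math.sqrt(math.pi) * math.erf(y)
    return math.exp(-y * y) + (2.0 * y + 1.0 / y) * integral

def simpson(func, a, b, steps=4000):
    # Composite Simpson rule; steps must be even.
    h = (b - a) / steps
    total = func(a) + func(b)
    for k in range(1, steps):
        total += (4 if k % 2 else 2) * func(a + k * h)
    return total * h / 3.0

def c0_integrand(y):
    # Integrand of the leading constant: y^4 exp(-y^2) / E(y).
    return y ** 4 * math.exp(-y * y) / collision_function(y)

Y_MAX = 8.0  # assumed cut-off; the integrand is negligible beyond it
c0 = simpson(c0_integrand, 0.0, Y_MAX)

# Leading term of the self-diffusion coefficient in units of (1/(n sigma^2)) (2kT/m)^(1/2),
# under the assumed prefactor 4 / (3 pi).
first_term = 4.0 * c0 / (3.0 * math.pi)

print(c0, first_term)
```

The quadrature reproduces the tabulated constant to about three figures, which supports the reading of the integrand used here.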



References

Chandrasekhar, S. 1943 Rev. Mod. Phys. 15, 1.

Chapman, S. & Cowling, T. G. 1939 The mathematical theory of non-uniform gases. Cambridge University Press.

Einstein, A. 1905 Ann. Phys., Lpz., 17, 549.

Einstein, A. 1906 Ann. Phys., Lpz., 19, 371.

Fürth, R. 1942 Proc. Roy. Soc. A, 179, 461.

Jeans, J. H. 1925 Dynamical theory of gases. Cambridge University Press.

Kennard, E. H. 1938 Kinetic theory of gases. New York: McGraw-Hill Book Co.

Rosenberg, P. 1942 Phys. Rev. 61, 528.

Wang, M. C. & Uhlenbeck, G. E. 1945 Rev. Mod. Phys. 15, 1.


Eddy diffusion of water vapour and heat near the ground 

By F. Pasquill, Meteorological Office, London and School of Agriculture,

University of Cambridge

(Communicated by Sir Geoffrey Taylor, F.R.S. Received 17 December 1948)

An experimental study has been made of the factors involved in the turbulent transport of
water vapour and heat in the lowest layer of the atmosphere over well-exposed level grassland.
Measurements were made over periods of 1 hr. of the water loss from isolated but otherwise
naturally exposed sections of the surface layers of the soil, and quantitative arguments advanced
for adopting them as a reasonable approximation to the true evaporation loss from the
ground surface. The incoming and reflected components of solar radiation, the temperature
distribution in the soil down to 16 in. and the vertical profiles of temperature, humidity and
wind speed in the air up to a height of 2 m. were observed at the same time, and samples taken
to provide necessary data on the physical properties of the soil. The net flux of long-wave
radiation was computed from the temperature and humidity structure of the atmosphere as
given by the present low-level measurements and routine upper-air soundings. The data
prescribe the vertical turbulent flux and the vertical gradients of the water vapour and heat
content of the air, from which may be evaluated the vertical components of the eddy diffusivities
for water vapour and heat ($K_V$ and $K_H$) as customarily defined.

In the absence of thermal stratification of the surface air layers $K_V$ is shown to be identical
with the eddy diffusivity for momentum ($K_M$) defined by the explicitly established logarithmic
law relating the aerodynamic drag and vertical wind shear over a rough surface. The
modification of $K_V$ by unstable and stable thermal stratifications and the rapid decrease of stability
influence as the ground surface is approached are both quantitatively demonstrated, and a
unique relation between parameters involving $K_V$, the vertical wind shear and the vertical
temperature gradient is indicated. No completely satisfactory a priori explanation can as yet
be given for the latter relation, though in unstable conditions $K_V$ is found to be identical with
$K_M$ computed from a recent wind-profile law which does not involve the temperature gradient
explicitly and has only been established in functional form. Direct comparison of $K_V$ and
$K_H$ reveals a reasonable approach to equality in stable conditions but shows that the latter
coefficient is systematically and substantially the greater in unstable conditions. The latter feature is
qualitatively in keeping with recent trends in the theoretical concepts of turbulent transport.

The bearing of the results on the problem of indirectly evaluating natural land evaporation 
is briefly discussed and attention drawn to the implied superiority of the present ‘hydro- 
dynamical’ approach over the classical ‘heat-balance’ method. 


1. Introduction

The earliest considerations of the effect of the turbulent structure of the lower atmo- 
sphere, initiated by Taylor (1915, 1917), clearly demonstrated the existence of a 
diffusive action much more powerful than any which could be attributed to molecular
motion. From data then available on the vertical distribution of wind velocity and
temperature in the atmosphere Taylor deduced ‘eddy diffusivities’ (analogous to
the kinematic viscosity and thermal diffusivity in molecular transport) of the order
of 10^3 to 10^5 cm.²/sec. and confirmed in a broad sense his theoretical deduction that
the eddy transport of conserved properties of the atmosphere may be prescribed 
by an eddy diffusivity which is independent of the property undergoing transport. 
Hydrographic observations by Jacobsen (see Taylor 1931) indicated, however, that 
the latter feature did not necessarily hold in a fluid possessing a marked density 
stratification and in which gravitational forces were thus of dominant importance. 

The precise magnitudes of the eddy diffusivities appropriate to the transport of
momentum, matter and heat, and their dependence on other physical factors, still
occupy a central place of interest in problems of turbulent diffusion. Much accurate
information has arisen from laboratory studies of aerodynamic drag, evaporation
and heat transfer in turbulent flow through pipes and over flat plates, and it is note- 
worthy that in the case of aerodynamically smooth surfaces there is good evidence 
(Pasquill 1943) for close numerical equality in the eddy diffusivities for momentum, 
vapour and heat, except for small differences which may reasonably be ascribed to 
different magnitudes of the appropriate molecular diffusivities. Data for atmospheric 
flow are necessarily less precise and are complicated by the natural roughness of the 
earth’s surface and the frequently marked thermal stratification of the surface layers 
of the air. In the absence of the latter feature treatments by Sutton (1934, 1947) and 
Calder (1949), assuming identity in the transport of momentum and matter, have 
made substantial advances in our understanding of the spread of smoke and vapour 
introduced artificially into the atmosphere. On the other hand, the present knowledge
of the processes of diffusion of the water vapour and heat content of the atmosphere
is still based in the main upon more or less reasonable conjecture untested as yet by
critical observational data. Furthermore, a recent notable contribution to the 
theoretical side of the problem (Priestley & Swinbank 1947) introduces in the heat- 
flux equation a ‘ buoyancy ’ term which had not been explicitly recognized previously, 
though the importance of gravitational forces in turbulent processes had already 
been noted by Taylor (1931 and unpublished), and in broader implication implies 
the possibility of there being different values of the eddy diffusivity for the turbulent 
transport of different properties in a thermally stratified atmosphere. 

An observational study with quantitative bearing on some of the above features 
is presented and discussed in this paper. The study is restricted to a shallow surface 
layer of the atmosphere and to flow over a level well-exposed surface of small aero- 
dynamic roughness, but it covers a substantial range in the magnitude of the wind 
velocity and the degree of thermal stratification of the atmosphere and provides 
some appreciation of the problem of indirectly evaluating natural evaporation. 

2. Observational aspects

The observational problem consisted of obtaining values of the vertical flux of 
water vapour, in effect the natural rate of evaporation from the ground, the vertical 
flux of heat by turbulent transport and the vertical gradients of humidity and
temperature. From such data the magnitudes of the eddy diffusivities for water vapour
and heat are directly obtainable, while from measurements of the vertical profile
of wind speed and appeal to aerodynamic laws the eddy diffusivity for momentum 
may also be evaluated in certain circumstances. The measurements were made on 
a clayland pasture on the University Farm at Cambridge during spells of fine weather 
in March 1948. For the wind directions then obtaining this site is suitably level and
unobstructed for the establishment of horizontal homogeneity in the structure of the 
air up to a height of 2 m. Upwind of the instruments there was always at least 160 yd. 
of level uniform terrain (see Deacon 1949, on the distance required for the setting 
up of a homogeneous wind profile). The nearest substantial variation in level occurs 
beyond 160 yd. to the south-east where the ground slopes gently into a small valley, 
while the nearest obstacle of important size is a close about 60 ft. high covering a front 
of 260 yd. at a distance of 600 yd. to the south-south-west. Bare soil was visible over 
a considerable proportion of the pasture and the patchy grass cover was mainly 
2 cm. long with occasional withered tufts up to 6 to 7 cm. and showed no appreciable
change in length over the whole period of observation. 

The essential features of the assembly of instruments are illustrated in figure 1. 
The vertical profiles of air temperature and humidity up to a height of 2 m. were 
explored over individual periods of 1 hr. with a portable distant-indicating thermocouple
psychrometer apparatus developed specially for the purpose and described
in detail elsewhere (Pasquill 1949), while the corresponding profiles of wind speed 
were observed with sensitive cup anemometers which are a development of a type 
described by Sheppard (1940). Masts supporting the instruments at heights of 200,
150, 100, 50, 37.5 and 25 cm. were set up in line across wind. During each 3 min.
period the differences in dry-bulb and wet-bulb temperature between the 200 cm.
level and each other level in turn were read to 0.01° F on a suitable indicating
galvanometer, and the absolute values of the temperatures at the 200 cm. level were
read separately to 0.1° F. In the earlier observations this procedure was maintained
throughout the whole period of 1 hr., the anemometers being continuously in opera- 
tion over the same period. At the end of the run the psychrometer mast was swung 
into a horizontal position across wind and five sets of readings taken so as to provide 
zero corrections. At the same time all anemometers were mounted at a height of 
2 m. and compared over a period of 20 min. or so. In these control observations there 
was sometimes reason to suspect that the apparent inequalities in psychrometer 
performance were partly spurious and due to lateral wind swings around the closely 
disposed instruments, causing a given psychrometer to receive air which had been 
in contact with the irradiated surface of an adjacent instrument. A modified control 
procedure was accordingly adopted in all subsequent observations. In this the 
observation period was divided into two equal parts with a 12 min. interval, and
for the second part the psychrometers and anemometers were systematically interchanged,
the 200 cm. instrument with that at 100 cm., 150 cm. with 37.5 cm. and
50 cm. with 25 cm. On the basis of laboratory measurements of thermal lag it was
known that the period of 12 min. was adequate for the adjustment of the inter- 
changed psychrometers to their new environments. This procedure complicated the 
observational and analytical work but on the whole was thought to provide a more 
dependable instrumental control. 




For the evaluation of the vertical flux of heat it was necessary to measure or
estimate the magnitudes of the remaining components in the balance of heat-exchange
processes. The direct and diffuse solar radiation on a horizontal surface and
the component reflected from the ground were measured by a solarimeter calibrated 
at Kew Observatory, one reading of each component being taken in each 3 min. 
period. These measurements were made only with practically clear sky, so that 
individual readings did not show the pronounced variation associated with scattered 
or broken cloud, and the frequency of reading adopted could thus be expected to 
yield representative mean values. 



Figure 1. Schematic diagram of array of instruments.


Soil temperatures at various depths were measured by a distant-indicating multi- 
point thermometer consisting of a non-conducting cylindrical stem with copper 
strips set in various positions flush with the outer surface. Fine thermocouple wires, 
enclosed in the stem, terminate in single junctions soldered into each copper strip.
The essential principle of the instrument is that it may be installed in the soil, merely 
by driving it in, with a minimum disturbance of soil conditions together with the 
maintenance of adequate contact between the soil and the temperature-sensitive 
surfaces. The temperature differences between the thermocouple at a depth of 16 in. 
and those at 12, 8, 6, 4, 2, 1, ½ and approximately 0 in. were read to 0.05° F during
each 3 min. period, and a reading corresponding to the difference between the 16 in. 
element and an element immersed in a water-bath at known temperature was also 
taken so as to specify the absolute values of the temperatures. 

In order subsequently to obtain data on the thermal properties of the soil, samples
were taken with an ‘undisturbed core’ sampler consisting of a steel tube with a
cutting rim tapered on the outside and inset slightly on the inside, the cutting diameter
being 4.5 cm. On driving the sampler vertically into the ground, compression
and distortion of the soil occurs outside the tube, while a relatively undisturbed core
of soil is left inside the tube, from which it is quickly removed, cut into suitable
sections and weighed immediately. These samples were taken down to a depth of 
8 in., at a single point about a yard away from the soil thermometer, on three occa- 
sions corresponding to the general periods when heat-balance observations were 
made. 

Evaluation of the long-wave radiation term requires, among other items, a know- 
ledge of the radiative temperature of the ground. It has been shown by Robinson 
(1947) that to a good degree of approximation the radiative temperature of ground 
covered by short grass is indicated by a spirit thermometer lying on the ground 
(actually a Meteorological Office pattern grass minimum thermometer), even when 
the sun is at appreciable elevation. Single readings were taken in each of the 3 min. 
periods, this again being considered of adequate frequency in clear weather when
violent oscillations of the indicated temperature did not occur. 

Measurements of the natural rate of evaporation from the ground were attempted 
by means of soil evaporimeters of simple design. Undisturbed cores of soil 4 in. deep 
were extracted by means of a sampler similar to that already described but of cutting 
diameter 3^ in. These were inserted in close-fitting pots made of Tufnol tube of 
internal diameter 4 in. and wall thickness ^ in., so that the soil surface was flush with 
the rim. Corresponding but slightly larger containers were embedded in the ground 
with the rims flush with the surrounding soil surface, so forming clean watertight 
receptacles for the soil containers. The effect was thus to isolate sections of the 
surface layer of the ground from surrounding and underlying soil without markedly 
disturbing the conditions existing therein, while leaving the actual surfaces exposed 
as in the natural state. These soil evaporimeters were set out about 16 to 20 yd. 
upwind of the temperature and wind instruments and weighed at the beginning and 
end of the observation period of 1 hr. on an indicating balance with an accuracy 
better than 0.03 g. The losses ranged from 0.1 to 2.5 g., and corrections (usually
about 5 %) were applied for the abnormal exposure during the weighing process. 
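
The conversion of a weighed loss into a rate of evaporation per unit area can be sketched as follows. This is an illustrative calculation only: the function name, the core diameter and the figures used are assumptions made for the example and are not values taken from the measurements described here.

```python
import math

def evaporation_flux(loss_g, diameter_cm, period_s):
    """Mean evaporation rate (g cm^-2 sec^-1) from a weighed soil core.

    loss_g      -- loss of weight over the observation period (g)
    diameter_cm -- diameter of the exposed soil surface (cm), assumed circular
    period_s    -- length of the observation period (sec)
    """
    area_cm2 = math.pi * (diameter_cm / 2.0) ** 2
    return loss_g / (area_cm2 * period_s)

# Hypothetical figures: 1.0 g lost in 1 hr. from a core of 10 cm diameter.
flux = evaporation_flux(1.0, 10.0, 3600.0)
print(f"{flux:.3e} g cm^-2 sec^-1")
```

Any correction for abnormal exposure during weighing would be applied to the loss before this conversion.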

3. The interpretation of the evaporation measurements

The direct determination of the true rate of evaporation by the preceding method 
is complicated in particular by the unavoidable isolation of the test soil from the 
underlying layers of soil, though it was thought that it might be possible to choose 
circumstances in which the effect of this limitation could be neglected. The site of 
the present measurements is on a deep bed of clay, the top layer of which normally 
dries out fairly progressively from the surface downward during the spring and 
summer. It follows that there may be certain stages in the drying history of such
ground when isolation of the soil at some shallow depth would not appreciably affect
the rate of loss of water from the surface for some period. Such a stage possibly occurs
in the early spring before drying out has proceeded to any great depth and before 
root growth has penetrated deeply, thus possibly drawing water from lower strata. 

The optimum dimensions of isolated soil cores consistent with maximum depth 
of isolation, capacity and accuracy of balances available and anticipated order of
the rate of evaporation, were in the region of 4 in. diameter and depth, and these
dimensions were accordingly adopted for the main soil evaporimeters. Two features 
suggested qualitatively that the isolation imposed at a depth of 4 in. was probably 
unimportant in the present measurements. In the course of extracting soil cores 
it was observed that the root growth rarely penetrated below 3 in., so that tran- 
spiration was probably unaffected. Furthermore, it was unmistakably evident that 
appreciable drying had occurred in the top 2 in. of previously undisturbed ground, 
whereas there was no visual indication of drying having occurred at lower levels. 
However, a quantitative test was clearly necessary, and an approach towards this 
was provided in the following fashion. Additional soil pots were made to take soil 
cores of the same surface area but of depths 2 and 3 in. respectively, and on all
occasions one or both of these was used in conjunction with the normal 4 in. evaporimeters.
If the vertical isolation exerted substantial influence on the water loss from
the core it would be expected that the effects would show up to an increasing degree 
with the decrease of isolation depth and with the passage of time. 


Table 1. Evaporation data

                 period of                          relative loss from evaporimeters               actual loss
group            evaporation   obs.                                                                in g. from
and date         (min.)        no.    G.M.T.*   2 in.   3 in.   no. 1   no. 2   no. 3   no. 4   no. 5   no. 5

A                75             6     10.33     1.62    1.21    1.38    2.19    1.40    1.33    1.00    0.82
10. iii. 48      75             7     12.30     1.69    1.14    1.17    1.62    1.24    1.25    1.00    1.00
                 75             8     15.30     1.69    1.03    1.33    1.49    1.27    1.39    1.00    0.75
                 75             9     17.30     2.92    2.17    1.83    2.83    2.17    4.17    1.00    0.12
                 75            10     19.30     1.82    1.00    1.29    1.41    1.29    1.82    1.00    0.17

B                75            11     12.35     0.75    0.75    0.79    0.85    0.83    0.89    1.00    2.48
18. iii. 48      75            12     14.30     0.74    0.74    0.77    0.88    0.88    0.82    1.00    1.60
                 75            13     16.34     0.73    0.68    0.85    0.80    0.89    0.68    1.00    0.99
                 75            14     19.32     0.53    0.61    0.66    0.84    0.76    0.68    1.00    0.38

C                70            15     09.38     0.78    0.92    1.00    0.95    -       -       -       1.18
24. iii. 48      70            16     11.30     0.69    0.87    1.00    0.89    -       -       -       1.47
                 70            17     14.30     0.68    0.84    1.00    0.82    -       -       -       1.45
                 70            18     16.30     0.77    0.87    1.00    1.06    -       -       -       0.88
                 70            19     18.30     0.63    0.77    1.00    0.67    -       -       -       0.30

C'               70            20     10.30     0.71    0.91    1.00    0.84    -       -       -       1.14
25. iii. 48      70            21     12.35     0.76    0.92    1.00    0.99    -       -       -       1.43
                 70            22     15.30     0.80    0.84    1.00    0.87    -       -       -       1.28
                 70            23     18.30     0.62    0.85    1.00    1.00    -       -       -       0.26

D                70            24     10.30     0.81    0.89    1.00    0.91    -       -       -       1.52
28. iii. 48      70            25     12.33     0.76    0.83    1.00    0.86    -       -       -       1.62
                 70            26     14.42     0.82    0.97    1.00    0.97    -       -       -       1.10
                 70            27     16.30     0.64    0.91    1.00    0.94    -       -       -       0.64
                 70            28     19.00     0.11    0.67    1.00    1.00    -       -       -       0.09

D'               70            29     11.39     0.94    1.08    1.00    1.00    -       -       -       1.35
29. iii. 48      70            30     14.03     0.79    0.98    1.00    1.00    -       -       -       0.85

Note. Evaporimeters nos. 1 to 5 are of depth 4 in.

* Refers to midtime of observation period of approximately 60 min.


Details of the water losses observed during each observation period are listed in
table 1. The observations have been grouped into sets in which the same soil cores
were employed for a number of consecutive observations, these sets being designated
by letters A, B, C and D, a subdivision, indicated by a dashed letter, being made
when a set of soil cores were employed for 2 days in succession (C' and D'). The figures
include the actual loss from one of the evaporimeters (no. 5) and the relative losses
from the remainder. For group A evaporimeters nos. 1 and 2 and the 2 and 3 in.
auxiliary evaporimeters were installed the day before, whereas nos. 3, 4 and 5 had
been in position for 8 days, during which period each had lost about 25 g., which
corresponded to approximately 10 % of the original water content. For group C
the installation was carried out 2 days preceding, while for groups B and D it was
carried out on the same morning.

Group A, in which the maximum number of evaporimeters was employed, provided
the most critical test of performance in view of the wide range of evaporimeter age 
also involved. In any one observation the losses indicated by individual evapori- 
meters vary substantially, though in the whole group of observations there is 
apparently a fairly consistent relation between individual evaporimeters. We may 
therefore conveniently examine these results in terms of the mean relative losses 
for the whole group, and these are reproduced below: 


evaporimeter            2 in.    3 in.    no. 1    no. 2    no. 3    no. 4    no. 5
evaporimeter depth      2 in.    3 in.    4 in.    4 in.    4 in.    4 in.    4 in.
evaporimeter ‘age’      1 day    1 day    1 day    1 day    7 days   7 days   7 days
mean relative loss      1.9      1.3      1.4      1.9      1.5      2.0      1.0


From these figures no systematic effect of either evaporimeter depth or evaporimeter 
age is evident, and it seems legitimate to conclude that the differences exhibited are 
primarily due to a genuine point-to-point variation in evaporation, arising from 
heterogeneity in soil moisture, vegetation cover and local exposure. From inspection 
of the rest of the observations, for which in each group evaporimeter age was con- 
stant, there is again no pronounced indication of a systematic effect of evaporimeter 
depth. On the argument put forward these features strongly suggest that in the 
soil situation then existing the soil isolation imposed had no important effect on the
rate of loss of water from the surface, and that the losses observed with the 4 in. 
evaporimeters may be assumed to be a reasonable approximation to the true rate 
of evaporation.* 
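
As a check on the arithmetic, the mean relative losses quoted for group A can be recomputed from the individual relative losses of observations 6 to 10. The sketch below assumes the tabulated values as read from table 1 (two entries in observation 7 are taken as 1.14 and 1.17); the variable names are introduced purely for illustration.

```python
# Relative losses for group A (observations 6 to 10), one list per evaporimeter,
# as read from table 1. Evaporimeter no. 5 is the reference (relative loss 1.00).
group_a = {
    "2 in.": [1.62, 1.69, 1.69, 2.92, 1.82],
    "3 in.": [1.21, 1.14, 1.03, 2.17, 1.00],
    "no. 1": [1.38, 1.17, 1.33, 1.83, 1.29],
    "no. 2": [2.19, 1.62, 1.49, 2.83, 1.41],
    "no. 3": [1.40, 1.24, 1.27, 2.17, 1.29],
    "no. 4": [1.33, 1.25, 1.39, 4.17, 1.82],
    "no. 5": [1.00, 1.00, 1.00, 1.00, 1.00],
}

# Arithmetic mean of the relative losses for each evaporimeter.
mean_relative_loss = {
    name: sum(values) / len(values) for name, values in group_a.items()
}

for name, mean in mean_relative_loss.items():
    print(f"{name}: {mean:.2f}")
```

The means so obtained agree with the quoted figures to within one unit in the first decimal place, and show no ordering by depth or by age.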

4. Reduction of data on humidity, temperature and wind profiles

Mean values of wind speed, temperature and computed absolute humidity at 
each of the six heights are given for each observation in table 2.† Each figure represents
a mean value over a period of 1 hr., and in the case of temperature and humidity

* The writer was aware, prior to designing these measurements, that a method of 
measuring natural rate of evaporation, using plastic evaporimeter tubes extending to various 
depths, was in course of development by W. C. Swinbank (private communication). The 
actual details of technique and interpretation described here were evolved independently and 
in accordance with the particular soil circumstances and observational requirements involved. 

† This and tables 3, 4 and 6 are reproduced facing pp. 128 and 129.




the mean is derived from twenty readings in observations 6 to 14, sixteen readings in 
observations 15 to 30. The figures are given to an accuracy consistent with the 
nominal accuracy of reading, though with regard to temperature and humidity it 
should be noted that only the differences from one height to another, and not the 
absolute values, are nominally accurate to 0.01° F and 0.01 g./m.³ respectively.
Since, however, the interest lies primarily in the gradients of these factors, the 
differences, and not the absolute values, are of paramount importance. In appraising 
the real accuracy of these figures the control measures adopted should be kept in 
mind. The first procedure, in which psychrometers and anemometers were compared 
at the same height, was employed in observations 6 to 14; the second and more 
reliable procedure of systematic interchanging being adopted for the rest of the 
measurements. The greatest weight is accordingly attached to observations 15 to 30, 
particularly as regards the differences over the intervals 200 to 100, 150 to 37.5 and
50 to 25 cm., since for these intervals it may reasonably be assumed that systematic 
instrumental inequalities not covered by the calibrations are automatically eliminated
by the interchange process.
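
The way in which interchanging a pair of instruments removes a constant instrumental inequality may be illustrated by a small numerical sketch. The temperatures and biases below are hypothetical, and the sketch assumes that each instrument carries a fixed additive error and that the true difference is the same in both halves of the period.

```python
def mean_difference_after_interchange(first_half, second_half):
    """Average of the indicated differences before and after interchange."""
    return 0.5 * (first_half + second_half)

# Hypothetical true temperatures (deg F) at the upper and lower levels.
true_upper, true_lower = 50.00, 50.40
# Hypothetical constant errors of instruments A and B.
bias_a, bias_b = 0.07, -0.03

# First half: A at the upper level, B at the lower level.
first = (true_upper + bias_a) - (true_lower + bias_b)
# Second half: instruments interchanged.
second = (true_upper + bias_b) - (true_lower + bias_a)

corrected = mean_difference_after_interchange(first, second)
print(round(first, 2), round(second, 2), round(corrected, 2))
```

The biases enter the two halves with opposite signs and cancel in the mean, which is why the intervals spanned by interchanged pairs are the most trustworthy.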

Interest first naturally turns to the forms of the vertical profiles, since from these 
we require to obtain the vertical gradients of humidity, temperature and wind speed,
and furthermore we propose to utilize certain laws relating the wind profile and 
aerodynamic drag at the earth’s surface. Earlier studies of the wind profile in par- 
ticular have recognized the influence of wind speed and vertical temperature gradient 
on the form of the profile. On physical grounds an interdependence of these factors 
is to be expected, since the magnitude of the temperature gradient clearly determines 
the initial tendency towards modification of the flow by buoyancy forces, while 
the strength of the wind and the associated shearing forces constitute an opposing 
tendency and may be expected to control the extent of the modification. These ideas 
have been developed by a number of workers (see Brunt 1939, p. 242), and the 
general belief now is that the so-called ‘stability’ of the atmosphere is specified by
the following number, usually termed the Richardson number,

$$\frac{g\,(dT/dz + \Gamma)}{T\,(du/dz)^2},$$

where $T$ is in °K, $\Gamma$ is the dry adiabatic lapse rate, $dT/dz$ and $du/dz$ are the vertical
temperature and wind velocity gradients, z being measured upward. The sign of the 
above number is of course controlled by the sign of the temperature gradient, 
positive and negative values denoting stable and unstable thermal stratifications 
respectively. Zero or relatively low numerical values imply a state in which thermally 
induced buoyancy forces are entirely absent or dominated by large dynamical 
forces. For simplicity these conditions will usually be described as ‘stable’, ‘un- 
stable’ and ‘neutral’. 
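
The evaluation of this number from observed gradients may be sketched as below. The function name, the use of metric units and the sample gradients are assumptions made for illustration; they are not taken from the tabulated observations.

```python
G = 9.81                 # gravitational acceleration, m sec^-2
DRY_ADIABATIC = 0.0098   # dry adiabatic lapse rate, deg K per m

def richardson_number(temp_k, dT_dz, du_dz):
    """Richardson number g (dT/dz + Gamma) / (T (du/dz)^2).

    temp_k -- absolute temperature (deg K)
    dT_dz  -- vertical temperature gradient (deg K per m), z upward
    du_dz  -- vertical wind velocity gradient (sec^-1)
    """
    return G * (dT_dz + DRY_ADIABATIC) / (temp_k * du_dz ** 2)

# Hypothetical daytime case: temperature falling with height, strong shear.
ri = richardson_number(283.0, -0.5, 2.0)
print(f"Ri = {ri:.4f}")
```

A negative result corresponds to an unstable stratification, and a numerically small result to conditions approaching neutral.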

As an illustration of the general features of the present vertical profiles, three 
small groups of observations have been selected as representative of moderate 
instability, near-neutral conditions and moderate stability, the mean values of the 
Richardson number at 75 cm. being −0.056, −0.005 and +0.067 respectively. The
selection has been made from observations 15 to 30, which, being subject to a more
satisfactory control than the earlier observations, should provide a more critical
demonstration of the forms of the profiles. The mean profiles for each property and
each group are shown graphically in figure 2, a common representation of all three
properties, wind speed, temperature and humidity, being achieved by plotting
against the logarithm of the height, the values of the parameter

$$\frac{S_{25} - S_z}{S_{25} - S_{50}},$$

$S_z$ representing wind speed, absolute humidity or temperature at height $z$ cm. In
neutral conditions both wind speed and absolute humidity show a close approach to 
a linear relation with the logarithm of the height. The remaining profiles of wind 
speed and humidity and the two profiles of temperature exhibit a systematic departure
from linearity, such that in unstable conditions the $S$/log $z$ curve is concave
upwards and in stable conditions convex upwards. This is wholly consistent with
recent profile results quoted by Sheppard (1947) and by Deacon (1949). The departure
of individual points from straight lines or smooth curves provides some indication 
of the general accuracy and representative nature of the measurements. As might 
be expected, greatest consistency has evidently been achieved in the wind-speed 
and dry-bulb measurements. 



Figure 2. Vertical profiles of absolute humidity, wind speed and air temperature
above short grass. Abscissa: $(S_{25} - S_z)/(S_{25} - S_{50})$; ordinate: height (logarithmic scale).

symbol    observation no.     mean Richardson no. at 75 cm.
×         16, 17, 20          −0.056
⊙         18, 22, 26, 29      −0.005
△         19, 23              +0.067

Since in the following analysis considerable importance is attached to the form of 
the vertical wind profile, a more detailed demonstration of this feature is of some 
interest at this stage. Wind profiles corresponding to the individual observations 
in the more accurate group (nos. 15 to 30) are shown in figure 3, the parameter
$(u_{25} - u_z)/(u_{25} - u_{50})$ being plotted against height on a logarithmic scale. The profiles are here
arranged in the order of the Richardson numbers at 75 cm., and in order to emphasize
the variation of the profile with the latter number straight lines have been
drawn through the points at 25 and 50 cm. With the exception of a single observation
(no. 30) these wind profiles exhibit, in a more detailed and systematic fashion,
the variation with Richardson number already indicated by the three selected
groups above. The essential feature is that when the Richardson number is numerically
small, i.e. in near-neutral conditions, there is a linear relation between $u$ and
$\log z$. In unstable conditions (Richardson number negative) $u$ increases less rapidly
than $\log z$, while in stable conditions it increases more rapidly than $\log z$. Analytical
interpretations of these profiles will be noted in subsequent stages of this paper. 



Figure 3. Vertical profiles of wind velocity over a short grass surface
in relation to Richardson number ($Ri$). Profiles are arranged in order of $Ri \times 10^3$ (at 75 cm.):
−125, −66, −52, −51, −19, −17, −17, −8, −5, −4, −4, −1, +4, +58, +76, +123.


In general two procedures have been followed, according to the general consistency 
of the profile data, in evaluating the vertical gradients of temperature, humidity and 
wind velocity. For observations 6 to 14 smooth curves were drawn through the 
individual $S$/log $z$ profiles and tangents drawn to these curves at approximately the
mid-point (actually 75 cm.), i.e. well away from the extremities of the curve, so that
experimental scatter and personal error in drawing would be unlikely to lead to 
unreproducible results. For observations 15 to 30 the greater reliability attached to 
the differences over the various height intervals involved in the interchanging process 
permitted a reproducible arithmetical derivation of gradients at a number of heights. 
The basis of this is that, except in cases of extreme curvature of the log height curve,
the variations with height of the property $S$ may be represented approximately
by the equation

$$S = a + b \log (z + c),$$

whence

$$\frac{dS}{dz} = \frac{b}{z + c},$$

where $b$ and $c$ may be derived from values of the property $S$ at three heights. Actually
$c$ was found to vary systematically with height, so the method adopted was to obtain
values of $c$ appropriate to the lower and higher sections of the profile, using the
differences 200−100, 150−37.5 and 150−37.5, 50−25 respectively. These values
of $c$ were used to deduce $dS/dz$ at 150 and 37.5 cm. respectively, while a mean of
the two values was used in computing $dS/dz$ at 75 cm. The values so obtained are
reproduced in table 3, $u$, $\chi$ and $T$ referring to wind speed, absolute humidity and
absolute temperature respectively. Values of the Richardson number are also
tabulated there. A few omissions occur in groups C, C', D and D' when gradients
could not be derived with confidence, owing either to inefficient performance of the
100 cm. wet bulb or to apparently spurious features in some part of the profile.
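
One way of carrying out an arithmetical derivation of this kind is sketched below. It is not the procedure actually used, which is described only in outline: the sketch assumes natural logarithms, takes values of $S$ at three heights rather than the particular pairs of differences quoted, and finds $c$ by bisection on the ratio of the two differences. All names and sample figures are hypothetical.

```python
import math

def fit_log_profile(heights, values, c_max=1.0e4, tol=1.0e-10):
    """Fit S = a + b*ln(z + c) through three (z, S) points; return (a, b, c)."""
    z1, z2, z3 = heights
    s1, s2, s3 = values
    target = (s3 - s2) / (s2 - s1)

    def mismatch(c):
        # Ratio of the two logarithmic increments minus the observed ratio.
        upper = math.log((z3 + c) / (z2 + c))
        lower = math.log((z2 + c) / (z1 + c))
        return upper / lower - target

    lo, hi = -z1 + 1.0e-9, c_max
    if mismatch(lo) * mismatch(hi) > 0:
        raise ValueError("no value of c found in the search interval")
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if mismatch(lo) * mismatch(mid) <= 0:
            hi = mid
        else:
            lo = mid
    c = 0.5 * (lo + hi)
    b = (s2 - s1) / math.log((z2 + c) / (z1 + c))
    a = s1 - b * math.log(z1 + c)
    return a, b, c

def gradient(b, c, z):
    """dS/dz = b / (z + c) for the fitted profile."""
    return b / (z + c)

# Synthetic check: a profile generated with a = 1, b = 2, c = 5 (heights in cm).
heights = (25.0, 50.0, 100.0)
values = tuple(1.0 + 2.0 * math.log(z + 5.0) for z in heights)
a, b, c = fit_log_profile(heights, values)
print(round(a, 6), round(b, 6), round(c, 6), gradient(b, c, 75.0))
```

With $c = 0$ the expression reduces to the simple logarithmic profile, so the fitted value of $c$ is itself a measure of the departure from linearity on the log height plot.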


5. Eddy diffusivity and evaporation in neutral conditions


From the observed values of the rate of evaporation and vertical gradient of 
humidity we may now derive magnitudes of the vertical component of the eddy 
diffusivity for water vapour, $K_V$, and compare these with the corresponding magnitudes
for momentum, $K_M$. Consideration will be restricted in the first instance to
the simpler case in which the atmosphere is in a neutral state as indicated by numerically
small values of the Richardson number. In specifying $K_M$ the deficiencies
of existing theoretical approaches may be avoided, following Calder (1949), by
appealing directly to the well-established law for aerodynamically rough flow through
pipes (Nikuradse 1933) and over flat plates (Schlichting 1936), and to the evidence 
for the validity of this law in the surface layers of the atmosphere. The laboratory 
law, written in a form convenient for meteorological application (following Prandtl 



where is the mean wind speed at height », Tq the horizontal shearing stress at 
z = 0 , p the air density, h an aerodynamical constant of value 0 ’ 40 , and Zj the rough- 
ness parameter of the surface. 

Differentiating equation (1) and substituting in the equation defining $K_M$, i.e.

$$\tau = \rho K_M \frac{\partial u}{\partial z},$$

we have, with the customary assumption (proved by Ertel 1933) of the constancy
of $\tau$ with height in the lower atmosphere,

$$K_M = \frac{k^2 u_1 z}{\log_e (z_1/z_0)}, \qquad (2)$$

where $u_1$ is the mean wind speed at some arbitrarily chosen level $z_1$, or

$$\frac{K_M}{z^2\,\partial u/\partial z} = k^2. \qquad (3)$$


The applicability of the functional form of equation (1) to the lower layers of the
atmosphere in neutral conditions is now widely accepted, and the profile data already
discussed here provide additional confirmation of this feature for a short grass surface.
However, the feature which is of most critical importance in the present analysis
is the explicit validity of equation (1) in atmospheric flow. This was established
recently by Sheppard (1947), who measured the drag of the earth's surface (i.e. the
horizontal shearing stress) in near-neutral conditions and substituting in equation
(1) obtained values for $k$ close to 0.4. Thus, for neutral atmospheric flow near the
surface, we shall assume that equations (2) and (3) with $k = 0.4$ specify true values
of $K_M$, and proceed to compare such values with those of $K_V$ deduced directly from
the present measurements.
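
A minimal numerical sketch of the neutral relations, assuming the logarithmic profile $u \propto \log(z/z_0)$ and the forms $K_M = k^2 u_1 z/\log(z_1/z_0)$ and $K_M/(z^2\,\partial u/\partial z) = k^2$. The wind speed is that quoted for observation 8 at 200 cm.; the roughness parameter of 0.25 cm. is the value derived later in the paper, and the function names are hypothetical.

```python
import math

K_CONST = 0.4  # aerodynamical constant k quoted in the text

def wind_gradient(z, u1, z1, z0):
    """du/dz obtained by differentiating the logarithmic profile."""
    return u1 / (z * math.log(z1 / z0))

def eddy_diffusivity_momentum(z, u1, z1, z0, k=K_CONST):
    """K_M = k^2 * u1 * z / log(z1/z0), with u1 the wind speed at level z1."""
    return k ** 2 * u1 * z / math.log(z1 / z0)

# Illustrative inputs: u_200 = 512 cm/sec (observation 8); z0 = 0.25 cm assumed.
u1, z1, z0, z = 512.0, 200.0, 0.25, 75.0
k_m = eddy_diffusivity_momentum(z, u1, z1, z0)
ratio = k_m / (z ** 2 * wind_gradient(z, u1, z1, z0))
print(f"K_M at {z} cm = {k_m:.0f} cm^2/sec, K_M/(z^2 du/dz) = {ratio:.3f}")
```

The ratio comes out as $k^2 = 0.16$ whatever wind speed or roughness is supplied, which is why it serves as a convenient non-dimensional yardstick for the measured $K_V$.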



Making the reasonable assumption of negligible horizontal variation of absolute
humidity, $K_V$ may be directly obtained from the appropriate integrated equation of
transfer (neglecting variations of air density with height), as follows:

$$K_V \frac{\partial x}{\partial z} + E = \int_0^z \frac{\partial x}{\partial t}\,dz, \qquad (4)$$

employing the time mean values of $\partial x/\partial z$, $\partial x/\partial t$ and rate of evaporation $E$. Values
of $E$, obtained directly from the water losses shown by the 4 in. soil evaporimeters,
are included in table 3. In practice, however, it was easily verified that for the
shallow layer involved here the term on the right-hand side of equation (4) could
be neglected. Indeed, the magnitude of

$$\frac{1}{E}\int_0^{z=200} \frac{\partial x}{\partial t}\,dz$$

for many of the observations was less than 0.005, while the maximum value, occurring
with the low rates of evaporation of observations 9, 23 and 28, was only of the order
of 0.02. Thus with sufficient accuracy

$$K_V = -\frac{E}{\partial x/\partial z}, \qquad (5)$$

and in order to facilitate comparison with the form given for $K_M$ in equation (3), the
above expression is reduced to the non-dimensional parameter

$$\frac{K_V}{z^2\,\partial u/\partial z} = \frac{-E}{z^2\,\dfrac{\partial x}{\partial z}\,\dfrac{\partial u}{\partial z}}$$

and is tabulated in this form in table 3.

If $K_V$ and $K_M$ are identical then from equations (3) and (5) we should have in
neutral conditions

$$\left(\frac{-E}{z^2\,\dfrac{\partial x}{\partial z}\,\dfrac{\partial u}{\partial z}}\right)^{1/2} = k = 0.4. \qquad (6)$$

Values of the above parameter, corresponding to cases with Richardson number at
75 cm. ($Ri_{75}$) numerically less than 0.005, have been obtained from table 3 and are
reproduced below. A considerable range of wind speed is covered by these cases:

observation               $u_{200}$      height     $\left(\dfrac{-E}{z^2\,\partial x/\partial z\;\partial u/\partial z}\right)^{1/2}$
    no.       $Ri_{75}$   (cm./sec.)     (cm.)
     8         0.000        512           75        0.39
    13         0.003        244           75        0.51
    18        -0.001        422           37.5      0.41
                                          75        0.44
                                         150        0.44
    22        -0.004        490           37.5      0.36
                                          75        0.37
                                         150        0.39
    27         0.004        376           75        0.38
                                         150        0.37
    30        -0.004        664           75        0.46
                                              mean  0.41

It is seen that the above figures, though somewhat variable, as might well be expected
from the nature of the measurements made, have a mean value very close to 0.40.
As far as is known this provides for the first time a direct verification of the identity
of the eddy diffusivity for water vapour with that for momentum.
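
The quoted mean can be checked directly from the eleven near-neutral values listed for observations 8, 13, 18, 22, 27 and 30. The short sketch below only repeats that arithmetic; the variable names are illustrative.

```python
# Near-neutral values of the non-dimensional parameter, keyed by observation no.
neutral_values = {
    8: [0.39],
    13: [0.51],
    18: [0.41, 0.44, 0.44],
    22: [0.36, 0.37, 0.39],
    27: [0.38, 0.37],
    30: [0.46],
}

flat = [v for vals in neutral_values.values() for v in vals]
mean = sum(flat) / len(flat)
spread = max(flat) - min(flat)
print(f"n = {len(flat)}, mean = {mean:.2f}, range = {spread:.2f}")
# n = 11, mean = 0.41, range = 0.15
```

The mean lies within about 3 % of $k = 0.40$, although individual values scatter by up to roughly 25 % about it.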


6. The influence of atmospheric stability


We may now turn to a more general consideration of the data assembled in table 3,
where a substantial variation in the stability of the atmosphere is indicated by the
wide range of the Richardson number, which at a height of 75 cm. amounts to
-0.125 to +0.123. The foregoing considerations have demonstrated that in neutral
conditions both wind speed and absolute humidity vary linearly with the logarithm
of the height in the surface layer and that the parameter

$$\frac{K_V}{z^2\,\partial u/\partial z} = \frac{-E}{z^2\,\dfrac{\partial x}{\partial z}\,\dfrac{\partial u}{\partial z}}$$

is constant. In other atmospheric conditions we have already seen that the vertical
profiles of wind speed and absolute humidity depart from the simple logarithmic
relation in a similar and systematic fashion. It is now evident from table 3 that in
such conditions the above parameter is no longer constant but varies, between
very wide limits, over the range of atmospheric stability involved. An obvious step
was to investigate the possibility of a connexion between this parameter and the
Richardson number, and the result is shown graphically in figure 4, where all values
are plotted. A consistent variation of the parameter with the Richardson number is
exhibited, and there is no suggestion of a systematic separation of the points
according to the height above the boundary. A close approach to a unique relation
between $K_V/(z^2\,\partial u/\partial z)$ and Richardson number is thus established for the shallow
surface layer here considered. For purposes of further discussion a smooth curve
has been drawn through the points and corresponding values tabulated below:

$Ri$                                 -0.25  -0.20  -0.15  -0.10  -0.05   0.00  +0.05  +0.10
$K_V/(z^2\,\partial u/\partial z)$    0.72   0.58   0.46   0.34   0.24   0.17   0.11   0.07


An increase in the diffusive action of the atmosphere in unstable conditions, and
a reduction in stable conditions, both of which are demonstrated by the above data,
are to be expected on any plausible theory of turbulent transport. The main interest
of the present results lies partly in the magnitude of the effect, and we note that
a tenfold variation in the parameter $K_V/(z^2\,\partial u/\partial z)$ occurs over the range of stability
covered here. However, we should also note that the high numerical values of the
Richardson number, and hence the greatest modification of the above parameter,
are associated with the 150 cm. level. The numerical values of the Richardson
number decrease rapidly as the ground is approached, and at the lowest level the
range of numbers computed here is only -0.036 to +0.046. The corresponding
smoothed values of the parameter $K_V/(z^2\,\partial u/\partial z)$ are [...] 0.11, i.e. a variation of
about 30 % on the value of 0.17 at zero Richardson number. Thus, at very low heights,
say below 50 cm., the influence of a moderate range of stability could be neglected,
and equations (1) and (2) applied to give the eddy diffusivity, without incurring
errors greater than about 30 %. At height ranges more customarily adopted, however,
the error so introduced would be very large, and we therefore require to give further
consideration to the present results with particular reference to the effect of
atmospheric stability.

[Table 3: the tabulated values are not legible in the source.]

Table 4. Soil temperature data and heat capacities

[Table 4: the tabulated values are not legible in the source.]




Figure 4. Relation between eddy diffusivity for water vapour ($K_V$), vertical wind gradient
($\partial u/\partial z$) and Richardson number ($Ri$) at various heights above a short grass surface.

group     A    B    C    C'   D    D'
symbol    ○    ×    △    □    ◇    ●

undashed points   z = 75 cm.
single dash       z = 37.5 cm.
double dash       z = 150 cm.


An attempt to prescribe theoretically the effects of atmospheric stability on the
form of the wind profile in the boundary layer has been made by Rossby & Montgomery
(1935). This treatment, the details of which it is not proposed to discuss in
this paper, leads to an expression relating wind shear and horizontal shearing stress,
from which the following equation may be derived:

$$\frac{K_M}{z^2\,\partial u/\partial z} = \frac{k^2}{1 + \sigma Ri}, \qquad (7)$$

where $k$ is von Kármán's constant and $\sigma$ is a proportionality factor introduced in
the relation between eddying energy and potential energy associated with thermal
stratification. Deacon (1949) finds that the Rossby-Montgomery formula does not
agree with wind-gradient observations at large stability and instability, and
demonstrates that a modified formula, proposed by Holzman (1943) entirely on empirical
grounds and in the interests of mathematical tractability, provides a better
representation. Holzman's formula may be reduced to the form

$$\frac{K_M}{z^2\,\partial u/\partial z} = k^2 (1 - \sigma Ri), \qquad (8)$$

which it will be seen is identical with equation (7) when the Richardson number
is small.

Equations (7) and (8) are readily compared with the experimental results summarized
above, and it is found that neither equation can satisfactorily describe the
whole range of results. It is noteworthy, however, that reasonable agreement with
the results in stable conditions is provided by the Rossby-Montgomery equation
with $\sigma$ equal to about 12, while in unstable conditions the Holzman formula gives
corresponding agreement using the same value of $\sigma$. No physical explanation is
immediately evident for this feature, nor for the magnitude attached to $\sigma$. Thus,
in both functional and explicit form the equations put forward by Rossby & Montgomery
and Holzman fail to provide a completely satisfactory basis for expressing
the observed relation between eddy diffusivity and Richardson number. To what
extent this is due to deficiencies in the derivation of these equations or to a genuine
difference between $K_V$ and $K_M$ in non-neutral conditions is somewhat problematical.
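
The comparison can be sketched numerically, assuming the Rossby-Montgomery and Holzman expressions take the forms $k^2/(1+\sigma Ri)$ and $k^2(1-\sigma Ri)$ respectively (the second is a reading of a badly damaged equation and should be treated as an assumption), with $k = 0.4$ and $\sigma = 12$. The smoothed observed values are those tabulated earlier in this section.

```python
K_CONST = 0.4
SIGMA = 12.0  # value the text reports as fitting each formula on one side only

# Smoothed observed values of K_V/(z^2 du/dz) against Richardson number.
SMOOTHED = [(-0.25, 0.72), (-0.20, 0.58), (-0.15, 0.46), (-0.10, 0.34),
            (-0.05, 0.24), (0.00, 0.17), (0.05, 0.11), (0.10, 0.07)]

def rossby_montgomery(ri, sigma=SIGMA, k=K_CONST):
    """Assumed form of equation (7): k^2 / (1 + sigma*Ri)."""
    return k ** 2 / (1.0 + sigma * ri)

def holzman(ri, sigma=SIGMA, k=K_CONST):
    """Assumed form of equation (8): k^2 * (1 - sigma*Ri)."""
    return k ** 2 * (1.0 - sigma * ri)

for ri, observed in SMOOTHED:
    print(f"{ri:+.2f}  observed {observed:.2f}  "
          f"R-M {rossby_montgomery(ri):+.3f}  Holzman {holzman(ri):+.3f}")
```

On these assumptions the Rossby-Montgomery form stays within about 0.01 of the smoothed values for $Ri \ge 0$ but turns negative once $\sigma Ri < -1$, while the Holzman form follows the unstable side and turns negative for $\sigma Ri > 1$, in line with the statement that neither covers the whole range.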

An alternative basis for examining the present results is to be found in the generalized
wind-profile law recently put forward by Deacon (1949), who found that an
extensive series of wind profiles, exhibiting characteristics similar to those shown
in figure 3, was fairly well represented by the equation

$$\frac{\partial u}{\partial z} = a z^{-\beta}, \qquad (9)$$

in which the exponent $\beta$ is $>1$, $=1$ and $<1$ in unstable, neutral and stable
conditions respectively. Integrating this empirical law and making use of the observed
decrease in stability influence on the wind profile as the boundary is approached,
Deacon obtained

$$u_z = \frac{1}{k}\sqrt{\frac{\tau_0}{\rho}}\;\frac{1}{1-\beta}\left[\left(\frac{z}{z_0}\right)^{1-\beta} - 1\right], \qquad (10)$$

which reduces to equation (1) for small values of $z$ and values of $\beta$ not differing greatly
from unity. Actually it was found that values of $z_0$ evaluated from equation (10)
showed a considerable variation with stability, which is inconsistent with the observed
decrease of stability influence on the wind profile as the surface is approached. Deacon
suggests that the variation is probably due to $\beta$ not being quite constant with height,
except possibly in very stable conditions when aerodynamically smooth flow may
conceivably occur near the surface and so lead to a genuine reduction in the roughness
parameter. In order to avoid this inconsistency in the application of equation (10)
Deacon makes the assumption that $z_0$ is independent of stability (except possibly in
very stable conditions), its value being prescribed by equation (1) and wind-profile
measurements in neutral conditions. Using this value of $z_0$ and wind-profile
measurements over the same surface in other conditions of stability, equation (10) is then
employed to calculate a mean value of $\beta$ for the height layer involved.

From equation (10) it is easily shown that $K_M$ is given by

$$K_M = k^2 z_0^{\,2-2\beta} z^{2\beta}\,\frac{\partial u}{\partial z}$$

or

$$\frac{K_M}{z^2\,\partial u/\partial z} = k^2 \left(\frac{z}{z_0}\right)^{2\beta-2}, \qquad (11)$$

a form which is easily comparable with our results for $K_V$ and requires only the
derivation of $z_0$ and $\beta$, for which the following process was adopted. For observations
nos. 15 to 30, values of the wind ratio $u_{150}/u_{37.5}$ (which involves the largest height
interval to which instrument interchanging was applied) were plotted against the
Richardson number at 75 cm. and found to lie on a fairly smooth curve. The
interpolated value of this ratio for $Ri = 0$ was 1.275, which with equation (1) gave
$z_0 = 0.25$ cm. From equation (10)

$$\frac{u_{150}}{u_{37.5}} = \frac{(150/z_0)^{1-\beta} - 1}{(37.5/z_0)^{1-\beta} - 1}, \qquad (12)$$

and using the above value of $z_0$ expression (12) was evaluated for a range of values
of $\beta$. A graphical relation was thus obtained between $\beta$ and the Richardson number,
and hence between the Richardson number and $k^2 (z/z_0)^{2\beta-2}$, with $k = 0.4$ and
$z = 75$ cm.

Interpolated values of the various parameters, together with the observed magnitudes
of $K_V/(z^2\,\partial u/\partial z)$, are tabulated below:

Richardson no.                                $K_M/(z^2\,\partial u/\partial z)$   $K_V/(z^2\,\partial u/\partial z)$
  at 75 cm.     $u_{150}/u_{37.5}$   $\beta$     computed (z = 75 cm.)                 observed
   -0.125           1.209            1.085           0.42                            0.39
   -0.10            1.216            1.075           0.37                            0.34
   -0.05            1.237            1.045           0.26                            0.24
   +0.05            1.370            0.915           0.06                            0.11
   +0.10            1.525            0.790           0.015                           0.07
   +0.125           1.640            0.715           0.005                           0.06


A most striking agreement between the computed $K_M/(z^2\,\partial u/\partial z)$ and the observed
$K_V/(z^2\,\partial u/\partial z)$ occurs in unstable conditions, the difference between the two values
being less than 10 %, which is almost as good as the agreement obtained in the
corresponding comparison in neutral conditions. With the increase of atmospheric
stability, a considerable disparity appears in the two sets of values, which is consistent
with the previous suggestion that equation (10), with $z_0$ independent of stability, is
invalid at marked stability. It is important to realize here that equation (10), and
hence equation (11), though fairly well established in functional form, has not been
verified explicitly. Thus, the present considerations differ from those previously
applied to neutral conditions in that a direct test of the identity of $K_V$ and $K_M$
cannot be made. However, these considerations do show that for the range of
unstable conditions involved

$K_V$ (observed) = $K_M$ (computed from equations (10) and (11)),

provided, of course, the roughness parameter, $z_0$, is determined from equation (1)
and wind-profile measurements in neutral conditions over the same surface.
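
The procedure of this section can be retraced numerically. The sketch below assumes the logarithmic law for the neutral wind ratio, Deacon's integrated profile in the form $u \propto [(z/z_0)^{1-\beta} - 1]/(1-\beta)$, and the resulting parameter $k^2 (z/z_0)^{2\beta-2}$; the function names are hypothetical, and the $\beta$ values are those quoted in the table of this section.

```python
import math

K_CONST = 0.4

def z0_from_neutral_ratio(ratio, z_hi=150.0, z_lo=37.5):
    """Solve log(z_hi/z0) / log(z_lo/z0) = ratio for z0 (logarithmic law)."""
    log_z0 = (ratio * math.log(z_lo) - math.log(z_hi)) / (ratio - 1.0)
    return math.exp(log_z0)

def deacon_wind_ratio(beta, z0, z_hi=150.0, z_lo=37.5):
    """u(z_hi)/u(z_lo) from Deacon's profile; the log law is the beta -> 1 limit."""
    p = 1.0 - beta
    if abs(p) < 1e-12:
        return math.log(z_hi / z0) / math.log(z_lo / z0)
    return ((z_hi / z0) ** p - 1.0) / ((z_lo / z0) ** p - 1.0)

def deacon_parameter(beta, z, z0, k=K_CONST):
    """K_M / (z^2 du/dz) = k^2 * (z/z0)^(2*beta - 2)."""
    return k ** 2 * (z / z0) ** (2.0 * beta - 2.0)

print(f"z0 from neutral ratio 1.275: {z0_from_neutral_ratio(1.275):.3f} cm")
for beta in (1.085, 1.075, 1.045, 0.915, 0.790, 0.715):
    print(f"beta {beta:.3f}: wind ratio {deacon_wind_ratio(beta, 0.25):.3f}, "
          f"parameter {deacon_parameter(beta, 75.0, 0.25):.3f}")
```

The neutral ratio of 1.275 gives a roughness parameter of about 0.24 cm., consistent with the rounded 0.25 cm. quoted, and the computed parameters agree with the tabulated column to within rounding and graphical-interpolation error.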


7. Evaluation of the heat-exchange components

The balance of heat-exchange processes near the earth's surface may be expressed
as follows. If

$R_s$ is the incoming solar radiation (direct and diffuse),

$R_r$ the reflected component of solar radiation,

$R_z$ the upward flux of long-wave radiation (i.e. the net effect of radiation from
ground and atmosphere), at height $z$,

$S$ the heat absorbed in soil,

$L$ the heat associated with evaporation from the ground, and

$Q_z$ the upward turbulent flux of heat at height $z$,

then, in the absence of advection effects, and neglecting the minute absorption of
solar radiation in photosynthetic processes,

$$R_s = R_r + S + L + R_z + Q_z + \int_0^z c_p \rho\,\frac{\partial T}{\partial t}\,dz, \qquad (13)$$

$T$, $\rho$ and $c_p$ being the absolute temperature, density and specific heat at constant
pressure of the atmosphere. From the present measurements all terms with the
exception of $Q_z$ may be evaluated. Equation (13) may thus be solved for $Q_z$, and
finally, since by definition

$$Q_z = -K_H c_p \rho \left(\frac{\partial T}{\partial z} + \Gamma\right), \qquad (14)$$

the magnitude of the eddy diffusivity for heat, $K_H$, may be derived. Evaluation of
the terms $R_s$, $R_r$ and $L$ follows without further explanation. The term

$$\int_0^z c_p \rho\,\frac{\partial T}{\partial t}\,dz,$$

which represents the rate at which sensible heat is absorbed by the air layer up to
height $z$, was invariably less than 0.001 cal./cm.² min. and will be omitted from
further considerations. The terms $S$ and $R_z$ remain to be discussed.
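
A minimal sketch of how the balance yields the turbulent flux as a residual and, from it, an eddy diffusivity for heat. It assumes the small air-storage term is dropped, and that the flux is related to the potential-temperature gradient $\partial T/\partial z + \Gamma$ through $Q_z = -K_H c_p \rho\,(\partial T/\partial z + \Gamma)$. All input numbers, the air properties and the function names are hypothetical and chosen only to show the arithmetic.

```python
def turbulent_heat_flux(r_s, r_r, s, l, r_z):
    """Residual Q_z of the surface heat balance, in cal/cm^2 min."""
    return r_s - r_r - s - l - r_z

def eddy_diffusivity_heat(q_z, potential_gradient, cp=0.24, rho=1.2e-3):
    """K_H in cm^2/sec from Q_z (cal/cm^2 min) and d(theta)/dz (deg C/cm).

    cp (cal/g deg C) and rho (g/cm^3) are assumed typical values for air.
    """
    q_per_sec = q_z / 60.0  # convert from per minute to per second
    return -q_per_sec / (cp * rho * potential_gradient)

# Hypothetical daytime values in cal/cm^2 min.
q_z = turbulent_heat_flux(r_s=0.80, r_r=0.20, s=0.10, l=0.25, r_z=0.15)
k_h = eddy_diffusivity_heat(q_z, potential_gradient=-5.0e-3)
print(f"Q_z = {q_z:.3f} cal/cm^2 min, K_H = {k_h:.0f} cm^2/sec")
```

Because $Q_z$ is a difference of several comparable terms, errors in any one of them pass directly into $K_H$.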



The rate at which heat is absorbed in the soil per unit area of ground surface is
given by

$$S = \int_0^\infty \rho' c\,\frac{\partial \theta}{\partial t}\,dz = \int_0^z \rho' c\,\frac{\partial \theta}{\partial t}\,dz - \left(k' \frac{\partial \theta}{\partial z}\right)_z, \qquad (15)$$

where $z$ is the depth below the soil surface, $\theta$, $\rho'$, $c$, $k'$ are the temperature, density,
apparent specific heat and thermal conductivity of the soil medium. If, as frequently
occurred in the present measurements, the soil temperature $\theta$ passes through a
maximum or minimum value at depth $z'$ then

$$S = \int_0^{z'} \rho' c\,\frac{\partial \theta}{\partial t}\,dz, \qquad (16)$$

which may be readily evaluated graphically from the soil-temperature measurements,
using time mean values of $\partial\theta/\partial t$ and determined values for $\rho' c$ at various
depths. The apparent density of the soil, $\rho'$, is obtainable directly from the initial
weighings of the soil samples. Subsequent drying and reweighing gives the proportion
of soil material and water, and from this and the specific heat of the soil material
the apparent specific heat of the soil medium may be derived by simple proportion,
the air component being neglected. Determination of the specific heat of the soil
material had already been carried out by the School of Agriculture, on oven-dried
samples taken in the previous year from the present site, from depths 0 to 1, 2 to 3,
4 to 5 and 6 to 7 in. below the surface. The method of mixtures was employed, errors
due to heat of wetting of the material being avoided by mixing it with a known
quantity of water and performing the determination on the mixture. No systematic
variation with depth was apparent in the values of specific heat so obtained, and
the mean result was 0.20 ± 0.02. The value 0.20 was employed in all subsequent
calculations.
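
The "simple proportion" for the apparent specific heat can be sketched as a mass-weighted mean of the soil solids and the water, with the air neglected. The sample masses and volume below are hypothetical, and the units assumed for the specific heats (cal/g °C, with water taken as 1.0) are not stated explicitly in the text.

```python
def apparent_specific_heat(wet_mass, dry_mass, c_solid=0.20, c_water=1.0):
    """Mass-weighted specific heat of moist soil (air component neglected)."""
    water_mass = wet_mass - dry_mass
    return (dry_mass * c_solid + water_mass * c_water) / wet_mass

def heat_capacity_per_volume(wet_mass, dry_mass, volume, c_solid=0.20):
    """rho' * c: apparent density times apparent specific heat."""
    apparent_density = wet_mass / volume
    return apparent_density * apparent_specific_heat(wet_mass, dry_mass, c_solid)

# Hypothetical sample: 150 g as collected, 120 g after oven drying, 100 cm^3.
c = apparent_specific_heat(150.0, 120.0)
rho_c = heat_capacity_per_volume(150.0, 120.0, 100.0)
print(f"apparent specific heat = {c:.2f}, rho'c = {rho_c:.2f} cal/cm^3 deg C")
```

On this basis the quoted uncertainty of ± 0.02 in the solids' specific heat matters less than the water content, since each gram of water carries five times the heat capacity of a gram of solids.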


The soil-temperature data are summarized in table 4 in the form of mean soil
temperatures at the various depths for each of the observations which included
radiation measurements. With these are given the corresponding changes of
temperature during the observation period, from which mean values of $\partial\theta/\partial t$ may be
derived by dividing by the effective observation period (57 min.). Since the change
of soil temperature occurs slowly and smoothly little error is likely to result from
this simplified procedure. At the bottom of table 4 are given the data relevant to
the three sets of soil samples, the values of $\rho' c$ finally obtained being used in
conjunction with the corresponding temperature observations. It may be noted that
the water content of the 0 to 1 in. layer includes that associated with the vegetation
cover and in the evaluation of $\rho' c$ for this layer it is implicitly assumed that the specific
heat of the solid material of the vegetation is the same as that of the soil solids.
Since the vegetation cover could only represent a minute fraction of the mass of
the 0 to 1 in. layer it is obvious that the error so involved is completely negligible.
From graphs of $\rho' c$ against the mean depth of layer, smooth values were read
off for the levels of temperature measurement (down to 8 in.). For observations in
which $z'$ was less than 8 in. (nos. 9, 10, 15, 16, 19, 20, 24, 26 and 28) values of
$\rho' c\,\partial\theta/\partial t$ were then plotted against depth $z$ and expression (16) evaluated by
measuring the area between the $z$-axis and the curve so obtained, with a planimeter,
over the limits $z = 0$, $z = z'$.
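
The planimeter measurement amounts to integrating $\rho' c\,\partial\theta/\partial t$ over depth. A minimal numerical stand-in, using the trapezoidal rule in place of the graphical method, might look as follows. The depths, heat capacities and temperature changes are hypothetical, and only the 57 min. averaging period is taken from the text.

```python
def soil_heat_storage(depths_cm, rho_c, delta_theta, period_min=57.0):
    """Trapezoidal estimate of the integral of rho'c * d(theta)/dt over depth.

    depths_cm   : measurement depths, increasing from the surface (cm)
    rho_c       : heat capacity per unit volume at each depth (cal/cm^3 deg C)
    delta_theta : temperature change over the observation period (deg C)
    Returns the rate of heat storage in cal/cm^2 min.
    """
    if not (len(depths_cm) == len(rho_c) == len(delta_theta)):
        raise ValueError("all three sequences must have the same length")
    integrand = [rc * dth / period_min for rc, dth in zip(rho_c, delta_theta)]
    total = 0.0
    for i in range(1, len(depths_cm)):
        dz = depths_cm[i] - depths_cm[i - 1]
        total += 0.5 * (integrand[i] + integrand[i - 1]) * dz
    return total

# Hypothetical profile down to z' = 4 in. (10.16 cm).
depths = [0.0, 2.54, 5.08, 10.16]
rho_c = [0.55, 0.55, 0.60, 0.60]
delta_theta = [1.5, 1.0, 0.5, 0.2]
print(f"S = {soil_heat_storage(depths, rho_c, delta_theta):.3f} cal/cm^2 min")
```

The trapezoidal rule stands in for the planimeter here; with only a handful of measurement depths, how the curve is drawn between them controls the accuracy in either case.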

In the cases when the soil temperature did not pass through a maximum or
minimum above the 8 in. level it was necessary to use equation (15) with $z = 8$ in.,
and to evaluate $k'$ at this level. This was readily achieved from the previous cases
($z'$ less than 8 in.), since we may write

$$\int_{z'}^{z = 8\ \mathrm{in.}} \rho' c\,\frac{\partial \theta}{\partial t}\,dz = \left(k' \frac{\partial \theta}{\partial z}\right)_{z = 8\ \mathrm{in.}}, \qquad (17)$$

the left-hand side of the equation being evaluated as above and $\partial\theta/\partial z$ being obtained
by drawing tangents to the soil-temperature profiles. Inspection of the
soil-temperature data summarized in table 4 indicates the small value of $\partial\theta/\partial z$ normally
occurring in the region of 8 in. depth, so that in general the term $k'\,\partial\theta/\partial z$ is of the nature

of a small correction. The magnitudes of $k'$ obtained as described from four observations
(nos. 9, 10, 19 and 20) in which the magnitudes of $\partial\theta/\partial t$ and $\partial\theta/\partial z$ were highest,
were 20, 58, 18 and 35 × 10⁻⁴ cal./°C cm. sec. respectively, and a mean value of
33 × 10⁻⁴ was used throughout for the few observations (nos. 6, 7, 8, 17 and 18)
where it was necessary to apply equation (15).

The long-wave radiation component from ground and atmosphere was derived by
the empirical method described by Robinson (1947), employing for convenience the
radiation chart described on p. 146 of that article so as to obtain directly the net
outward radiation at a given level. The 'water-path'-temperature curve below 2 m. was
constructed from the present temperature and humidity profiles, and the 'radiative
temperature' of the ground measured as previously described, using intervals 0 to
37.5, 37.5 to 50, 50 to 75, 75 to 100, 100 to 150, 150 to 200 cm. For the remainder of
the curve the necessary values of upper-air temperature and humidity (starting from
the 1000 or 950 mb. levels, according as the surface pressure was above or below
1025 mb.) were interpolated from previous and succeeding Downham Market
ascents. In all cases inspection of the synoptic situation suggested that no
substantial differences in air-mass properties were likely to exist over the distances
separating the present site and Downham Market (30 miles). Furthermore, the
observations for which these computations were made were carried out in good
visibility with in general less than 3/10 diffuse high cloud (see cloud data in table 3),
the cloud being in most cases at a fairly low angular elevation. No attempt has been
made to allow for the effects of these small cloud amounts. The worst case is
observation no. 20, when the mean cloud amount was 3/10 Ci and 2/10 Ac, and this has been
included deliberately in the present analysis so as to provide some appraisal of the
errors likely to accrue from neglect of the cloud factor.


8. Magnitudes of the eddy diffusivity for heat 

The measured and computed heat components are listed in table 5, and the
balance is attributed to turbulent transport. The values are specified to 0.001
cal./cm.² min., though we shall now see that errors considerably in excess of this may
be expected to occur. For the estimation of long-wave radiation, Robinson (1947)
suggests that the net outward radiation with clear skies is given by the radiation
chart with a probable error of ± 0.015 cal./cm.² min. The comparison of the
solarimeter with a Gorczynski or Ångström instrument suggested that the indications of
the former may be in error up to ± 2 %, i.e. ± 0.015 cal./cm.² min. in the maximum
values of incoming solar radiation measured in the present investigation. As for the
errors likely to result from heterogeneity of the thermal properties of the soil it may
be noted that if point-to-point variations in soil-moisture content are of the order of
the differences shown by the separate measurements summarized at the bottom of
table 4, then these could lead to errors of up to ± 10 % in the heat capacity per unit
volume of soil and hence ± 0.015 cal./cm.² min. in the maximum value of the heat
absorbed in the soil. These few examples suffice to indicate the very considerable
casual error which might accumulate in the balance of the various factors.
Furthermore, the present observations include two cases in which a direct assessment of
the resultant error is possible, since for observations nos. 8 and 18, which were
carried out on widely separated occasions, the gradient of potential temperature was
practically zero. In point of fact the values at 75 cm. were 2 × 10“^‘^C/cm.
and −1 × 10“^° C/cm. respectively. The turbulent transport of heat should therefore
be very small and the remaining components, i.e. those measured and estimated, 
should almost balance. Actually it will be seen from table 5 that the greater discrepancy
occurs in no. 18 and there amounts to 0.04 cal./cm.² min., a figure which
seems fairly consistent with the individual errors cited above.
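
The consistency claimed above can be checked with a little arithmetic. The sketch below combines the three error estimates quoted in the text in two ways: a straight sum (all errors acting in the same sense) and a root-sum-square. Treating the errors as independent, and the choice of these two combination rules, are assumptions made for illustration; the paper does not state how the errors were combined.

```python
import math

# Individual error estimates quoted in the text, in cal./cm.^2 min.
ERRORS = {
    "long-wave radiation (radiation chart)": 0.015,
    "solar radiation (solarimeter, 2 %)": 0.015,
    "soil heat (10 % in heat capacity)": 0.015,
}

def worst_case(errs):
    """All errors acting in the same direction: simple sum."""
    return sum(errs)

def root_sum_square(errs):
    """Independent errors combined in quadrature (an assumption, see lead-in)."""
    return math.sqrt(sum(e * e for e in errs))

linear = worst_case(ERRORS.values())
rss = root_sum_square(ERRORS.values())
print(f"worst case : {linear:.3f} cal./cm.^2 min.")
print(f"quadrature : {rss:.3f} cal./cm.^2 min.")
```

On either rule the observed discrepancy of 0.04 cal./cm.² min. is of the same order as the combined estimate, which is all that the text asserts.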

We now apply equation (14) and derive apparent values of $K_H$ for the remaining
observations. The appropriate magnitudes of $\left(\dfrac{\partial T}{\partial z} + \Gamma\right)$ at 75 cm. and the
resulting values of $K_H$ at this level are reproduced in table 5, together with the
corresponding values of $K_V$. It will be seen that the two diffusivities are of similar
orders of magnitude and range from 10^ to 10^. A more critical comparison is conveniently
made in terms of the non-dimensional parameters

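$$\frac{K_H}{z^2\,\partial u/\partial z} \quad\text{and}\quad \frac{K_V}{z^2\,\partial u/\partial z},$$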

which are also tabulated in table 5 and plotted against the Richardson number at 
the same level in figure 5. A quite remarkable identity is shown between $K_H$ and $K_V$
in stable conditions, while with the increase of instability the former coefficient is
systematically the greater of the two. With the exception of observation no. 20 all
values of $K_H$ conform to the line drawn through them to a degree which corresponds
to errors in the turbulent transport component of well within ± 0.04 cal./cm.² min.
This consistency is better than might be anticipated from the previous discussion 
of the magnitudes of casual errors. 

A feature which we have not considered so far is the possibility of systematic
error in unstable conditions. In order so to explain the observed differences in $K_H$
and $K_V$ the error would have to be in the form of an overestimation of the turbulent
heat flux to an extent of about 100 %, or a substantial underestimation in the heat of 
evaporation term. Considering the heat-balance terms there is no reason to suspect
any appropriately systematic error in the measured terms, solar radiation, soil heat
and heat of evaporation, and the only other term is the computed long-wave radiation
flux. In the computation of this factor the effects of small amounts of high cloud
were neglected, but this would result in a slight overestimation of the upward flux
of long-wave radiation and hence an underestimation of the turbulent heat flux.
Furthermore, the computation assumes the radiative temperature of the ground to
be given in all cases by a spirit thermometer laid on the ground. Actually Robinson’s
unpublished data suggest that with the sun’s altitude above 30° the method overestimates
the radiative temperature. As far as can be inferred from this somewhat
limited data the error in the present computations would amount at most to an
overestimation of the order of 0.01 cal./cm.² min. in the upward flux of long-wave
radiation and hence a corresponding underestimation in the upward turbulent flux.
Again, Robinson (1947, p. 146) has observed that the computed downward flux of
atmospheric radiation may be too high. This would lead to an overestimation of the
turbulent heat flux. However, the indications of Robinson’s measurements are that
this error could be of the order of 0.02 cal./cm.² min. These features are clearly
insufficient to modify the higher values of the turbulent heat flux (c. 0.20 cal./cm.²
min.) to the required extent.

Figure 5. Observed values of eddy diffusivities for heat ($K_H$) and water vapour ($K_V$) in
relation to wind velocity gradient ($\partial u/\partial z$) and Richardson number ($Ri$) at 75 cm. over
a short grass surface.

The above considerations of the accuracy of estimation of the turbulent heat flux 
provide no grounds for suspecting that the observed differences in $K_H$ and $K_V$ are
other than genuine and of the correct order of magnitude. Although it is not proposed 
in this paper to deal in any detail with theories of turbulent mixing it may now be 




Eddy diffusion of water vapour and heat near the ground 137 

noted that, in qualitative implication at any rate, the present results are not entirely 
unexpected. In a recent treatment of convection near the ground Sutton (1948) has 
assembled evidence for the existence of a difference in heat and momentum transport. 
Furthermore, Priestley & Swinbank (1947) have recently put forward a modification 
of the classical theory of turbulent transport of heat, taking into account buoyancy 
effects. The latter treatment possesses implications as regards the transfer of physical 
quantities other than that of heat, and provides reasons for expecting different 
values of the eddy diffusivity to apply to the transport of different properties. As 
far as is known the present results provide the first direct experimental demonstra- 
tion of the existence of this feature in the cases of matter and heat. 


9. Application to the indirect evaluation of natural evaporation

In conclusion, it is of interest to examine one immediate practical application of 
the results presented here. Before doing so a brief review of the essential conclusions 
is probably desirable. Measurements have been made of the factors which define 
the eddy diffusivity for water vapour, and a comparison made with corresponding 
measurements made elsewhere of the factors defining eddy diffusivity for momentum 
for flow in the absence of thermal stratification. Identity of these two diffusive 
properties of the atmosphere is thus directly confirmed. No such direct comparison 
has been possible for a thermally stratified atmosphere, but the results indicate the 
extent of the modifications imposed on the eddy diffusivity, and in so far as a recent 
functionally established law relating wind shear and shearing stress may be accepted 
explicitly the identity of $K_V$ and $K_M$ is shown to hold in unstable thermal stratifications,
though not in stable stratifications. However, a rapid decrease of stability
effect with decrease of height above the boundary is demonstrated. Finally, measurements
of the turbulent heat flux, from heat-balance considerations, lead to values of
$K_H$ which are reasonably equal to those of $K_V$ in stable conditions but are substantially
and systematically higher in unstable conditions. No quantitative explanation
can at present be offered for the latter feature, though qualitatively it is
consistent with recent modifications of the theory of turbulent transport. Due 
regard should be given to the fact that these results have been obtained in a shallow 
surface layer of the atmosphere on a single well-exposed level site, and with this 
qualification in mind we may now consider their bearing on the problem of evaluating 
natural rate of evaporation from other more easily measured quantities. 

The difficulty of performing direct measurements of natural evaporation as a 
routine procedure, as distinct from the special test measurements employed here, 
has led to two distinct meteorological approaches to the problem. The ‘heat-balance’
method, initiated by Angstrom (1920) with reference to evaporation loss
from lakes, and applied recently to land evaporation by Penman (1948), depends
fundamentally on the assumption of identity in $K_H$ and $K_V$. Thus in our nomenclature
the heat-balance equation may be written


(18)

where L' is the latent heat of vaporization, so that with the appropriate measurements
or estimates $K_V$ and hence $K_V\,\partial\chi/\partial z$, i.e. the rate of evaporation, may be
derived. The advantage claimed for this method is that except for the above assumption
no knowledge of the processes of turbulent transport is required. The present
results only support this assumption in stable conditions. In unstable conditions,
for which values of $K_H$ approximately double those of $K_V$ are indicated here, considerable
error could arise in the computed rate of evaporation. It is easily seen from
the above equation that the error depends on the relative magnitudes of the vertical
gradients of temperature and absolute humidity, and that when the former is $10^6$
times the latter, a factor which was frequently attained and occasionally exceeded in
the present observations, the computed rate of evaporation would be 33 % too high.
Thus on the present indications the method may only be applied with confidence 
when the vertical temperature gradient (measured upwards) has a positive or a 
relatively small negative value, and is likely to be seriously in error for large negative 
gradients of temperature, when, other factors being constant, evaporation occurs 
at a maximum rate. 
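
The 33 % figure can be reproduced with a short calculation. The sketch below assumes that the turbulent part of the heat balance is the sum of a sensible-heat term proportional to $K_H$ times the temperature gradient and a latent-heat term proportional to $K_V$ times the humidity gradient, both fluxes being directed upwards. The air density, specific heat and latent heat used are round values assumed for illustration and are not taken from the paper.

```python
RHO = 1.2e-3    # air density, g/cm^3 (assumed round value)
CP = 0.24       # specific heat of air at constant pressure, cal/(g degC) (assumed)
LATENT = 590.0  # latent heat of vaporization, cal/g (assumed)

def evaporation_overestimate(gradient_ratio, kh_over_kv=2.0):
    """Fractional error in the computed evaporation when K_H = K_V is assumed
    although in fact K_H = kh_over_kv * K_V.

    gradient_ratio: |dT/dz| / |dchi/dz|, with T in degC and chi in g/cm^3.
    """
    sensible = RHO * CP * gradient_ratio  # sensible term per unit humidity gradient
    latent = LATENT                       # latent term per unit humidity gradient
    # Same total flux, partitioned with the true and with the assumed diffusivities.
    true_total = kh_over_kv * sensible + latent   # total flux divided by true K_V
    assumed_total = sensible + latent             # total flux divided by the single K
    # Computed E / true E = (single K) / (true K_V) = true_total / assumed_total.
    return true_total / assumed_total - 1.0

print(f"{evaporation_overestimate(1e6):.0%}")
```

With equal diffusivities (`kh_over_kv=1.0`) the function returns zero, as it should, and the error grows as the temperature gradient becomes larger relative to the humidity gradient.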

The ‘hydrodynamical’ method has been used by Sverdrup (1936) and others for 
the estimation of evaporation from the sea and applied recently by Thornthwaite & 
Holzman (1942) and Penman (1948) to land evaporation. Thornthwaite & Holzman’s 
method assumes identity of $K_V$ with $K_M$ computed from a wind-profile law of the
form of equation (1). The validity of this assumption is established here for air flow
over a land surface in the absence of thermal effects. For convenience of application
we may note that integration of equation (5) over the height range $z_1$ to $z_2$, with $K_V$
of the form given for $K_M$ in equation (2), leads, on rearrangement in terms of
equation (1), to

$$E = \frac{k^2\,(\chi_1-\chi_2)\,(u_2-u_1)}{\left(\ln \dfrac{z_2}{z_1}\right)^2} \qquad (19)$$

which is the equation applied generally by Thornthwaite & Holzman. Evaluation
of the rate of evaporation thus requires the measurement of wind speed and absolute
humidity at two heights, the value of the aerodynamic constant $k$ being 0.4. In
unstable conditions this equation no longer holds, since $K_V$ and $K_M$ are now given
by equation (11). From this we have, by a similar process of integration,


$$E = \frac{k^2\,(1-\beta)^2\,(\chi_1-\chi_2)\,(u_2-u_1)}{\left[(z_2/z_0)^{1-\beta}-(z_1/z_0)^{1-\beta}\right]^2} \qquad (20)$$


which is again evaluated by measurements of wind speed and humidity at two 
heights, $z_0$ being obtained from equation (1) and measurements in neutral conditions,
$\beta$ from equation (12) as described in § 6. The indications of the results discussed in
§ 5 and at the end of § 6 are that the above equations may be used to prescribe the 
rate of evaporation over short periods (1 hr. or more) with an accuracy usually within 
± 20 %, while over extended periods some improvement in this could reasonably be 
expected. In stable conditions neither of the above expressions can be supported
by experimental data, and at present it is necessary to appeal to the observed decrease
of stability influence as the boundary is approached and to apply equation (19) as 
an approximation, with measurements made preferably in the first Jm. above the 
surface. Since in general the rate of evaporation from a land surface in stable con- 
ditions of air flow may be expected to be of a low order, the error incurred by this 
approximation in estimating daily or longer period evaporation loss will rarely be 
of importance. 
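
As an illustration of how the neutral-condition expression is applied, the sketch below evaluates the Thornthwaite & Holzman form $E = k^2(\chi_1-\chi_2)(u_2-u_1)/[\ln(z_2/z_1)]^2$ from readings at two heights. The function name, the units and the sample readings are hypothetical and chosen only for illustration; the conversion to a depth of water assumes a water density of 1 g/cm³.

```python
import math

K_AERO = 0.4  # aerodynamic constant k, as quoted in the text

def evaporation_rate(u1, u2, chi1, chi2, z1, z2, k=K_AERO):
    """Rate of evaporation in g/(cm^2 s) for neutral conditions.

    u1, u2     : wind speed (cm/s) at heights z1 and z2
    chi1, chi2 : absolute humidity (g/cm^3) at heights z1 and z2
    z1, z2     : measurement heights (cm), with 0 < z1 < z2
    """
    if not 0 < z1 < z2:
        raise ValueError("heights must satisfy 0 < z1 < z2")
    return k**2 * (chi1 - chi2) * (u2 - u1) / math.log(z2 / z1) ** 2

# Illustrative (made-up) readings, not taken from the paper.
e = evaporation_rate(u1=300.0, u2=400.0, chi1=10.0e-6, chi2=9.5e-6, z1=50.0, z2=200.0)
# 1 g/cm^2 of water is a depth of 1 cm, i.e. 10 mm.
print(f"E = {e:.3e} g/(cm^2 s) = {e * 3600 * 10:.3f} mm/hr")
```

Only differences of wind speed and humidity between the two levels enter, so constant offsets in either instrument cancel, while errors in the differences carry through in direct proportion.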

It should be noted that if the height of the vegetation cover is not very small 
compared with the heights above the soil surface at which wind speed and humidity 
are measured, the wind-profile laws are only satisfied by applying a ‘zero-displacement’
correction (see, for example, Deacon 1949), which is attributed to the fact that
the boundary from which turbulent transport is effective is at some distance above 
the soil surface according to the height and density of the vegetation. Thus equation 
(1) becomes 



and equations (10), (19) and (20) are similarly modified in the z term. When, as is 
usual, it is not possible to estimate d directly with sufficient precision, it is necessary 
to measure the wind speed at three heights in neutral conditions and to evaluate d 
graphically from the equation 


$$\frac{u_2-u_1}{u_3-u_1} = \frac{\log(z_2-d)-\log(z_1-d)}{\log(z_3-d)-\log(z_1-d)}, \qquad (22)$$

which follows from equation (21). 
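
The text evaluates $d$ graphically. As an alternative sketch, the same relation between the wind-speed ratio and the logarithms of $(z-d)$ at three heights can be solved numerically by bisection. This assumes the ratio on the right-hand side increases steadily with $d$ between 0 and the lowest height, which holds for the example heights used here. The function names and the synthetic profile are hypothetical.

```python
import math

def profile_ratio(d, z1, z2, z3):
    """[log(z2-d) - log(z1-d)] / [log(z3-d) - log(z1-d)] for a trial displacement d."""
    base = math.log(z1 - d)
    return (math.log(z2 - d) - base) / (math.log(z3 - d) - base)

def zero_plane_displacement(u, z, tol=1e-6):
    """Solve for d by bisection on 0 <= d < z1.

    u, z: wind speeds and heights at three levels, with z[0] < z[1] < z[2].
    Bisection is substituted here for the graphical method of the text.
    """
    (u1, u2, u3), (z1, z2, z3) = u, z
    target = (u2 - u1) / (u3 - u1)
    lo, hi = 0.0, z1 - 1e-9
    if not profile_ratio(lo, z1, z2, z3) <= target <= profile_ratio(hi, z1, z2, z3):
        raise ValueError("no displacement in [0, z1) matches the measured ratio")
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if profile_ratio(mid, z1, z2, z3) < target:
            lo = mid   # ratio too small: d must be larger
        else:
            hi = mid
    return 0.5 * (lo + hi)

# Synthetic neutral profile with a known displacement (illustrative values only).
heights = (50.0, 100.0, 200.0)
true_d, z0, scale = 20.0, 1.0, 100.0
winds = tuple(scale * math.log((zi - true_d) / z0) for zi in heights)
d = zero_plane_displacement(winds, heights)
print(f"recovered d = {d:.2f} cm")
```

Because only a ratio of wind-speed differences is used, neither the roughness length nor the scale of the profile needs to be known in order to find $d$.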

Finally, it is desirable to emphasize that the demonstration provided here of the 
superiority of the present hydrodynamical approach over the classical heat-balance 
method is contrary to conclusions drawn by Penman (1948). It should be noted, 
however, that while favouring the latter method Penman finds that it overestimates
the daily free-water evaporation loss in midsummer, a result which would be expected 
from the present analysis. Furthermore, the hydrodynamical approach which 
Penman concludes to be inadequate differs fundamentally from the present method 
and is founded essentially on an empirical modification of a treatment which is 
basically justified only in the case of a so-called ‘aerodynamically smooth’ surface,
a condition which is recognized to be the exception rather than the rule over a 
natural land surface (see, for example, Sheppard 1947, p. 217, for a brief discussion 
of this feature). 

Acknowledgement is gratefully made to the Head and members of the School of 
Agriculture and the Director of the University Farm, Cambridge, for facilities
provided during the course of this work, to the Director of the Meteorological Office 
for permission to publish the results obtained and to Mr J. E. Skilling of the Meteorological
Office for assistance in the observational and analytical work.





References 

Angstrom, A. 1920 Geogr. Ann., Stockh., 2, 237.

Brunt, D. 1939 Physical and dynamical meteorology, 2nd ed., Cambridge Univ. Press.

Calder, K. L. 1949 Quart. J. Mech. App. Math. 12 (in press).

Deacon, E. L. 1949 Geophys. Mem. (to be published).

Ertel, H. 1933 Met. Z. 50, 386.

Holzman, B. 1943 Ann. N.Y. Acad. Sci. 44, 13.

Nikuradse, J. 1933 Forschungsh. Ver. dtsch. Ing. no. 361.

Pasquill, F. 1943 Proc. Roy. Soc. A, 182, 75.

Pasquill, F. 1949 Quart. J. R. Met. Soc. 75, no. 325 (in press).

Penman, H. L. 1948 Proc. Roy. Soc. A, 193, 120.

Prandtl, L. 1932 Beitr. Phys. frei. Atmos. 19, 188.

Priestley, C. H. B. & Swinbank, W. C. 1947 Proc. Roy. Soc. A, 189, 543.

Robinson, G. D. 1947 Quart. J. R. Met. Soc. 73, nos. 315-316, pp. 127-150, and unpublished
results.

Rossby, C. G. & Montgomery, R. B. 1935 Pap. Phys. Oceanogr. 3, no. 3.

Schlichting, H. 1936 Ingen. Arch. 7, 1.

Sheppard, P. A. 1940 J. Sci. Instrum. 17, 218.

Sheppard, P. A. 1947 Proc. Roy. Soc. A, 188, 208.

Sutton, O. G. 1934 Proc. Roy. Soc. A, 146, 701.

Sutton, O. G. 1947 Quart. J. R. Met. Soc. 73, nos. 317-318, pp. 257-281.

Sutton, O. G. 1948 Quart. J. R. Met. Soc. 74, no. 319, pp. 13-30.

Sverdrup, H. U. 1936 Ann. Hydrogr., Berl., 64, 41.

Taylor, G. I. 1915 Phil. Trans. A, 215, 1.

Taylor, G. I. 1917 Proc. Roy. Soc. A, 94, 137.

Taylor, G. I. 1931 Rapp. Cons. Explor. Mer, 76, 35.

Thornthwaite, C. W. & Holzman, B. 1942 Tech. Bull. U.S. Dep. Agric. no. 817.



The Royal Greenwich Observatory 
By Sir Harold Spencer Jones, F.R.S., Astronomer Royal
(Lecture delivered 27 January 1949. Received 27 January 1949)

[Plates 1 to 4]

The Royal Observatory was established by King Charles II in the year 1675 for the 
specific practical purpose of ‘rectifying the tables of the motions of the heavens, and 
the places of the fixed stars, so as to find out the so-much-desired longitude of
places for the perfecting the art of navigation’. At that time the most accurate star 
catalogue available was the catalogue of 1000 stars, prepared by Tycho Brahe 
about 1598; only 777 of the stars had been properly observed, and the star places, 
whose average errors were of the order of 1' to 2', were not sufficiently accurate for
the purpose of determining longitudes. The best tables for giving the position of the 
Moon were liable to errors as great as 20'. Flamsteed, who was appointed the first 
Astronomer Royal, realized that a good stock of observations, continued for many 
years, was needed in order to provide star places with all the accuracy that was 
attainable and to furnish positions of the Sun, Moon and planets which could serve 
as the basis for the construction of satisfactory tables of their motions. So, from its
very foundation, the Observatory started upon systematic and long-continued 
programmes of observation which, throughout its history, have formed its most 
significant and important contribution to astronomy. 


Meridian astronomy 

Until after the middle of the eighteenth century, when William Herschel embarked
upon his studies of the structure of the sidereal system and upon his observations 
of star clusters, nebulae and double stars, astronomical observation was concerned 
almost entirely with the positions and motions of the heavenly bodies. But long 
after Herschel’s pioneer work had opened up new fields of investigation, the
observations at Greenwich continued to follow closely the lines originally laid down. 
There were improvements, of course, in the design and construction of instruments 
and in their optical quality; there were progressive refinements in technique and
progressive improvements in accuracy, which in turn opened up new fields of 
investigation. The observations which Bradley made at Greenwich between 1750 
and 1762, amounting to about 60,000 in all, were of a higher accuracy than any 
made previously and are, in fact, the earliest observations which are precise enough 
to be of use to the astronomers of to-day. Bradley was particularly careful in 
examining the errors of his instruments and in keeping the instruments in the best 
adjustment; he was the first to introduce corrections to atmospheric refraction for 
the temperature of the air and for the height of the barometer. 


Vol. 198. A. (15 August 1949) 



Halley in 1718 called attention to the fact that three of the bright stars, Sirius,
Procyon and Arcturus, had changed their positions since Greek times, and that 
Sirius had perceptibly changed its position since the time of Tycho Brahe; the 
fixed stars were not, in fact, fixed, and the study of their proper motions added 
a new interest to the determination of stellar positions. Bradley himself discovered 
the two phenomena of stellar aberration and of nutation. William Herschel, by 
analyzing the proper motions of fourteen stars which had been determined with 
accuracy by Maskelyne at Greenwich was able to show that the Sun itself had 
a motion relative to the stars. The improvements in the tables of the motions of the 
Sun, Moon and planets, which resulted from the progressive increase in accuracy 
of observation and from the fact that observations were made all round their orbits, 
enabled the great Continental mathematicians of the eighteenth century, Euler, 
Clairaut, D’Alembert, Lagrange and Laplace, to prove that within the limits of 
accuracy of observation the movements of the bodies in the solar system could be 
accounted for in detail on the sole hypothesis of the Newtonian theory of 
gravitation. 

These facts are mentioned to emphasize that the determination of the positions 
and motions of the Sun, Moon, planets and stars is work of fundamental importance 
in astronomy. It demands continuity of observations over a long period, made with 
the greatest care and precision. It was the work for which the Observatory was 
founded; it is the work which must always be its first concern, for it is work that is 
never completed but that will always go on. In 1876 Airy, after reviewing the work 
of 40 years at Greenwich since his appointment as Astronomer Royal, in the course
of which he had expanded the work into various new directions, remarked : 

‘Turning now from the past to the future, I see little in which I could suggest
any change. If it should ever be necessary to make any reduction, I should propose 
to withdraw Meteorology, Photoheliography, and Spectroscopy ; not as unimportant 
in themselves, or as ill-fitted to the discipline of the Observatory, but as the least 
connected with the fundamental idea of our Establishment.’ 

Until well into the nineteenth century, meridian telescopes for the determination 
of position formed essential equipment of most observatories. Many amateurs even 
made regular meridian observations and provided significant contributions to 
positional astronomy. But as new fields of investigation in astronomy have been 
developed, meridian observations have been discontinued at most observatories. 
They make heavy demands on observing resources and involve much computational 
work. For this branch of astronomy, continuity of observations with the same
instruments over a long period is of the greatest value. Such observations are 
therefore not well fitted for University or private observatories. It is proper that 
they should be undertaken almost entirely, as in fact they are to-day, by the 
national observatories. In recent years the systematic pursuit of meridian observa- 
tions has tended to be restricted more and more in the northern hemisphere to three 
observatories, Greenwich, Washington and Pulkowa. During the war the Pulkowa
Observatory was completely destroyed, though it is now being rebuilt on a larger 
scale, and meridian observations will continue to form an important portion of its 
work. At the end of 1940 meridian observations at Greenwich had to be discontinued
for the first time in the history of the Observatory; the complete discontinuance
was not for long, but until after the end of the war it was possible to
carry on the observations only on a small scale. 

The purpose of meridian astronomy is to provide a fundamental system of 
reference, defined by the positions and proper motions of a network of stars 
distributed with reasonable uniformity over the whole sky, together with the 
numerical value of the constant of precession. The observations are made from 
a moving and rotating earth and it is convenient to use the equator as a plane of 
reference and the equinox as a zero point. Both the equator and the equinox
depend upon the motion of the Earth and the motion of its axis; they can therefore
be determined by observations of the Sun which, in effect, fix the orbit of the
Earth, or by observations of the bright inner planets, which involve the elements
both of the orbit of the planet and of the orbit of the Earth. Observations of the
Sun are peculiarly liable to errors of a systematic nature, while observations of the
inner planets, showing perceptible disks and phases, are liable to personal errors
which are different from those that affect star observations. There is the further 
difference that observations of the Sun and planets are made by day, while the star 
observations are mostly made at night. 

The position of the pole can be determined from observations of circumpolar 
stars, and the position of the nadir can be determined from observations with 
a mercury horizon. The two together will fix the equator point. There are normally 
systematic differences between the equator point fixed in this way and the equator 
point deduced from Sun and planet observations. The circumpolar observations, 
above and below pole, are made 12 hr. apart; they may be affected by systematic 
diurnal effects, by errors in the corrections applied for refraction, by changes in the 
instrumental adjustments, by instrumental flexure, which is different for the two 
observations, and by errors in the adopted figure of the instrumental pivots. There 
are consequently many possibilities of errors of a systematic or quasi-systematic 
nature. When observations of stars in the equatorial belt made at a northern and 
at a southern observatory are compared, systematic differences are generally found. 
The observations can be brought into agreement by adjusting the coefficients of 
refraction used in the reductions of the observations at the two observatories, but 
the corrections so obtained are usually found to be quite inadmissible. 

The observations made with the conventional type of transit circle are, in fact,
liable to many sources of error which are difficult to control adequately. If the same 
stars are observed with two different transit circles and the derived positions are 
compared, it is found that, in addition to random errors of observation, there are 
systematic errors in both right ascension and declination which vary with right
ascension and also with declination. Magnitude errors in right ascension, which
affected observations made by the old methods of eye and ear and of hand-tapping,
have been practically eliminated by the use of the impersonal micrometer.

The sources of error with the conventional transit circle are numerous. The errors 
of instrumental adjustment (of level, of azimuth, and of collimation) are continually
varying. The variations are difficult to control adequately. If the meridian
opening in the transit pavilion is narrow, refraction anomalies are inevitable; with
a wide opening, the instrument is fully exposed to the wind and to temperature
changes. The errors of azimuth are particularly difficult to control, unless the site 
is one which permits of fixed azimuth marks of high stability. Corrections must be 
applied for any departure of the figures of the pivots from perfect cylindricality; 
the determination of the figures of the pivots with accuracy and with freedom from 
spurious elliptical terms is not easy. Instrumental flexure can be determined only 
in the horizontal position of the telescope, so that a law of variation with zenith 
distance must be assumed. The telescope is turned in the course of observation into 
all sorts of positions, with possibilities of displacements of objective or of the 
mechanical parts of the micrometer, which may be sufficiently small to escape easy 
detection but which can introduce serious errors of a systematic nature. 

The difficulty of eliminating systematic errors of instrumental origin can be
illustrated by the reversible transit circle of the Cape Observatory, which was
designed by Sir David Gill with much thought and care. The telescope can be
reversed in its bearings, giving two positions for observation, denoted by E and W;
the object glass and eye-end are arranged to be interchangeable, giving two
arrangements denoted by I and II. Four separate combinations are therefore
possible. After allowing for the difference in horizontal flexure in the two conditions
I and II, the difference in the declinations measured in the two conditions has
a regular run amounting to 0″.4 from dec. +40° to the south pole. The difference
between the two positions E and W in both conditions has a range of 0″.18, but the
difference is systematically larger in condition II than in condition I, the extreme
differences being 0″.3 and 0″.2 respectively. The mean of the observations in the
four separate combinations is adopted as likely to provide the closest approximation 
to the truth. 

Because of these systematic errors of instrumental origin, the proper motions of 
stars must be based largely upon series of catalogues observed over many years with 
the same instrument, whereby the instrumental peculiarities are to a large extent 
eliminated. The observations made at Greenwich are of special importance for this 
purpose, as the Airy transit circle, which defines the prime meridian of longitude, 
has been used continuously since 1851. With this instrument more than 660,000 
observations have been made. The last three programmes consisted of the observa- 
tions of all the stars down to about the 8th magnitude in the zones of declination 
0° to +32°, +32° to +64°, +64° to +90°, supplemented by fainter stars in regions
of high galactic latitude, together with numerous observations of a large number 
of fundamental stars. Thus the whole of the northern sky has been covered. The 
proper motions of the stars have also been investigated. The Airy instrument does 
not accord with modern ideas for the construction of a transit circle; it is not 
reversible in its bearings ; it is housed in a pavilion which has buildings on each side 
with a narrow opening and liability to refraction anomalies ; the terrain is asym- 
metrical to north and south, with the possibility that the atmospheric refractions 
north and south of the zenith may not be equal. The instrument has now reached 
the end of its useful life. A new transit circle, reversible and housed in a pavilion 
of semi-cylindrical shape, with a wide aperture and a fairly symmetrical terrain to 
north and south, was installed shortly before the war. Before bringing this instrument
into regular use, it has been subjected to a lengthy series of investigations, in
the course of which the errors of the circle graduations and of the figures of the 
pivots have been determined with great accuracy. These investigations have thrown 
much light upon various ways in which the observations can be affected by 
systematic or quasi-systematic errors of instrumental origin. 

These remarks have been made to illustrate the difficulties and complexities of 
meridian astronomy and to explain why observations of position must still be 
continued. The observations made at different times and with different instruments 
at different observatories are combined, by a process of adjustment which is to 
a large extent empirical, to form what is termed a fundamental system, so pro- 
viding a system of reference that is more accurate than it is possible to obtain from 
observations with a single instrument and, moreover, covering the whole sky. The 
more effectively the various sources of error can be eliminated or controlled with 
each particular instrument, the more reliable the fundamental system will be. Both 
the places of the stars and their proper motions, as given in the fundamental system, 
are affected by errors ; the errors of the star places increase with the lapse of time 
from the epoch of the system, so that successive revisions of the system become 
necessary. A process of gradual approximation, which still continues, is involved. 

With the realization of the serious difficulties involved in the use of a large 
movable telescope for the accurate determination of positions, it is natural that 
consideration should be given to alternative designs of instruments in which the 
movable parts are reduced as much as possible. The use of a fixed telescope in 
conjunction with a moving plane mirror has more than once been suggested; 
a design of such a mirror transit circle has been developed at Greenwich. It is 
proposed to employ two fixed horizontal telescopes, with their axes in the meridian 
and their objectives facing a movable mirror, whose plane is parallel to the east- 
west axis of rotation. Stars would be observed in one or the other telescope 
according to whether they transit north or south of the meridian, while the two 
telescopes would serve also as collimators. Such an arrangement reduces moving 
parts to a minimum. The fixed telescopes can be of longer focal length than is 
convenient for a movable telescope, and they can be effectively insulated against 
rapid temperature changes. Observations of nadir, of level and of near-zenith stars 
can be taken in either telescope. Flexure effects are reduced to a minimum, while 
troubles arising from small displacements of micrometer parts or of objectives are 
entirely eliminated. The adaptation of a variable-speed motor drive to the micro- 
meter wire is simplified. A small model of the proposed design has been constructed 
and the theory of the instrument has been investigated. The instrument appears to 
have great possibilities and it is hoped to try it out in practice. 

Closely related to the meridian astronomy is the determination of the variation 
of latitude caused by the movement of the Earth’s axis of rotation relative to the 
Earth. The pole has an irregular motion within a circle of about 30 ft. radius; it 
contains two principal components, of periods 12 and 14 months, but the motion 
is not sufficiently regular to predict ahead. The component of the motion along the 
meridian of a place causes a change of latitude; the component in the perpendicular 
direction affects time determinations. The complete motions can be determined by
observations of the variations of latitude at two places whose longitudes differ by 
about 90°. The variation of latitude was first established in 1888; Chandler 
afterwards found it clearly exhibited in observations back to 1760. Determinations 
were first made at Greenwich with the Airy reflex zenith tube, but the observations
were not entirely satisfactory. A long series of observations was commenced in
1911 with the Cookson floating zenith telescope, the observations being made
photographically. Pairs of stars at nearly equal distance north and south of the
zenith are observed at meridian passage; each star makes a trail across the plate,
the telescope being rotated through 180° between the observations of the two
stars. The separations of the pairs of trails are measured, and the variations in 
latitude are deduced from the changes in these separations. Meridian observations 
of declination are corrected for the variation of latitude. A new photographic 
zenith tube, now under construction, will be used, when completed, for the 
determination of the variation of latitude as well as for time determination. The
study of the polar motions raises many interesting problems and provides a means 
for the determination both of the constant of nutation and of the constant of 
aberration. 
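As a minimal sketch of why two stations about 90° apart in longitude suffice, assume the usual first-order relation Δφ = x cos λ + y sin λ, where λ is the west longitude of the station and x, y are the components of the pole's displacement towards Greenwich and towards 90° W. The relation, the function name and the sample values are assumptions for illustration; none of them is given in the text.

```python
import math

def pole_from_two_stations(dphi_a, lon_w_a_deg, dphi_b, lon_w_b_deg):
    """Solve dphi = x*cos(lon) + y*sin(lon) at two stations for (x, y).

    dphi_* are latitude variations (any angular unit); lon_w_* are west
    longitudes in degrees. The sign convention is an assumed one.
    """
    ca, sa = math.cos(math.radians(lon_w_a_deg)), math.sin(math.radians(lon_w_a_deg))
    cb, sb = math.cos(math.radians(lon_w_b_deg)), math.sin(math.radians(lon_w_b_deg))
    det = ca * sb - sa * cb  # equals sin(lon_b - lon_a); close to 1 near 90 deg separation
    if abs(det) < 1e-9:
        raise ValueError("stations too close in longitude to separate x and y")
    x = (dphi_a * sb - dphi_b * sa) / det
    y = (ca * dphi_b - cb * dphi_a) / det
    return x, y
```

With a separation near 90° the determinant is close to unity, so errors in the two latitude series are not magnified in the derived components.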


Time Department 

The provision of a time service is a normal function of a national observatory 
and is closely related to the work of the meridian department. The right ascension 
of a star is the sidereal time of meridian transit of the star. In positional astronomy 
a number of the brighter stars in the equatorial belt, suitably distributed
round the sky, are selected as ‘clock stars’. The positions and proper motions of the
clock stars, as used at Greenwich, have been derived from the long series of meridian 
observations and are progressively refined as the observations continue. Observa- 
tions extending from 6 to 12 hr. serve to control periodic errors in the right ascensions 
of the clock stars. Using these adopted right ascensions, the observations of the 
clock stars determine the errors of the standard sidereal clock night by night. The 
right ascension of any other star is derived by correcting the observed sidereal time 
of its transit for the clock error, interpolated for the time of transit. 
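The reduction just described can be sketched as follows, assuming a simple linear interpolation of the clock error between two clock-star determinations. The sign convention (error = clock reading minus true sidereal time), the function names and the units are assumptions for illustration only.

```python
def interpolate_clock_error(t_h, t0_h, e0_s, t1_h, e1_s):
    """Clock error in seconds at clock time t_h (hours), interpolated linearly
    between errors e0_s at t0_h and e1_s at t1_h from clock-star transits."""
    return e0_s + (e1_s - e0_s) * (t_h - t0_h) / (t1_h - t0_h)

def right_ascension_h(observed_transit_h, t0_h, e0_s, t1_h, e1_s):
    """Right ascension in hours: observed sidereal time of transit corrected
    for the interpolated clock error (assumed positive when the clock is fast)."""
    error_s = interpolate_clock_error(observed_transit_h, t0_h, e0_s, t1_h, e1_s)
    return observed_transit_h - error_s / 3600.0
```

For example, a star observed at 7 hr. by a clock found 1 sec. fast at 6 hr. and 3 sec. fast at 8 hr. would be corrected by 2 sec.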

Before 1927 the time determined at the Royal Observatory was based upon 
observations with the Airy transit circle. In 1926 a world programme of longitude 
determinations, in which a large number of observatories in all parts of the world 
participated, was undertaken under the auspices of the International Astronomical 
Union. For this special programme, a small reversible transit circle was used at
Greenwich, the telescope being reversed near the middle of each transit, thereby
eliminating the correction for collimation error. For such a programme it is
necessary to adopt a common system of star places, in order to ensure that the 
derived longitudes are as free as possible from systematic errors in star positions 
determined with different instruments ; in this particular programme the star places 
in Eichelberger’s Fundamental Catalogue were used. It was found that the clock 
errors derived from these observations had a much smoother run than the clock 
errors derived concurrently from observations with the Airy transit circle. The 
latter are affected by obscure instrumental errors, to which reference has already
been made. It was found, moreover, that when the smoother clock errors provided 
by the small transit observations were used for the reduction of the Airy transit 
circle observations, the derived right ascensions of the stars became discordant ; but 
that when the more irregular errors given by the transit circle observations were 
used, the derived right ascensions became accordant. The instrumental peculiarities
of the transit circle are evidently involved. 

Since 1927, therefore, the time determinations have been based entirely upon 
observations with small transit instruments, which are reversed in the middle of
each transit. By resolution of the International Astronomical Union the revised 
Auwers fundamental system, known as the FK3, is used for the system of star
places. The apparent places of the stars are taken from the annual volume Apparent
Places of Fundamental Stars, published by the Nautical Almanac Office.

Prior to 1923 the standard clocks employed in the Royal Observatory were 
regulator clocks with Graham dead-beat escapements. A Cottingham clock, fitted
with a Riefler escapement, installed after the first World War, was expected to give 
a higher standard of performance, but failed to come up to expectations. The 
development of the Shortt free-pendulum clock introduced a new standard of 
precision in time-keeping. In this type of clock a master pendulum, mounted in an 
airtight case, exhausted to a pressure of about 1 in. of mercury, and placed in 
a constant-temperature room, synchronizes a slave clock of commercial type. The 
master pendulum swings freely, except when given a small impulse once each half-
minute, and is relieved of the work of moving a train of wheels to show the time on
dials and of sending out signals. The first clock of this type, Shortt no. 3, was 
installed at the Observatory in November 1924 and soon showed its superiority 
over other types of pendulum clock. Other clocks of this type were therefore 
installed and used both as sidereal and as mean time standards in the 
Observatory. 

The introduction of the new standards resulted in an important change in the
system of time employed. The transit of the first point of Aries or the vernal 
equinox defines the beginning of the sidereal day, 0 hr. sidereal time. But the 
precessional motion of the true equinox is not uniform, being affected by
irregularities, due to solar and lunar perturbations, which are known as nutation. 
In consequence the sidereal day varies slightly in length. If we imagine a point 
moving uniformly along the equator, with a motion equal to the mean motion of 
the true equinox, and so that its extreme distances from the true equinox on both 
sides are equal, we may term this point the mean equinox. The sidereal time 
determined by observation is apparent sidereal time. A mean sidereal time can be
defined by reference to the mean equinox, in which all days are of equal length; it is
obtained by subtracting the nutation from the apparent sidereal time. Apparent 
sidereal time was good enough before the introduction of the Shortt free-pendulum 
clocks ; their superior precision made it necessary to introduce the concept of mean 
sidereal time, which has been universally adopted. The detailed study of the 
performance of the Shortt clocks at Greenwich, which proved their superiority over 
other types of pendulum clocks, stimulated their introduction into observatories 
in many parts of the world. 



The Royal Observatory is responsible for the distribution of time to the public. 
The first steps in the distribution outside the Observatory became possible with the 
development of telegraphic communications. In 1862 an electric clock was installed 
at the Observatory, which transmitted a signal each day that caused a time-ball, 
on the offices of the Electric Telegraph Company in the Strand, to drop. In 1865 
signals were sent hourly to the Electric and International Telegraph Company’s 
office, whence they were distributed over the railway network of the country. After 
the telegraph system was taken over by the Post Office, in 1870, a complete system 
for the hourly distribution of time through the Post Office was gradually developed, 
which made Greenwich time widely available. A further step in the widespread 
dissemination of accurate time followed naturally upon the development of broad- 
casting. Two of the Dent regulator clocks were modified to run as synchronized 
clocks under the control of one of the mean time free pendulums, and were provided 
with a system of contacts to enable time signals to be transmitted automatically 
every quarter of an hour to the British Broadcasting Corporation. The signals were 
in the form of six dots at intervals of a second, the last coming exactly at the hour, the 
quarter, or the half hour. These signals are the familiar B.B.C. ‘six pips’ Greenwich
time signal, which, since 6 February 1924, has been transmitted on all B.B.C.
wave-lengths several times daily. 

A further service, designed to be of value for navigation, was commenced on 
19 December 1927. From that date radio time signals have been sent out on
a frequency of 16 kc./sec. twice daily, at 10 and 18 hr. G.M.T., from the Observatory,
via the Rugby wireless station. These signals, which last for 6 min., are of the
so-called vernier type, spaced 61 to the minute, enabling the error of a chronometer
to be accurately determined by observing the instants of coincidence between the 
signals and the ticks of the chronometer. Corrections to the times at which the 
signals were emitted are published by the Observatory at approximately monthly 
intervals, for use where higher precision is required, as, for instance, in survey 
operations. For the distribution of these signals a special ‘diminished seconds’ 
slave clock was installed, whose pendulum swings 61 times in a minute, and which 
is kept in synchronization by the mean-time master pendulum. The service has 
more recently been extended by simultaneous transmission of the time signals on 
several short wave-lengths. The introduction in 1936 by the Post Office of the 
‘speaking clock’, designed by the Post Office engineers and constructed at the 
Post Office Research Station, which is automatically controlled by hourly time 
signals from the Observatory, has made accurate time continuously available. 
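The vernier principle of the signals spaced 61 to the minute can be sketched as below, assuming an idealized chronometer that ticks at whole seconds plus a constant offset. The function names and this model of the chronometer are assumptions for illustration; the sketch only shows that noting which signal coincides with a tick fixes the offset to within about 1/61 sec.

```python
SIGNALS_PER_MINUTE = 61

def signal_times():
    """Emission times, in seconds after the minute, of the 61 signals."""
    return [k * 60.0 / SIGNALS_PER_MINUTE for k in range(SIGNALS_PER_MINUTE)]

def coincidence_index(chronometer_offset_s):
    """Index of the signal falling closest to a chronometer tick.

    Ticks are assumed to occur at whole seconds plus chronometer_offset_s.
    """
    def distance_to_tick(t):
        d = (t - chronometer_offset_s) % 1.0
        return min(d, 1.0 - d)

    times = signal_times()
    return min(range(len(times)), key=lambda k: distance_to_tick(times[k]))

def offset_from_coincidence(k):
    """Fractional-second offset implied by a coincidence at signal k.

    Signal k is emitted k/61 sec. before the k-th whole second, so the
    tick it coincides with is offset by -k/61 sec. (modulo one second).
    """
    return (-k / SIGNALS_PER_MINUTE) % 1.0
```

A listener therefore needs only to count signals up to the coincidence; no finer subdivision of the second by ear is required.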

It is necessary that the Observatory should keep abreast of developments in 
precision horology so that the time service can meet all demands for precision that 
are made upon it. After the development of the quartz crystal clock, it soon became 
evident that a new standard of precision in time-keeping had been reached. It was 
therefore decided to instal a clock of this type at the Observatory in order that some 
direct experience could be gained of its performance in comparison with the 
performance of the free-pendulum clocks. A clock, using a Dye-Essen ring crystal, 
was constructed under the supervision of the National Physical Laboratory and 
installed in 1939. The quartz vibrator was adjusted to have a frequency of 100 kc.
per sidereal second; demultiplier circuits provided an output with a frequency of
500 cycles per sidereal second, which was used to drive a phonic motor. Although 
the performance of this clock was not altogether satisfactory, experience showed 
that it was free from the small erratic changes of rate to which the pendulum clocks 
were liable and which made prediction uncertain; it was found also that it would be
more convenient, when additional quartz clocks were installed, to employ a funda-
mental frequency of 100 kc. per mean solar second instead of per mean sidereal second.

The outbreak of war stopped for a time developments which had been planned 
for an improved time service. A skeleton time service was at once installed at the 
Abinger Magnetic Station as a safeguard against the possibility of the service from 
Greenwich being put out of action. Towards the end of 1940 when the frequent air 
raids made night observations impossible at Greenwich, the time service was moved 
in its entirety from Greenwich to Abinger. Shortly afterwards a second time 
service was installed at the Royal Observatory, Edinburgh, with the co-operation 
of the Astronomer Royal for Scotland. For the remainder of the war, the two time
stations, at Abinger and Edinburgh, were in use; the two stations were connected 
by direct land line with teleprinter communication, enabling clocks at either station 
to be recorded at the other station. 

Plans were proceeded with for the installation of quartz-controlled frequency 
standards. The first group of three clocks was installed in 1943. These clocks, which 
were constructed at the Post Office Research Station, were of the Post Office 
Group IV type, in which a GT-cut plate of quartz is maintained in oscillation at 
100 kc. per mean time second, by a bridge drive circuit, which reduces to a minimum 
the effect of variation in the supply voltages. The crystals are mounted in thermo- 
statically controlled ovens, and the temperature range permitted by the modified 
Turner circuit employed ensures that variations in frequency from this cause are 
small. Regenerative frequency dividers are used to provide an output at 1000 c./sec., 
which can operate phonic motors, provided with contacts from which signals can 
be taken off. 

The installation has since been considerably extended and the time service is now 
based upon six groups of three quartz crystal clocks, four of the groups being at 
Abinger, where the main time-service station has remained, and two at Greenwich, 
enabling a time service of much higher precision to be provided than was possible 
with pendulum clocks. The quartz clocks have the further advantage that relative 
errors and rates can be obtained much more readily and with much greater accuracy 
than with pendulum clocks. For measuring time and frequency differences, decimal
counter chronometers are used, embodying scale-of-ten counter circuits. The 
100 kc./sec. output from one of the primary standards is fed into the counting unit. 
A seconds impulse from one clock can be applied to start the count and a seconds 
impulse from another clock to stop it. A single reading of the time difference between 
the two clocks is obtained, being shown on five dials reading successively in units of 
tenths, hundredths, thousandths, ten-thousandths, and hundred-thousandths of 
a second. To provide a check, the roles of the two clocks can be interchanged.
Alternatively, by throwing over a switch, a series of successive readings may be
obtained, when the results are added in the counter. This procedure is of value when
the intervals are nominally constant but are subject to small erratic variations, as 
when comparing a clock with a time signal; ten consecutive readings can then be
obtained with advantage. For frequency comparisons, the nominal 100 kc./sec. 
outputs from two clocks are fed into a comparator, where the levels are adjusted 
to a standard value. The adjusted outputs are then combined and the resultant 
beats are shown on a meter. By means of a trigger circuit, a pulse is sent to the 
decimal counter when the beat-frequency voltage passes through zero. The intervals 
between the beats are thus accurately timed, enabling frequency differences to be 
determined with an accuracy of at least one part in 10¹⁰. The quartz clocks are
provided in addition with automatic beat counters; by automatic counting of the
number of beats between each pair of oscillators in a 24 hr. period, the change in
the time difference between each pair of clocks in the course of a day is recorded in
units of 10⁻⁵ sec. The intercomparisons between each pair of clocks, in each group
of three, provide an automatic check against incorrect action of any of the beat
counters. 
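The arithmetic behind the counter and the beat comparisons can be sketched as follows, assuming that the five dials simply display the decimal digits of the number of 100 kc./sec. cycles counted between the start and stop impulses, and that each beat between two oscillators marks one whole cycle gained by one over the other. The function names are hypothetical.

```python
NOMINAL_HZ = 100_000  # 100 kc./sec. standard frequency fed to the counter

def dial_readings(cycles_counted):
    """Five decimal dials, from tenths down to hundred-thousandths of a second."""
    return [int(d) for d in f"{cycles_counted % NOMINAL_HZ:05d}"]

def fractional_frequency_difference(beat_period_s):
    """Oscillators beating once every beat_period_s differ by 1/beat_period_s
    cycles per second, expressed here as a fraction of the nominal frequency."""
    return 1.0 / (beat_period_s * NOMINAL_HZ)

def daily_time_drift_s(beats_in_24h):
    """Change in time difference over a day: one beat is one cycle gained,
    that is 1/NOMINAL_HZ of a second."""
    return beats_in_24h / NOMINAL_HZ
```

On these assumptions a count of 3127 cycles reads 0.03127 sec. on the dials, and a beat period of 1000 sec. corresponds to a frequency difference of one part in 10^8.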

An additional advantage of the quartz crystal clocks is that though they are 
rated approximately to mean time they can serve all purposes. Separate sidereal 
time clocks and diminished seconds transmitters for the rhythmic time signals are 
no longer required. The phonic motors, driven by the 1000 c./sec. output from one 
of the clocks, can be adapted to these additional requirements. In the phonic 
motors used at the Royal Observatory, the rotor consists of a laminated iron ring 
with 100 teeth cut on its inside surface. The six-pole stator within the rotor has
corresponding teeth cut on its pole faces. The motor runs synchronously at
10 revolutions per second, driving by gearing a commutator wheel at one revolution
per second. A contact spring, bearing on this wheel, closes an electrical circuit for 
one-tenth of each second. In order to obtain sidereal seconds, a special gearing is
used. The gearing ratio employed is (119/114) × (317/330), which is about four parts in 10⁹
smaller than the correct ratio. This small error in the gear ratio is immaterial, for
when the rate of the clock in mean-time milliseconds has been determined, the rate
of the sidereal impulses, in sidereal-time milliseconds, can be at once inferred.
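The size of the gearing error can be checked numerically. The sketch below takes the gear ratio as (119/114) × (317/330) and compares it with the ratio of the mean solar to the sidereal second, for which the standard value 1.00273790935 is assumed here; that constant is not quoted in the text.

```python
from fractions import Fraction

# Sidereal seconds per mean solar second (standard value, assumed here).
SIDEREAL_PER_MEAN = 1.00273790935

# Gear train ratio taken as (119/114) x (317/330).
gear = Fraction(119, 114) * Fraction(317, 330)

# Fractional amount by which the gearing falls short of the true ratio.
shortfall = (SIDEREAL_PER_MEAN - float(gear)) / SIDEREAL_PER_MEAN

print(f"gear ratio      = {gear} = {float(gear):.12f}")
print(f"fractional lack = {shortfall:.2e}")
```

The computed shortfall is a few parts in 10^9, in agreement with the statement that the error is immaterial once the clock rate is known.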

The rhythmic time signals consist of a long dash at the minute, followed by
a series of dots spaced at intervals of 1/61 min. These signals are derived from
a contact drum, which is driven through a 60/61 gear from a phonic motor. In
order that the signals may be sent out at the desired instants, a phasing adjustment
is provided for the signal transmitter. There are day-to-day variations in the time
lag introduced by the land-line joining the Observatory to the Rugby wireless
station. Some test signals are transmitted a few minutes before the time signals
themselves; these test signals are received and recorded at the Observatory and by 
comparing the times with those of the outgoing signals, the land-line lag is deduced. 
An adjustment to correct for the lag is made by rotating the phonic motor stator, 
thus advancing or retarding the phase of the rotor. A rotation of the stator housing 
by 360° advances or retards the contact time by one-tenth of a second. 
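The phasing adjustment can be sketched as a simple proportion, assuming the rotor advances one tooth pitch for each cycle of the 1000 c./sec. supply, so that 100 teeth give the stated 10 revolutions per second. The function names and the sample lag are hypothetical.

```python
SUPPLY_HZ = 1000.0  # output driving the phonic motor
ROTOR_TEETH = 100   # teeth on the rotor ring

def rotor_rev_per_second():
    """Synchronous speed, assuming one tooth pitch per supply cycle."""
    return SUPPLY_HZ / ROTOR_TEETH

def stator_rotation_deg(lag_s):
    """Rotation of the stator housing that shifts the contact time by lag_s.

    A full turn of the stator shifts the rotor phase by one revolution,
    that is by 1/10 sec. at 10 revolutions per second.
    """
    seconds_per_rev = 1.0 / rotor_rev_per_second()
    return 360.0 * lag_s / seconds_per_rev
```

On this reckoning a land-line lag of 5 msec. would call for a rotation of 18°.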

The most recent phonic motor equipment installed at the Observatory provides 
a complete contact assembly for the control of all the time signals sent out, including
not only the rhythmic signals but also the B.B.C. ‘six pips’ time signals and the
hourly signals for the control of the Post Office speaking clock.

The use of quartz clocks has resulted in a much improved precision in the time 
service. Their freedom from small erratic changes of rate makes accurate short- 
term prediction possible. This is of importance for the control of precision frequency 
standards, which can be checked against a 24 hr. time interval of high precision. 
The Rugby 10 hr. time signals are transmitted so as to give an accuracy in the 
24 hr. interval between the signals on consecutive days which does not normally 
exceed 1 msec. The difficulties of accurate long-term prediction, which is needed to 
carry over periods during which no time determinations can be obtained, are much 
greater. The quartz crystal oscillators are subject to an ageing effect, which causes 
a drift in frequency, more rapid at first and gradually decreasing, though never, as 
far as present experience is a guide, completely disappearing. Time determinations 
extending over some months are needed in order to derive the frequency drift with 
the accuracy needed for prediction. But a complication is introduced by the motion 
of the Earth's poles, which causes small displacements of the meridian ; the result is 
that a perfect clock, compared with absolutely accurate time determinations, would 
appear to have a small variable error, which at Greenwich can amount to about 
± 25 msec. As the polar motion has two principal components, with periods of 
12 and 14 months, an incorrect determination of frequency drift is inevitable unless 
the effects of polar motion can be allowed for. The motion of the pole along the 
meridian can be determined by observing the changes of latitude which result from 
it; the motion in the perpendicular direction can be determined only from observa- 
tions of latitude variation at another observatory, differing by about 90° in 
longitude. The determinations of latitude variation at Washington, longitude 77° W, 
are communicated regularly to Greenwich and are used to correct for the effects of 
polar motion and thereby to derive more accurate values of the frequency drifts of 
the clocks. It is noticeable that the application of these corrections has appreciably 
smoothed the apparent errors of the clocks. 
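The order of magnitude of the apparent clock error can be checked roughly, assuming that a displacement of the pole at right angles to the meridian shifts the longitude by the angular displacement multiplied by the tangent of the latitude. This first-order relation, the mean radius of the Earth and the rounded latitude of Greenwich are assumptions for illustration; the 30 ft. radius of the polar motion is the figure quoted earlier in this account.

```python
import math

FEET_TO_M = 0.3048
EARTH_RADIUS_M = 6.371e6     # mean radius of the Earth (assumed)
GREENWICH_LAT_DEG = 51.48    # approximate latitude of Greenwich (assumed)

def polar_time_error_s(displacement_ft, latitude_deg=GREENWICH_LAT_DEG):
    """Time error produced by a polar displacement perpendicular to the meridian."""
    angle_rad = displacement_ft * FEET_TO_M / EARTH_RADIUS_M
    longitude_shift_rad = angle_rad * math.tan(math.radians(latitude_deg))
    # A full rotation of 2*pi radians corresponds to 86400 sec. of time.
    return longitude_shift_rad * 86400.0 / (2.0 * math.pi)

print(f"{polar_time_error_s(30.0) * 1000:.1f} msec.")
```

A displacement of 30 ft. gives a value of the same order as the ± 25 msec. quoted above.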

The introduction of quartz crystal clocks has demanded an improvement in the 
precision of the time determinations. The period of several months, which is
required for a satisfactory determination of frequency drift with the relatively large
errors inherent in the time determinations with the small transit instruments, could
be much reduced if an appreciable reduction in the errors of observation were 
achieved. Much consideration has therefore been given in recent years to the 
design of new instruments for time determinations. 

A photographic zenith tube, which is expected to reduce the probable error of 
a time determination to a few milliseconds, is now in an advanced stage of con-
struction. This instrument is based on Airy's design of the reflex zenith tube at
Greenwich, with modifications to adapt it for photographic observation due to
F. E. Ross and incorporated in his photographic zenith tube, now in Washington. 
The essential principle is the employment of a zenith telescope, whose tube contains 
a mercury horizon to reflect the light and to bring it to a focus in the second 
Gaussian point of the objective, thereby making the observations practically 
independent of any error of level. The important modification introduced by Ross
was the inversion of the objective, placing the flint component uppermost, whereby,
with an appropriate separation between the two components, the second Gaussian 
point is brought a few millimetres below the lower face of the crown component. 
The observations are made photographically, the photographic plate being mounted 
in the Gaussian plane. The instrument was designed originally for the measurement 
of the variation of latitude ; the upper portion of the instrument, which carries the 
objective and the photographic plate, is in the form of a rotary, which can be turned 
through exactly 180°. If two exposures are made on a star, the rotary being turned 
through 180° between them, the separation between the two images in the direction 
of the meridian is twice the zenith distance of the star. In practice, exposures of 
finite length are given, the plate carriage being travelled along during each exposure 
with the speed of motion of the star image. If the two exposures are accurately 
timed and are approximately symmetrical about the instant of meridian transit, 
the time of transit can be inferred from the small relative displacement of the 
images in the direction perpendicular to the meridian. 
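The relation between the measured separation and the zenith distance can be sketched as below, assuming a small-angle plate scale derived from the focal length. The focal length of 135 in. is the figure given for the Greenwich design; the function names and the neglect of all instrumental corrections are assumptions for illustration.

```python
ARCSEC_PER_RADIAN = 206264.806
FOCAL_LENGTH_MM = 135 * 25.4  # 135 in. focal length of the Greenwich design

def plate_scale_arcsec_per_mm(focal_length_mm=FOCAL_LENGTH_MM):
    """Small-angle plate scale in seconds of arc per millimetre."""
    return ARCSEC_PER_RADIAN / focal_length_mm

def zenith_distance_arcsec(separation_mm, focal_length_mm=FOCAL_LENGTH_MM):
    """Zenith distance from the separation, along the meridian, of the two
    images taken before and after the 180 deg. reversal of the rotary."""
    return 0.5 * separation_mm * plate_scale_arcsec_per_mm(focal_length_mm)
```

The scale comes out at about 60 seconds of arc per millimetre, so that a measuring accuracy of a micron on the plate corresponds to a few hundredths of a second of arc.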

The advantages of this type of instrument for time determination are considerable.
The observations being photographic, personal equations are eliminated. Errors of 
level do not affect the observations ; there is no collimation correction to trouble 
about; observations in the zenith are independent of azimuth error. As the 
instrument is fixed, the various sources of error to which a moving instrument is 
liable cannot occur. A longer focal length can be used than is possible with a moving 
instrument, with the advantage of a correspondingly greater scale. Observations 
are restricted to the zenith, where atmospheric transparency is highest and refraction 
effects are at a minimum. 

The instrument which has been designed at Greenwich differs in a number of 
important respects from the Washington instrument : 

(i) It has a larger aperture (10 in.) and longer focal length (135 in.). 

(ii) A plain ball-bearing is used for constraint of the rotary, in place of conical 
bearing, in order to reduce friction and to facilitate construction. 

(iii) An autocollimation method is used as a criterion of the angle of reversal of
the rotary. 

(iv) As a fixed axis of rotation is not required for (iii), a definite constraint in the 
horizontal plane is not needed. The two working orientations are each defined by 
a pair of stops instead of by a single stop. 

(v) Adjustments to the objective are provided for squaring-on and for coincidence 
of the nodal plane and photographic plate. 

(vi) Automatic reversal is accomplished by means of a system of wires which 
exert a pure torque on the rotary and therefore no tilting torque on the tube. The 
system is such that unidirectional rotation of the driving shaft is converted into 
reciprocating rotation of the rotary. 

(vii) The plate carriage is annular and the plate-holder mount is circular so that
symmetry of diffraction pattern is secured. The carriage constraints are external 
to the aperture. 

(viii) Relative motion of the carriage and rotary is made to approximate to pure
translation by means of a compensating system of flexed rods, which constrain the
carriage in the horizontal plane to which the motion is restricted by means of three 
balls that roll between horizontal planes. 

(ix) Uniformity of rate in the relative translation of carriage and rotary is obtained 
by a specially designed system comprising a differential roller and metallic tapes. 

(x) The time scale is produced photographically by means of a clock-controlled 
lamp giving flashes of very short duration. An independent chronograph is not 
required. 

(xi) The height of the mercury surface is accurately adjustable and, as criterion
of adjustment for constancy of scale value, an optical null method has been 
introduced for use in conjunction with a suspended silica rod. 

Some consideration has also been given to the design of a new type of transit 
instrument, designated as the Horizontal Transit Instrument. The essential feature 
is that the telescope system remains fixed (though adjustable) with its axis horizontal 
and in an east-west direction. The light from a star of any declination, near the 
position of meridian transit, is directed along the optical axis by a subsidiary 
optical system of constant deviation, which can be rotated about an east-west axis 
and can be set to the appropriate declination. The effect on time determination 
of its positional errors (whether due to maladjustment of the axis of rotation or to 
pivotal errors) is reduced to the second order. Level and azimuth errors of the 
telescopic system have the same effect on the observed time of transit as they do 
with the ordinary transit instrument; but since the telescopic system is not
deliberately subjected to gross mechanical disturbances and suffers from no pivotal 
errors, these level and azimuth errors should be far more stable than in the reversible 
instrument. The collimation error is dealt with by duplication of the telescopic
system and reversal of the subsidiary system, so that the essential advantage of the 
reversible instrument is not sacrificed. Observation is made at the common focal 
plane of the duplex telescopic system, from the two sides successively. The fixity of 
the telescopic system avoids errors due to flexure, and permits of the use of a 
focal length considerably greater than can profitably be used in the ordinary 
reversible instrument. The level is determined with reference to two mercury 
surfaces, one at each end of the instrument, by means of an autocollimation
method. 

Instead of following the star image with a movable micrometer wire, a variable- 
deviation system is used by which the light in the telescopic portion of the instrument 
is kept always axial as the direction of the incident starlight rotates. In this way
the tolerances of certain essential adjustments are greatly increased. Further, this 
variable-deviation system acts also as a micrometer and as the means by which 
signals are sent to the chronograph. An additional advantage of this axial method 
is that the fiducial line that bisects the star image is not required to move in order 
to follow the star’s image or to be linked to the signalling system as at present. Thus 
no mechanical errors are introduced at this point. An optical method is con- 
templated for defining the position of the variable-deviation system in such a way 
that in its performance as a micrometer or signal emitter the system will be 
effectively free from the effects of mechanical errors. A thorough examination of the 
theoretical aspects of the design has been completed. 



Magnetic and Meteorological Department

The first extension of the work of the Observatory beyond that laid down in the 
Royal Warrant for its foundation came with the setting up by Airy in 1840 of 
a magnetic and meteorological department. Certain meteorological data are of 
importance for the astronomical observations; atmospheric refraction depends 
upon the barometric height and the temperature; atmospheric transparency, an 
important factor in photometric observations, is correlated with horizontal 
visibility; the measured variation of latitude depends to some extent upon the 
direction of the wind ; the amounts of sunshine, of rainfall, and of clear sky at night 
give some indication of the general observing conditions. The astronomer has to 
make his observations at the bottom of a dense atmosphere, and it is only to be 
expected that atmospheric conditions can influence the observations in many 
different ways. The effects are often unsuspected and obscure in origin and may not 
be discovered until results are analyzed; it was, for instance, quite unsuspected in 
advance that the measured latitude would depend upon the direction of the wind, 
though not upon its velocity. With a complete record of meteorological data, the 
basic data are available for any purposes of subsequent analysis. The Observatory 
makes continuous records of wind direction and pressure, of the total flow of air, of 
dry- and wet-bulb temperatures, of barometric height, of rainfall, of sunshine by 
day and of clear sky at night, the last being recorded by the trails of Polaris and of
δ Ursae Majoris obtained with a small fixed camera pointing to the pole. Daily eye
observations are made of the barometer, dry-bulb and wet-bulb thermometers, 
radiation and earth thermometers, of the amount of cloud and of visibility. Some 
of the instruments used, such as the anemometers, are not of the most modern type, 
but the long series of observations made according to a uniform plan and with the 
same instruments is of special value for climatology. The Greenwich series of 
observations does, in fact, hold a unique place in British climatology. The data are 
of importance for various statistical purposes, such as questions of public health, 
the occurrence of epidemics, etc.; the meteorological results are therefore sent 
weekly to the Registrar-General. Observations are communicated daily to the 
Meteorological Office. 

Magnetic observations were commenced in 1840 at the same time as the meteoro- 
logical observations. Though not closely related, the magnetic and meteorological 
observations have always been in the charge of one department of the Observatory. 
This was primarily a matter of administrative convenience, to keep the non- 
astronomical work separate from the astronomical. 

When the magnetic observations were started, they were made visually every 
2 hr. throughout the day and night, but on one day each month they were made at
5 min. intervals throughout the 24 hr. This severe and trying labour was eliminated 
by the introduction in 1848 of continuous photographic registration, which has 
been maintained ever since, though with various changes and improvements in the 
recording instruments. A century of photographic registration has therefore been 
completed; the records are stored at the Observatory and are of great value in 
a variety of investigations. 




In 1923 it became necessary to remove the magnetic observations from Greenwich 
because of the plans for the electrification of the local railway system. A site was 
selected near Abinger, on the slopes of Leith Hill, in Surrey, where a new magnetic 
observatory was built and observations were commenced in 1924. At that time the 
absolute observations of horizontal intensity were made with the Kew magneto- 
meter, and those of dip with the dip inductor, which had superseded the dip circles 
in 1913. A few years later coil magnetometers were introduced as the standard 
instruments for the absolute measurement of horizontal and vertical intensity. The 
Schuster-Smith coil magnetometer for the measurement of horizontal intensity was 
installed in 1927, and the Dye coil magnetometer for the measurement of vertical 
intensity in 1928. Both these instruments were constructed at the National 
Physical Laboratory and are on loan to the Observatory from the Laboratory. The 
potentiometers used in conjunction with them are checked from time to time at the 
National Physical Laboratory. 

The Schuster-Smith coil magnetometer has proved greatly superior to the Kew 
unifilar magnetometer in both speed and accuracy. The speed of observation is 
particularly valuable when conditions are at all disturbed. The base-line values of 
the horizontal intensity magnetograph deduced from the absolute observations 
have an uncertainty of not more than 1γ. The scatter of the base-line values of the 
vertical intensity magnetograph, deduced from the absolute observations with the 
Dye coil magnetometer, is a little greater, but the uncertainty is only about 2γ or 
3γ. This instrument is adopted as the standard for vertical intensity, the dip being 
deduced from the observed values of the vertical and horizontal intensities. 
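
The deduction of dip from the two intensities rests on the geometry of the field vector, tan I = Z/H. The following is a minimal sketch in Python under stated assumptions: the function names are hypothetical, the field values are purely illustrative and are not Abinger measurements, and the uncertainties are propagated to first order only.

```python
import math


def dip_degrees(h_gamma, z_gamma):
    """Dip (inclination) in degrees from horizontal (H) and vertical (Z) intensity."""
    return math.degrees(math.atan2(z_gamma, h_gamma))


def dip_uncertainty_arcmin(h_gamma, z_gamma, sigma_h, sigma_z):
    """First-order propagated uncertainty of the dip, in minutes of arc."""
    f_squared = h_gamma ** 2 + z_gamma ** 2
    # Partial derivatives of atan(Z/H): dI/dZ = H/F^2 and dI/dH = -Z/F^2.
    sigma_rad = math.hypot(h_gamma * sigma_z, z_gamma * sigma_h) / f_squared
    return math.degrees(sigma_rad) * 60.0


# Illustrative values only, in gamma (assumed, not taken from the records).
H, Z = 18500.0, 43000.0
print(dip_degrees(H, Z))
print(dip_uncertainty_arcmin(H, Z, sigma_h=1.0, sigma_z=3.0))
```

With uncertainties of the order quoted for the coil magnetometers, the sketch suggests that the inferred dip is uncertain by only a fraction of a minute of arc for field values of this general size.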

Absolute observations of declination are made several times every weekday, 
using for reference an azimuth mark whose azimuth is controlled by observations 
of Polaris; those of horizontal and vertical intensity are made daily, except Sundays. 
Frequent observations of horizontal intensity are made with the Kew magnetometer 
and of dip with the dip inductor; these observations serve as a general check 
on the observations with the coil magnetometers and are not otherwise used. 

The Royal Observatory was the pioneer in using electrical coil instruments as 
standards; it is of interest to remark that a small systematic difference between the 
dip inferred from these observations, and the dip measured directly with the dip 
inductor was traced to an unsuspected defect in the inductor, arising from slight 
play in the bearings of the rotating coil. 

The recording variometers, which record declination, horizontal intensity, and 
vertical intensity, are of the well-known la Cour type. They include both slow-run 
and quick-run variometers. Records are also obtained with declination and 
horizontal intensity magnetographs of low sensitivity, which are of value in following 
the field changes during great magnetic storms when the large rapid movements 
cannot always be followed with certainty on the normal records. The published 
data include the hourly means of each element throughout the year, together with 
the monthly mean hourly values and the means for the five international quiet 
days and the five international disturbed days each month; the daily mean and 
daily extreme values for each element, with the corresponding monthly means for 
all days, for the quiet days, and for the disturbed days; the mean diurnal inequalities 
for each month, for the year, and for winter, equinox, and summer, of 
declination, dip, horizontal intensity and for north, west, and vertical components, 
for all days, for international quiet days, and for international disturbed days 
separately; the harmonic components of the diurnal inequalities of north, west, and 
vertical components, for each month, for the year, and for the three seasons, for all 
days, quiet days, and disturbed days; together with mean monthly and annual 
values for all elements. 

The daily magnetic character figures, and the 3-hourly range indices are assigned 
on the basis of the daily records and are communicated regularly to the international 
centre at De Bilt. As opportunity offers, the estimation of the 3-hourly range 
indices is being carried backwards through the long series of Greenwich records, 
providing data of fundamental importance in many geophysical investigations. 

The Royal Observatory has for some 30 years assumed the responsibility for the 
preparation of the world magnetic charts which are published by the Hydrographic 
Department of the Admiralty. Charts of declination are prepared at 6-yearly 
intervals; of horizontal intensity and of dip at intervals of 20 years. During the 
war, in connexion with the degaussing of ships as a protection against magnetic 
mines, a world chart of vertical intensity was prepared. It has been decided that 
charts of horizontal intensity, of dip, of vertical intensity, and of total intensity will 
be prepared in future at intervals of 10 years, in accordance with a recommendation 
of the Association of Terrestrial Magnetism of the International Union of Geodesy 
and Geophysics. The preparation of these charts involves the collection and 
examination of magnetic observations and surveys made in all parts of the world; 
from these observations the secular change and the rate of change of secular change 
have to be inferred in order to reduce the observations to a common epoch and to 
extrapolate to the epoch for which the charts are prepared. 
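
The reduction of an observation to a common epoch, using the secular change and its rate of change, can be sketched as a second-order extrapolation. This is an illustrative Python sketch only: the function name, the units, and the numerical values are assumptions, and the secular change is taken to be the rate at the epoch of observation.

```python
def reduce_to_epoch(value, obs_epoch, target_epoch, secular_change, secular_accel=0.0):
    """Carry an observed element from obs_epoch to target_epoch.

    value          : observed element (e.g. declination in minutes of arc)
    secular_change : annual change at obs_epoch (units of value per year)
    secular_accel  : rate of change of the secular change (per year squared)
    """
    dt = target_epoch - obs_epoch
    return value + secular_change * dt + 0.5 * secular_accel * dt ** 2


# Illustrative numbers only: an element observed in 1940.0, reduced to 1945.0.
print(reduce_to_epoch(10.0, 1940.0, 1945.0, secular_change=2.0, secular_accel=0.4))
```

The quadratic term grows with the square of the interval, which is why the rate of change of the secular change matters when observations many years old must be carried forward to the epoch of a chart.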

Since the untimely loss of the non-magnetic ship, the Carnegie, put an end to the 
long series of ocean magnetic observations undertaken by the Department of 
Terrestrial Magnetism of the Carnegie Institution, Washington, the magnetic data 
over some of the ocean regions, and particularly over the southern Indian Ocean, 
have become increasingly uncertain. Reports received from vessels of the mercantile 
marine were sufficiently concordant to justify some empirical corrections to the 
charts. The matter was discussed with the Hydrographer of the Navy and, as 
a result of representations made to the Board of Admiralty, the construction of 
a non-magnetic ship was decided upon. The ship, known as the R.R.S. Research, was 
in an advanced stage of construction at the time of the outbreak of war, when work 
had to be suspended. The possibility of completing the ship, except for the auxiliary 
engines, and of putting her into commission as a sailing ship is under consideration, 
though no decision has yet been reached. 

The harmonic analysis of the world magnetic charts and the comparison between 
the observed and computed field at various points on the earth’s surface can give 
some indication of areas where the charts are seriously in error. The charts for 1922 
and 1942 were analyzed in this way, and in each case it was found that the computed 
position of the north magnetic pole was not in agreement with the adopted 
position, which was determined by Amundsen in 1904 and was in close agreement 
with the position assigned by Ross in 1831. The discordance appeared to be too 
great to be attributable entirely to errors in the charts, and it appeared probable 
that the magnetic pole had moved appreciably from its position in 1904, in a direction 
somewhat to the west of north. When in 1945 a series of polar flights by the Lancaster 
aircraft Aries was being planned by the Empire Air Navigation School, there 
seemed to be an opportunity for obtaining some evidence on this question. The 
Commandant of the Empire Air Navigation School agreed that such observations 
as were possible in conjunction with the other objects of the flight should be undertaken. 
One of the flights actually made passed over the Amundsen position of the 
magnetic pole and another passed near the computed position. The results of these 
flights provided strong supporting evidence for the movement of the magnetic pole. 
More recent observations made by the Canadian Eastern Arctic Patrol have fully 
confirmed the displacement, though not by so large an amount as the harmonic 
analysis had suggested. 

Spencer Jones, Proc. Roy. Soc. A, volume 198, plate 1. Prospectus Septentrionalis. 

Figure 8. General view of Herstmonceux Castle. 

Since the termination of magnetic observations at the Kew Observatory, the 
Royal Observatory has taken over the responsibility for the testing and certifica- 
tion of magnetic instruments of different types. This work is undertaken not only 
for Government Departments and Colonial Governments, but also for various 
institutions and commercial firms. 

The Solar Department 

The magnetic observations, which are of some importance for navigation, led to 
a further development, which has no connexion with navigation. About the middle 
of the nineteenth century it was discovered independently by Sabine, Lamont and 
Wolf that magnetic phenomena have a period similar to the 11-year sunspot period, 
which had been announced shortly before by Schwabe. It was also found that 
magnetic storms often occurred when there was a large spot near the centre of the 
Sun’s disk. The relationship suggested the need for supplementing the magnetic 
observations by solar observations. The solar department was accordingly estab- 
lished by Airy in 1873. A photo-heliograph, in which the primary image of the Sun 
is magnified by an enlarging lens to give an image 8 in. in diameter, was installed, 
and the regular daily photography of the Sun, whenever conditions permitted, was 
commenced. These observations have been continued without interruption. Photo- 
graphs with a similar instrument are made at the Cape Observatory and are 
forwarded to Greenwich. There are normally but a few days in the year which are 
not represented in the combined Greenwich and Cape series; photographs for the 
missing days can usually be obtained on request from either the Kodaikanal 
Observatory or the Mount Wilson Observatory. The positions and areas of sunspots 
and faculae appearing on each photograph are measured. A general catalogue of 
sunspots and ledgers both of recurrent and of non-recurrent groups are prepared 
from the measures. The total areas of umbrae, of whole spots, and of faculae for 
each day are computed, as well as the mean areas and heliographic latitudes for 
spots north of the equator, for spots south of the equator, and for all spots. The 
collected results provide the most complete information that is available about 
sunspots and faculae, and are of importance in the study of the relationships 
between solar and terrestrial phenomena. 

The sunspot data have been used at Greenwich in a variety of investigations. The 
position of the Sun’s axis was determined from the observations of sunspots in the 
period 1874 to 1912, using both recurrent groups and groups observed for eight or 
more days; the data were analyzed for three complete spot cycles separately, for 
four different phases of the cycle, and for the three chief zones of heliographic 
latitude. The inclination of the Sun’s axis of rotation to the ecliptic was found to 
be 7° 10'.5, and the longitude on the ecliptic of the ascending node to be 73° 46'.8 
(epoch 1860.0). The motions of recurrent spots observed in five complete spot 
cycles, from 1878 to 1933, have been analyzed to determine the Sun’s rotation 
period and its dependence upon heliographic latitude. The rotation periods derived 
from the five separate cycles were in excellent agreement, in contrast to the large 
secular change given by spectroscopic observations; the sunspots, however, because 
of their cyclic fluctuation in frequency, cannot be used to determine the rotation 
period year by year. 

Many investigations have been made at Greenwich of the relationships between 
sunspots and terrestrial magnetic disturbances. The general statistical relationship 
between the occurrence of magnetic storms and the sunspot state of the central 
region of the Sun at the times of occurrence of the storms has been fully established. 
The largest storms tend to be associated with the largest spots; on the other hand, 
though the largest spots have a strongly marked tendency to persist for several 
rotations, the largest storms show little or no tendency to recur after 27 days (the 
period of the solar rotation), whilst moderate storms show a marked recurrence 
tendency. When a spot appears to be the source of a magnetic disturbance, the spot 
is usually situated, at the moment when the storm begins, between 2 days east and 
4 days west of the Sun’s central meridian. 

In 1929 solar observations were extended to include visual observations of the 
Sun’s disk in Hα light, using a spectrohelioscope lent by the Mount Wilson 
Observatory. The initial purpose of these observations was to detect any special 
disturbances on the Sun that might be related to the occurrence of magnetic storms. 
Measurements are made, with the line-shifter, of the radial velocities of dark 
markings and, in particular, of those in the neighbourhood of sunspots. The 
intensities of bright Hα flocculi and of prominences relative to the adjacent background 
are determined with a simple form of wedge photometer fitted with a comparison 
lamp. Visual measures of the contour of the normal Fraunhofer line Hα at 
the centre of the disk are also made. 


A study has been made of the bright chromospheric eruptions or solar flares. 
[...] was on the wane when the spectrohelioscope was [...] minimum, was increasing 
rapidly. The number of flares showed a correspondingly rapid increase. By 1937, 
when the number of flares had still further increased, the association between the 
flares and sudden fadings of short-wave radio transmissions, more particularly in 
the case of the larger and brighter flares, had been fully established. The simultaneity 
of the two phenomena implied that the radio fadings were due to a solar agency 
travelling with the speed of light. A direct comparison photometer to enable the 
peak intensities of the flares to be rapidly measured in relation to the adjacent 
continuous spectrum at 15 Å from Hα was constructed in the workshop and installed 
in 1939. 

The flares are found to occur mainly in the vicinity of large sunspots when in the 
stage of active development. In a number of instances the full sequence of phenomena 
has been observed: the brilliant eruption with the synchronous radio fade-out, 
accompanied by a typical bay or crochet on the magnetic trace, followed at an 
interval of the order of 1 day by a great magnetic storm. The solar influence on 
geomagnetic disturbance has led to the study of various geomagnetic phenomena. 
It is found that there is a marked diurnal variation in the times of sudden commencements, 
with a minimum at about 8 to 9 hr. G.M.T. A small proportion of 
sudden commencements have the initial movement in a direction opposite to the 
normal; these ‘reversed’ sudden commencements show an entirely different diurnal 
frequency, with a maximum at the time when the normal sudden commencements 
show minimum frequency. 
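
The two time-scales in the sequence described above, the synchronous radio fade-out and the magnetic storm following after about 1 day, imply very different speeds for the solar agencies concerned. A rough Python sketch, assuming straight-line travel at uniform speed over the mean Sun-Earth distance (the modern value of that distance and the function names are assumptions, not taken from the text):

```python
AU_KM = 149_597_870.7       # mean Sun-Earth distance in km (modern value, assumed)
LIGHT_SPEED_KM_S = 299_792.458


def travel_time_seconds(speed_km_s):
    """Time to cover the Sun-Earth distance at a uniform speed."""
    return AU_KM / speed_km_s


def mean_speed_km_s(delay_days):
    """Mean speed implied by a given delay between flare and terrestrial effect."""
    return AU_KM / (delay_days * 86_400.0)


# Light-time from the Sun, in minutes: the flare is seen and the fade-out
# begins together, since both agencies travel at the speed of light.
print(travel_time_seconds(LIGHT_SPEED_KM_S) / 60.0)

# Mean speed implied by a delay of the order of 1 day before the storm.
print(mean_speed_km_s(1.0))
```

On these assumptions a delay of the order of 1 day corresponds to a mean speed of the order of a thousand kilometres per second or more, far below the speed of light.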

The Brentwood radio station of Cable and Wireless Ltd. reports direct to the 
Royal Observatory any radio fade-out while it is in progress. Ionospheric data are 
sent regularly to Greenwich by the Superintendent, Radio Research Station, Slough, 
the Engineer-in-Chief, Radio Branch, G.P.O., and the Controller (Engineering), 
B.B.C. Information about sunspots and flares observed at Greenwich is supplied 
to various radio research centres, while an informal liaison over solar observations 
in general has been maintained with the Radio Group of the Cavendish Laboratory, 
Cambridge; the Radio Research Station, Slough; and the Operational Research 
Group of the Ministry of Supply. It is of interest to note that though the solar 
observations were commenced at Greenwich purely because of the scientific interest 
in the possible relationship between solar phenomena and geomagnetic disturbance, 
the observations have become of practical value for the forecasting of ionospheric 
conditions. At the same time, they are of increasing importance for the theoretical 
investigation of the processes involved. 


Astrometry and Astrophysics Department 

In this department is included a wide range of investigations in astronomy which 
have developed from the application of photography to astronomy. It includes two 
main sections: (i) astrometry, involving precise measures of positions of star images 
on photographic plates, which may be purely differential, involving small dis- 
placements in position between two different epochs, or may be used to derive 
absolute positions, by using a number of stars as reference points whose positions 
have been separately determined by meridian observations; (ii) astrophysics, which 
is concerned with physical characteristics, brightness, colour, spectra, etc. 

The first development of the work at Greenwich in this direction was the 
participation in the great international project, proposed in 1887, of a photographic 
chart and catalogue of the entire sky. A large number of observatories shared in 
this project, which included the preparation of a catalogue, giving the positions of 
all stars down to a certain brightness, and the publication of star charts showing all 
the stars to a fainter limit of magnitude. To each observatory there was allotted 
a certain region of the sky, the Greenwich Observatory assuming responsibility for 
the cap of 25° radius around the north celestial pole. 13 in. photographic refractors 
of the same focal length, having a scale on the plates of 1 mm. to 1' of arc, each 
plate covering a field of 2° × 2°, were to be used. The astrographic refractor for 
Greenwich was made by Grubb of Dublin, similar telescopes being made for a 
number of other observatories. For the determinations of position, all the stars in 
the Greenwich zones down to the limit of magnitude 9.0 were observed with the 
transit circle in the years 1897 to 1905, while the measurement of the rectangular 
co-ordinates of the star images on the photographic plates was in progress. During 
these measurements, the diameters of the star images on the photographic plates 
were estimated in order to derive photographic magnitudes of the stars by the use 
of an empirical relationship connecting diameter and magnitude. In addition, the 
photographic magnitudes of all the stars brighter than 9.0 were determined with 
a Cooke triplet lens camera of 6 in. aperture and 27 in. focus, covering a large field 
without appreciable distortion. Each region was photographed when at the altitude 
of the pole, the polar region being also photographed on the plate, so that the 
magnitudes of the field stars could be derived by comparison with the standard 
north polar sequence of magnitudes, which had been determined at the Harvard 
Observatory. The sequence of exposures was arranged so that the mean time of 
exposures on the field was in close agreement with the mean time of exposures on 
the pole, the assumption being made that the atmospheric absorption was equal 
under these circumstances for the two regions at the same altitude. This very large 
programme of work was spread over a number of years. The project was an ambitious 
one and proved to be beyond the power of some of the co-operating observatories, 
so much so that it has not even now been brought to completion after the lapse 
of more than half a century. The Greenwich section, both of the chart and of the 
catalogue, was one of the first to be completed. The results are published in 
rectangular co-ordinates, with sufficient data to enable the right ascension and 
declination of any star to be readily derived. For all the stars, however, down to 
magnitude 9.0 on the scale of the Bonn Durchmusterung, together with all fainter 
stars included in the catalogues of the Astronomische Gesellschaft and in Carrington’s 
circumpolar catalogue, 16,780 stars in all, right ascensions, declinations, and 
photographic magnitudes were published in a separate volume. 
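
The plate scale adopted for the astrographic refractors, 1 mm. to 1' of arc, fixes the focal length of the instruments and the linear size of the field on each plate. A small Python sketch of that arithmetic follows; the function names are hypothetical, and distortion away from the plate centre is ignored.

```python
import math


def focal_length_mm(mm_per_arcmin):
    """Focal length giving the stated plate scale at the centre of the field."""
    one_arcmin_rad = math.radians(1.0 / 60.0)
    return mm_per_arcmin / math.tan(one_arcmin_rad)


def field_side_mm(field_deg, mm_per_arcmin):
    """Approximate linear side of a square field on the plate."""
    return field_deg * 60.0 * mm_per_arcmin


focal = focal_length_mm(1.0)          # scale quoted in the text
print(focal)                          # focal length in mm
print(field_side_mm(2.0, 1.0))        # side of the 2-degree field in mm
print(focal / (13.0 * 25.4))          # focal ratio for a 13 in. aperture
```

A scale of 1 mm. to 1' thus corresponds to a focal length of rather less than 3.5 m, and the 2° field to a plate area about 12 cm. square.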

Many programmes of observation undertaken at Greenwich since the completion 
of this work have been planned in order to make the available information about 
the stars in the polar cap, from dec. + 64° to the north pole, more complete. The 
proper motions of the stars for which more than one position had been determined 
were derived by the comparisons of all available catalogues, each catalogue being 
reduced to a common basis by the application of systematic corrections. Then, 
commencing in 1923, the whole region was rephotographed with the astrographic 
telescope for the determination of relative proper motions of the stars. This second 
series of plates was exposed through the glass, so that by placing the corresponding 
plates of the two series of photographs film to film, the two images of each star were 
brought into close proximity. It was necessary only to measure the small displacements 
between corresponding images differentially, and to apply corrections 
for differences in scale and orientation of the two plates in order to derive the 
relative proper motions. Comparisons between the photographic proper motions 
and proper motions based on meridian observations, where the latter were available, 
provided the data for converting from relative to absolute proper motions. The 
probable error of the derived proper motions was about ± 0".8 per century in each 
co-ordinate. The value of the Greenwich Astrographic Catalogue as a source of 
stellar positions was greatly enhanced by the determination of the proper motions. 

The astrographic telescope has been used for a variety of other investigations. 
Mention may be made of a special determination of the magnitudes of the stars in 
the standard north polar sequence, using a coarse wire grating to give sensibly round 
first diffracted images, with a calculable difference in magnitude from the central 
image. The magnitudes of the sequence had been determined at the Harvard and 
Mount Wilson Observatories, but there was an appreciable difference in scale 
between the two determinations. The investigation at Greenwich proved that the 
Mount Wilson scale was correct. The telescope also co-operated, along with the 
26 in. refractor, in the two international programmes of observation of Eros at the 
favourable oppositions of 1901 and 1931 for the determination of the solar parallax. 
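
The "calculable difference in magnitude" between the central image and the first diffracted images given by a coarse wire grating can be illustrated with the elementary theory of a grating of opaque wires and clear gaps. The following Python sketch assumes an idealized grating, Fraunhofer diffraction, and light of a single wave-length; the dimensions of the grating actually used at Greenwich are not given in the text, and the function name is hypothetical.

```python
import math


def order_magnitude_difference(open_fraction, order=1):
    """Magnitude difference between a diffracted image and the central image.

    open_fraction : clear gap divided by grating interval (gap + wire), 0 < f < 1
    order         : order of the diffracted image (1 for the first images)
    """
    if not 0.0 < open_fraction < 1.0:
        raise ValueError("open_fraction must lie strictly between 0 and 1")
    x = math.pi * order * open_fraction
    intensity_ratio = (math.sin(x) / x) ** 2   # I_order / I_central
    if intensity_ratio < 1e-30:
        raise ValueError("this order is suppressed for the chosen grating")
    return -2.5 * math.log10(intensity_ratio)


# Wires as wide as the gaps between them (assumed for illustration).
print(order_magnitude_difference(0.5))
```

For wires equal in width to the gaps, each first diffracted image comes out close to one magnitude fainter than the central image, which is the kind of fixed, computable step that makes the grating useful for testing a magnitude scale.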

The main programme of observation with the 26 in. photographic refractor, 
which was presented to the Observatory by the distinguished surgeon, Sir Henry 
Thompson, has been the measurement of stellar parallaxes. Such observations 
demand great care and precision and, before the commencement of the work at 
Greenwich, had been made mostly in the United States with telescopes of much 
greater focal length. The high latitude of Greenwich is not favourable for stellar 
parallax work, because the short nights in the summer make it impossible to secure 
observations near the times when the parallax factors are at a maximum; more 
photographs are needed for the same weight in the parallax determinations than in 
lower latitudes. The weather at Greenwich, moreover, makes it difficult to obtain 
a proper balance in the observations at the several epochs at intervals of about 
six months which are required. By special care in the adjustment of the lenses of the 
objective, controlled by photographs at intervals to detect any tilt or eccentricity, 
and by other precautions, the results have proved to be of an accuracy comparable 
with that given by longer focus telescopes. 

The observations have been confined to stars in the Greenwich astrographic 
zones. The observing lists include all stars in this region of magnitude 5.5 or 
brighter; stars down to magnitude 7 with annual proper motions greater than 0".10; stars 
down to magnitude 8 with proper motions greater than 0".15; fainter stars with proper 
motions greater than 0".20, together with stars of type K0 with proper motions 
greater than 0".05, the latter being included to obtain information about the distribution 
of these stars in absolute magnitude. The photographs are obtained with 
the use of one of a series of rotating sectors, the aperture being chosen to reduce the 
magnitude of the parallax star to about 11.5, the magnitude of the stars selected as 
comparison stars. For the brightest stars sufficient reduction in magnitude cannot 
be obtained by using a rotating sector; local desensitizing of the central region of 
the plate with copper sulphate was tried for a time, in combination with a rotating 
sector, but the magnitude reduction produced by the desensitizing was uncertain. 
A neutral filter, giving a reduction of about 6 magnitudes, was therefore employed, 
in combination with a suitable sector. 
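
The choice of sector aperture follows from the definition of the magnitude scale: a reduction of Δm magnitudes requires the light to be cut to a fraction 10^(-0.4 Δm). The Python sketch below is illustrative only. It assumes that the plate responds to the time-averaged light passed by the sector, and the function names and example stars are hypothetical.

```python
def sector_open_fraction(star_mag, target_mag=11.5, filter_mag=0.0):
    """Fraction of each rotation for which the sector must be open.

    star_mag   : magnitude of the parallax star
    target_mag : magnitude to which its image is to be reduced
    filter_mag : reduction already supplied by a neutral filter, if any
    """
    reduction = target_mag - star_mag - filter_mag
    if reduction < 0.0:
        raise ValueError("star is already fainter than the target magnitude")
    return 10.0 ** (-0.4 * reduction)


def opening_degrees(open_fraction):
    """Total angular opening of the sector corresponding to an open fraction."""
    return 360.0 * open_fraction


# A star of magnitude 6.5 needs a reduction of 5 magnitudes.
print(opening_degrees(sector_open_fraction(6.5)))

# A star of magnitude 1.5 behind a filter of 6 magnitudes needs 4 more.
print(opening_degrees(sector_open_fraction(1.5, filter_mag=6.0)))
```

The sketch shows why the sector alone fails for the brightest stars: without the filter, a star of magnitude 1.5 would need an opening of a few hundredths of a degree, which is impracticably narrow.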

The observations were at first made by Kapteyn’s method, in which the plate 
is exposed at one epoch, then stored undeveloped, and exposed again at the next 
epoch, the small displacements between the two series of images being measured. 
The method resulted in much loss of weight; good definition at one epoch might be 
followed by bad definition or exposures interrupted by cloud at the second. There 
were difficulties in balancing the different epochs. The method was therefore 
abandoned and each plate developed after exposure at the one epoch. For the 
measurement of the series of plates for the determination of the parallax of a star, 
suitable comparison stars were selected; a blank glass plate was then ruled with 
short fine parallel lines near the position of each comparison star, and of the parallax 
star. A plate was placed film down on the ruled plate, and the small displacements 
between each star and the rulings were measured. The ruled plate served as a dummy, 
which permitted of accurate setting of the micrometer wire and with which each 
stellar photograph was compared, thereby making possible the intercomparison 
between the stellar photographs themselves. Three separate exposures were normally 
given on each plate during the first series of observations. The practice was then 
introduced of giving two exposures, the plate being turned through 180° between 
the exposures; this procedure reduces any errors that may be caused by small local 
film distortions. It has been found that two exposures, with intermediate reversal, 
give the same accuracy as three exposures without reversal and with the advantage 
of saving time at the telescope. About 760 determinations of stellar parallax have 
been made since observations were commenced. 

The 26 in. refractor has been used for a number of smaller programmes. Several 
series of photographs of Jupiter’s satellites were obtained at the request of Professor 
de Sitter, to provide material for his determination of the elements of the orbits of 
the satellites and for his investigation into the theory of their motions. Photographic 
magnitudes of stars down to magnitude 14 in a number of Kapteyn’s 
Selected Areas, zones +15°, +30°, +45°, and +60° declination, were determined 
by comparison with the north polar sequence; the magnitudes of 396 stars within 
1° of the north pole were also measured. 

A 30 in. reflector is mounted on the same mounting as the 26 in. refractor; the 
arrangement is not convenient, as it is never possible to use the two telescopes at 
the same time and there can be inconvenient competition between the demands on 
the same mounting for different programmes of work. The reflector has been used 
for the photography of comets and other celestial objects; in particular, a very fine 
series of photographs of Comet Morehouse 1908, whose tail changed markedly from 
night to night, was secured. On photographs taken with it, the eighth satellite of 
Jupiter was discovered by Melotte. The reflector was used also for a programme of 
determination of effective wave-lengths of stars in the north polar cap. Using a 
coarse wire diffraction grating over the end of the telescope, the distance between 
the two first diffracted images depends upon the grating interval, upon the focal 
length and upon the wave-length of the light. The first diffracted images are really 
short spectra. The distance between the points of maximum photographic intensity 
determines the colour or effective wave-length of the star. The points of maximum 
intensity depend upon the distribution of light in the spectrum of the star, and 
upon the intensity curve of the photographic plate. The use of a reflector avoids 
difficulties from chromatic aberration. The growth of the diffracted images is not the 
same for stars of different colours, so there is an exposure-time effect, which must be 
allowed for. The image given by a 10th magnitude star with an exposure of 10 min. 
was used as standard. The change of effective wave-length with spectral type of the 
star depends upon the type of plate employed; with panchromatic plates and a yellow 
screen the change was found to be approximately linear, whereas with blue-sensitive 
plates there was little change between A0 and F8. The dispersion in the 
first diffracted images in this investigation was comparable in amount with atmospheric 
dispersion at a zenith distance of about 60°; the results obtained throw much 
light on the relative displacements for stars of different types due to atmospheric 
dispersion, which may introduce systematic errors into the determination of the 
solar parallax from observations of the minor planet Eros or of any other asteroid. 
In 1931, Mr William Johnston Yapp offered to present a large telescope to the 
Observatory. A 36 in. reflector was decided upon. The telescope was brought into 
use in 1934 for the continuation of a programme of observations of the colour 
temperatures of stars which had been commenced with the 30 in. reflector. The 
programme was a particularly difficult one to undertake at Greenwich, where the 
atmospheric transparency is poor, variable, and not uniform in amount in different 
directions. The colour temperature of a star is the temperature of a black body 
which has the same relative distribution of energy throughout the spectrum as the 
star. The determination of colour temperature involves comparison with a terrestrial 
source whose colour temperature is known; it is convenient, however, to divide the 
investigation into two parts, intercomparing the stars and then making comparisons 
with the terrestrial source. A selection was made of twenty-five stars, of spectral 
types A and B and fairly evenly distributed, to serve as standard stars. These 
standards were intercompared one with another, when at the same altitude. Other 
stars were then compared with one or more of the standard stars. 
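
The definition of colour temperature given above can be made concrete with Planck's law: the ratio of the intensities at a blue and a red wave-length fixes the temperature of the black body having the same relative energy distribution. The Python sketch below is a simplified illustration only. It uses two wave-lengths chosen arbitrarily (430 and 630 nm), the modern value of the second radiation constant, and hypothetical function names; the Greenwich reductions used eight points in each region and are not reproduced here.

```python
import math

C2_M_K = 1.438777e-2   # second radiation constant in metre kelvin (modern value)


def planck_relative(wavelength_m, temp_k):
    """Black-body intensity per unit wave-length, apart from a constant factor."""
    return wavelength_m ** -5 / math.expm1(C2_M_K / (wavelength_m * temp_k))


def blue_red_ratio(temp_k, blue_m=430e-9, red_m=630e-9):
    """Ratio of black-body intensity in the blue to that in the red."""
    return planck_relative(blue_m, temp_k) / planck_relative(red_m, temp_k)


def colour_temperature(ratio, blue_m=430e-9, red_m=630e-9, low=1000.0, high=100000.0):
    """Temperature whose black-body blue/red ratio matches the observed ratio.

    The ratio rises steadily with temperature, so simple bisection suffices.
    """
    for _ in range(100):
        mid = 0.5 * (low + high)
        if blue_red_ratio(mid, blue_m, red_m) < ratio:
            low = mid
        else:
            high = mid
    return 0.5 * (low + high)


# Round trip for an assumed temperature of 10,000 degrees.
print(colour_temperature(blue_red_ratio(10000.0)))
```

At high temperatures the ratio changes only slowly with temperature, so that a small error in the measured ratio produces a large error in the colour temperature of the hottest stars.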

The reflector was employed with a shtless spectrograph. At first a coarse wire 
diffraction grating with dispersion at right angles to that of the spectrograph was 
used to provide a photometric scale. But this was wasteful of observing time. 
A scale spectrograph was therefore used with multiple slits, whose breadths were 
closely in the ratio of 1 : 2:4, the dispersion being approximately the same as that 
of the slitless spectrograph. The exact ratios of the light passing through the three
slits were determined by a half-aperture method. A Lummer-Brodhun cube microphotometer
was used for measuring the spectral intensities, measurements being
made at eight points in the blue and at eight in the red, free from absorption lines. 

As standard source of comparison, a standard acetylene burner, with a nominal 
colour temperature of 2360° K, was used. The burner was specially calibrated at the
National Physical Laboratory. The difference of colour from the stars was reduced 
by the insertion of a blue filter in the beam from the burner, whose absorption was
measured with the scale spectrograph. The burner was placed at a distance of about
600 ft. on the Octagon Room roof. The horizontal reddening of the acetylene flame 
in this distance was determined by special observations.

The colour temperatures of most northern stars brighter than magnitude 4.5 and of spectral
type A or earlier, as well as of many fainter stars of these types and of a selection
of bright F- and G-type stars, have been determined.

A slit spectrograph for use with the 36 in. reflector was completed in 1937. The
optical parts are made of ultra-violet glass, and one-prism or three-prism dispersion
can be used. The spectrograph was mounted towards the end of 1939, on the completion
of the colour-temperature programme, and various tests were made. But
circumstances at that time made it impossible to commence any definite programme 
of observation. Since the war it has been employed in attempts to detect faint blue 
companions in late-type spectroscopic binary systems. 


Miscellaneous programmes

Some items of the work of the Observatory do not come definitely within the 
scope of any particular department, but depend to some extent upon the personal 
interest of individual members of the staff. Expeditions have been sent from time 
to time to various parts of the world to make observations of total eclipses of the
Sun, the programmes being determined by the problems of current importance. It
was the expedition from Greenwich to Brazil for the observation of the total eclipse
of 29 May 1919 which provided the first evidence to confirm Einstein's predicted
displacement of stars in the vicinity of the Sun. The most recent expedition was
a small expedition to Mombasa for the eclipse of 1 November 1948, to test a method
of accurate determination of the position of the Moon, designed to be used on the
occasions of a total eclipse at two widely separated centres, for the purpose of
providing an accurate geodetic connexion. 

The 28 in. refractor, installed in 1886, has been employed for many years for the 
observation of close double stars with a filar micrometer. The measures obtained 
up to 1919 were collected and published in a single volume. The observations were 
used, in conjunction with observations made elsewhere, for the determination of
the dynamical parallaxes of 676 double stars. The parallax of a binary system of
known period and angular dimensions of orbit can be calculated if the combined
mass is known; in the absence of a knowledge of the mass, the error introduced by
assuming the combined mass to be twice that of the Sun is relatively small, as the
mass enters only to the power of 1/3. The parallax so deduced is called the dynamical
parallax. When the binary star has not been observed for a complete period, the 
parallax can be estimated, though with less certainty, from the rate of change of 
angle and distance. 
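The calculation described above can be sketched as follows. This is a minimal illustration, not the reduction actually used at the Observatory: it assumes Kepler's third law in the form a³/P² = M (a in astronomical units, P in years, M in solar masses), so that the parallax in seconds of arc is the angular semi-axis divided by P^(2/3) M^(1/3). The function name and the sample orbit are hypothetical.

```python
def dynamical_parallax(a_arcsec, period_yr, total_mass_solar=2.0):
    """Parallax (seconds of arc) of a visual binary from Kepler's third law.

    a_arcsec         -- angular semi-axis major of the orbit, seconds of arc
    period_yr        -- orbital period, years
    total_mass_solar -- combined mass in solar masses (default: twice the Sun)
    """
    return a_arcsec / (period_yr ** (2.0 / 3.0) * total_mass_solar ** (1.0 / 3.0))


if __name__ == "__main__":
    # Hypothetical orbit: semi-axis 1.0 second of arc, period 100 years.
    assumed = dynamical_parallax(1.0, 100.0)          # mass taken as 2 Suns
    doubled = dynamical_parallax(1.0, 100.0, 4.0)     # true mass twice as large
    print(assumed, doubled, doubled / assumed)
```

Because the mass enters only as its cube root, even doubling the assumed mass alters the deduced parallax by the factor 2^(-1/3), that is by about one-fifth.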

A marked improvement in the accuracy of the measures followed from the 
introduction of a comparison image micrometer, constructed in the Observatory
workshop. A Wollaston prism is used to give a double image of an artificial star, the
separation of the two images being varied by altering the distance of the source
from the prism. Rotation of the prism rotates the position angle of the artificial
binary. Crossed nicols permit the brightness of either image to be varied. A blue
filter is used to give a colour temperature of about K, so that the artificial 
images appear of about the same colour as the stellar images. The position angle, 
separation and magnitudes of the two images are adjusted to be comparable to 
those of the binary star; the four images, two real and two artificial, are set to form 
the corners of a parallelogram. The eye is sensitive to any slight lack of symmetry, 
even under unsteady conditions when the use of a filar micrometer is difficult. The
comparison image micrometer is particularly advantageous in the measurement of 
close pairs and, because no field or wire illumination is required, it enables fainter 
stars to be observed. 


Chronometer Department

In 1821, soon after the control of the Royal Observatory passed from the Master-General
of the Ordnance to the Board of Admiralty, the charge of chronometers
used in H.M. Navy was transferred to the Royal Observatory. In the following year 
public trials of chronometers were instituted. Makers were invited to submit 
chronometers for trial, with a view to purchase by the Admiralty, and money prizes 
of considerable value were offered for the best chronometers. This system of annual 
trials was continued up to the beginning of the World War of 1914-18, though the 
giving of special prizes was discontinued after a few years. These trials had a con- 
siderable influence in stimulating improvements in the design and construction of 
chronometers. 

Because increasing difficulty was found in getting chronometers and navigational 
watches satisfactorily repaired and adjusted by the trade, a Repair Shop was 
started in connexion with the Chronometer Department in 1937, where repairers and
adjusters could be trained to the high degree of skill essential for work on precision
time-pieces. The Repair Shop proved invaluable during the war, when the number
of repairs to be dealt with was very large and when the Chronometer Department
was testing and issuing an average of from 25,000 to 30,000 chronometers and
watches a year. It is expected that the amount of repair work undertaken will
gradually expand, and a scheme has recently been introduced for the indenturing 
and training of apprentices. The facilities of the Repair Shop are available also for 
any fine precision work required in connexion with the construction and modifica- 
tion of instrumental equipment. 

Optical and Electronic Laboratories

The progressive expansion in the scope of the work of the Observatory has 
involved the employment of much specialized equipment, which has had to be 
designed in the Observatory and made in the Workshop. The slitless spectrograph,
the scale spectrograph, and the Lummer-Brodhun cube microphotometer, employed 
in the colour-temperature programme, and the comparison image micrometer for 
double-star observations are a few of the many items of equipment which have been 
made at the Observatory. The satisfactory design of new equipment not infrequently 
requires a considerable amount of preliminary investigations and tests, and the
need for a definite laboratory section has progressively increased. The introduction
of quartz crystal vibrators as standard clocks required the use of much ancillary 
electronic equipment; because much of the special equipment needed was not 
available commercially, it became necessary to design and construct it in the 
Observatory. A laboratory section has accordingly been formed, comprising both 
an optical and an electronics laboratory. Started initially for the special needs of 
the Time Department at the time when the work of that department was expanding
rapidly, the laboratories can meet the requirements of all departments of the 
Observatory. Amongst the electronic equipment which has been designed and 
constructed, mention may be made of a special radio receiver for the reception of 
radio time signals transmitted on the long-wave band with low carrier frequency, 
designed to secure constant lag, a high degree of selectivity, and a steep build-up 
curve, and with provision for selecting any particular point on the build-up curve 
and for securing constant output voltage; of a 2 Mc./sec. standard frequency
transmitter, controlled by one of the 100 kc./sec. oscillators; of a receiver for
comparing the frequency of the Droitwich 200 kc./sec. transmissions with any of
the Abinger standards; of a number of double current electronic send relays; and
of a great deal of switching, modulating, and monitoring equipment.


Nautical Almanac Office

In 1766, the year after his appointment as Astronomer Royal, Maskelyne
published the first number of the Nautical Almanac (for the year 1767). The
Almanac was designed for the use of seamen and particularly to facilitate the
employment of the method of lunar distances for determining longitude. It proved
to be a most valuable aid to navigation. Maskelyne continued to produce it
annually for 44 years, until his death in 1811. The computations for the Almanac
were performed in duplicate by computers, mostly working at home, and it acquired
a very well-deserved reputation for high accuracy. In 1781 Maskelyne published
a volume of Tables Requisite to be Used with the Nautical Ephemeris, which was in

[...] succeeded Maskelyne as [...]

he remained nominally responsible for it. The Almanac lost its reputation for
[...] eventually in 1831 a separate Nautical Almanac Office was established
[...] connexion with the Observatory.

[...] the Nautical Almanac Office was again placed under the direction of the
Astronomer Royal and became a branch of the Royal Observatory, though retaining
its own identity and its Superintendent. The Almanac at that time was responsible

Shortly 




presented in a special way, Greenwich hour angle being used instead of right 
ascension, and to a lower degree of accuracy than in the Abridged Nautical Almanac. 
Special Air Navigation Tables were also prepared in the office and published, for use 
with the Air Almanac, to facilitate the rapid derivation of the position of the 
aircraft. 

In 1940 the publication (for the year 1941) of an annual volume of Apparent 
Places of Fundamental Stars was commenced. This volume gives the apparent places, 
at 10-day intervals, for most stars, but at daily intervals for close circumpolar stars, 
of the 1535 stars in the FK3 Fundamental Catalogue; the time determinations at 
all observatories are based upon these positions. The computations of the apparent
places are shared by the United States, France, Germany, and Spain; the Nautical 
Almanac Office is responsible for the co-ordination of the work, for the collation of 
the data, and for the preparation and publication of the volume. 

At the Conference of Commonwealth Surveys in London in August 1947, a strong 
desire was expressed for a special almanac to be prepared and published to meet the 
needs of topographical surveyors. Detailed proposals were therefore prepared in the 
office for a Star Almanac for Land Surveyors and have been approved. The first issue
will be made in 1950 for the year 1951. 

The experience gained with the Air Almanac has confirmed the advantages of 
the method of tabulation of data according to Greenwich hour angle, and has given 
rise to a desire for the revision of the Abridged Nautical Almanac. Various alternative 
arrangements of presentation of the data have been considered, and, after much 
detailed consultation with all classes of users, the final form has been settled. In its 
revised form, which will be issued in 1951 for the year 1952, the Almanac will
tabulate Greenwich hour angle in arc directly, instead of right ascension. 
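The relation between the two forms of tabulation may be made explicit. The sketch below assumes the standard relation that the Greenwich hour angle of a body is the Greenwich sidereal time less its right ascension, converted from time to arc at 15° to the hour and reduced to the range 0° to 360°. The function name is hypothetical and the numerical values are illustrative only.

```python
def greenwich_hour_angle_deg(gst_hours, ra_hours):
    """Greenwich hour angle in degrees of arc, in the range [0, 360).

    gst_hours -- Greenwich sidereal time, hours
    ra_hours  -- right ascension of the body, hours
    """
    return ((gst_hours - ra_hours) * 15.0) % 360.0


if __name__ == "__main__":
    # Sidereal time 6h, right ascension 4h: the body is 2h (30 degrees) west of the meridian.
    print(greenwich_hour_angle_deg(6.0, 4.0))
    # The reduction to [0, 360) handles a right ascension greater than the sidereal time.
    print(greenwich_hour_angle_deg(1.0, 23.0))
```

Tabulating the hour angle directly spares the navigator this subtraction and conversion at the time of observation.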

Special investigations into methods and tables for both sea and air navigation 
have been made, including a comprehensive survey of tables for astronomical polar 
navigation. The office has also been responsible for the preparation and publication 
of various tables, including Seven-Figure Trigonometrical Tables for Every Second
of Time (1939), Five-Figure Tables of Natural Trigonometrical Functions (1941),
Planetary Co-ordinates for the Years 1800-1940, Referred to the Equinox of 1950.0
(1933), and of a continuation volume for the years 1940-60 (1939). 

The office has been a pioneer in the adaptation of computations, formerly 
performed by logarithms, to machines. Because of its wide experience in methods 
of numerical computation and its machine equipment, it was able to provide a 
computing service to deal with a great variety of problems for various Government 
departments, which presented themselves during the war. Much preliminary 
investigation was often needed to discover the best method of solving special 
problems, with the least expenditure of labour and of time. Approval has recently 
been given for the installation of a complete range of Hollerith punched-card
equipment, suitable for general computational work. It is intended to extend the 
use of the equipment, where suitable, to the work of the Observatory as a whole, 
as well as to the more routine work of the office. It is also hoped, eventually, to 
produce copy for some of the office publications automatically on card-operated 
machines. 




The office has a close link with navigational problems and maintains a complete 
library of the navigational almanacs and tables of all countries. It is at present 
engaged on the computational work necessary for the latticing of charts required
for the Decca system of navigation. 

Removal of the Observatory from Greenwich 

The conditions at Greenwich for astronomical observations have progressively 
deteriorated as London has grown outwards beyond the Observatory. The increasing 
pollution of the atmosphere and the increasing brightness of the sky at night have 
combined to affect adversely the quality and nature of the observations. Photometric
and spectrophotometric observations, which require a uniform transparency
in different directions and freedom from rapid variations of transparency, are 
practically impossible when clouds of smoke from nearby power stations and 
factories drift over the Observatory. But every type of observation is affected — 
meridian, solar, visual and photographic; in the exacting work of double star 
observations it has become impossible to observe close doubles which were observed 
with relative ease half a century ago. Under such conditions, the removal of the 
Observatory from Greenwich was essential if the Observatory was to continue to 
make useful contributions to astronomy. With the strong support of the Board of 
Visitors, proposals to remove the Observatory to a new site were submitted to the
Board of Admiralty shortly before the outbreak of war. The war started before 
a decision was reached, and the question of removal had then to be deferred. During 
the war the principal instruments were partially dismantled, the Time Department 
was transferred to Abinger, and the Chronometer Department, with the Repair Shop,
was moved first to Bristol and then to Bradford-on-Avon. The work of the Magnetic 
Observatory was becoming hampered by disturbances from the extension of railway 
electrification, and proposals for its removal to a site remote from railways were 
made. Widespread search for suitable new sites was carried out. After the termina- 
tion of the war, the question of removal was again taken up. A short list of the most 
promising sites was prepared, and these sites were visited by a Committee of the 
Board of Visitors. Finally, it was announced on 11 April 1946 that Herstmonceux 
Castle in Sussex had been chosen as the future home of the Royal Observatory, and 
approval was given for the transfer of the Magnetic Observatory to a site to be 
selected in north Devon at a distance of at least 10 miles from any railway. 

Some 372 acres of ground were acquired with the fifteenth-century castle, providing
adequate space for erecting the various telescopes and for future additions
to the equipment, and a safeguard against near encroachment by undesirable 
developments. A first stage of the removal is in progress. The Chronometer 
Department and the Secretariat have moved to Herstmonceux. A solar building 
to house the photoheliograph, two spectrohelioscopes, and spectrographic equipment
for solar research is nearing completion. The transfer of the Solar Department,
of the Magnetic and Meteorological Department, and of the Nautical Almanac 
Office should be possible during the course of the present year. Further stages of 
the removal, involving the erection of buildings and domes for telescopes, are under
consideration. The 26 and 28 in. refractors, whose domes were seriously damaged
during the war, have been dismantled; the opportunity is being taken for some 
alterations to be made to these telescopes before re-erection takes place. The work 
of some departments of the Observatory is necessarily on a reduced scale during 
the present period of transition. There are, however, great hopes for the future, 
when the Observatory has been fully established in its new home and can reap the 
advantage of the good observing conditions. A selection of several possible sites in 
north Devon for the Magnetic Observatory has been made, and tests of their 
freedom from magnetic anomalies have been made. Some further sites will be
examined before a definite selection is made. 

On the occasion of the commemoration of the tercentenary of the birth of Sir 
Isaac Newton, held in London in July 1946, the President of the Royal Society 
announced that the Chancellor of the Exchequer had agreed to provide funds for 
the construction of a reflecting telescope of 100 in. aperture, to be associated with
the name of Sir Isaac Newton and to be available for use by qualified astronomers 
from all observatories in Great Britain. It has been decided that the telescope will 
be erected in the grounds of the Royal Observatory at Herstmonceux. The telescope 
will be under the administrative control of the Astronomer Royal; a special Board
of Management will be responsible for the scientific direction, including the 
designing of the telescope, the supervision of its construction, the consideration of 
programmes of observation, and the allocation of observing time between the 
various users of the telescope. The Board of Management will consist of the 
Astronomer Royal (Chairman), the Astronomer Royal for Scotland, and the 
Directors of the Cambridge and Oxford University Observatories as ex-officio 
members, together with four Fellows of the Royal Society and four Fellows of the
Royal Astronomical Society. The telescope will enable British astronomers to under- 
take many programmes of observations which have hitherto been impossible because 
of the restricted light-gathering power of the largest telescopes at present in use in 
Great Britain, while the library, workshop, and other facilities of the Royal 
Observatory will be available to all users.



A note on the Riesz method and the method of residues 

By F. C. Auluck and D. S. Kothari, University of Delhi

(Communicated by M. N. Saha, F.R.S. — Received 26 July 1948 —
Revised 1 January 1949)

It is shown that the Riesz method of analytic continuation and the method of residues give
the same results in the classical electromagnetic theory of a point source.


1. It is well known that in the framework of the classical electromagnetic theory,
divergence-free results can be obtained by the $\lambda$-limiting process due to Wentzel,
Dirac and Pauli, or by using the powerful method due to Riesz where the potential
is obtained by analytic continuation to $\alpha = 0$ of an arbitrary parameter $\alpha$. There
is a third method in which the potential at any point is given by a contour integral
(Frenkel 1926). In a recent paper Ma (1947) has established the equivalence, in the case of
the field of a point source, of the $\lambda$-limiting process and the Riesz method. The
present note deals with the equivalence of the method of residues and the Riesz method.
For the sake of completeness a brief description of both these methods is also given.


2. We define the metric tensor as

$$g_{00} = -g_{11} = -g_{22} = -g_{33} = 1, \qquad g_{\mu\nu} = 0 \quad (\mu \neq \nu).$$

The velocity of light is taken as unity. We write $[A, B]$ for the scalar product of two
vectors $A$, $B$.

The Maxwell equations of the electromagnetic field are

$$\Box A_\mu = 4\pi j_\mu, \qquad \frac{\partial A_\mu}{\partial x_\mu} = 0, \tag{1}$$

where $A_\mu$ is the potential 4-vector and $j_\mu$ represents the charge-density and current
4-vector. The field tensor is defined in terms of the potentials by the relation




Let the co-ordinates of an electron be denoted by a 4-vector $z_\mu$ (sometimes written
merely as $z$); $z_\mu$ is a function of the proper time $\tau$ of the electron. The current vector
$j_\mu(x)$ at any space-time point $x$ is given by

$$j_\mu(x) = e \int \dot z_\mu(\tau)\, \delta^4(x - z)\, d\tau; \tag{3}$$

$$\delta^4(x - z) = \delta(x_0 - z_0)\, \delta(x_1 - z_1)\, \delta(x_2 - z_2)\, \delta(x_3 - z_3).$$

Dot denotes differentiation with respect to the proper time $\tau$. The retarded and the
advanced proper times associated with any world point $x$ are defined by

$$[x - z(\tau),\, x - z(\tau)] = 0, \tag{4}$$

where $(x_0 - z_0) > 0$ for the retarded proper time and $(x_0 - z_0) < 0$ for the advanced
proper time.

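We first consider the method of Riesz which has been applied to the classical
fields by Fremberg and its quantization discussed by Gustafson. We begin by defining
$A_\mu^\alpha(x)$ as (Fremberg 1946)

$$A_\mu^\alpha(x) = \frac{4\pi}{H(\alpha)} \int j_\mu(x')\, r^{\alpha-2}\, d^4x', \tag{5}$$

where

$$H(\alpha) = 2^{\alpha+1}\pi\, \Gamma\!\left(\tfrac{\alpha}{2}\right) \Gamma\!\left(\tfrac{\alpha}{2} + 1\right)$$

and $r^2 = [x - x',\, x - x']$.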

The domain of integration consists of the four-dimensional region bounded by the
retrograde light cone having its vertex at $x$ and the three-dimensional hyperplane
$x'_0 = -\infty$. $j_\mu(x)$ describes the source distribution: $\partial j_\mu / \partial x_\mu = 0$. It is readily established
that

$$\frac{\partial A_\mu^\alpha}{\partial x_\mu} = 0, \qquad \Box A_\mu^\alpha(x) = A_\mu^{\alpha-2}(x). \tag{6}$$


The integral defining $A_\mu^\alpha(x)$ is convergent for $\alpha > 0$. By analytic continuation $A_\mu^\alpha(x)$
can be defined for all values of $\alpha$ in the $\alpha$-plane. It can be shown that

$$A_\mu^{-2}(x) = 4\pi j_\mu(x), \tag{7}$$

where $A_\mu^{-2}(x)$ stands for the analytic continuation to $\alpha = -2$ of $A_\mu^\alpha(x)$, and, therefore,
$A_\mu^\alpha(x)$ analytically continued to $\alpha = 0$ represents the solution of Maxwell equations
(1). In the case of a point charge describing the world line $z_\mu(\tau)$, (5) reduces to

$$A_\mu^\alpha(x) = \frac{4\pi e}{H(\alpha)} \int_{-\infty}^{\tau_0} \dot z_\mu(\tau)\, r^{\alpha-2}\, d\tau, \tag{8}$$

where $\tau_0$ is the retarded proper time associated with $x$. [To carry out the analytic
continuation we note the general theorem (see, for instance, Ma (1947)): for any
function $f(\sigma)$ which can be expanded into a Laurent series of the form

[...]

when analytic continuation is performed to $\alpha = 0$, we have

[...] (9)

whatever $X$ may be. [...] is the 'finite part' of $f(\sigma)$.]

When $x$ does not lie on the world line, $A_\mu(x)$ gives the Maxwell retarded potential
for a point charge, and when $x$ lies on the world line we obtain (Ma 1947) from (8),

(10)

where the suffixes $+$ and $-$ indicate that in the expressions $\tau$ refers to the advanced
proper time and the retarded proper time respectively.

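As an illustration of the method we consider the case of an electron at rest. Noting
that for $\alpha \to 0$, $H(\alpha)$ reduces to $4\pi/\alpha$, (8) gives

$$A_0^\alpha(x) = e\alpha \int_{-\infty}^{\tau_0} r^{\alpha-2}\, d\tau, \qquad A_l^\alpha(x) = 0 \quad (l = 1, 2, 3).$$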




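Again $r\,dr = -[x - z(\tau),\, \dot z(\tau)]\,d\tau = -(r^2 + |\mathbf{r}|^2)^{1/2}\,d\tau$,

where $|\mathbf{r}|$ is the length of the three-dimensional part of $x - z$. Hence we have

$$A_0^\alpha(x) = e\alpha \int_0^\infty \frac{r^{\alpha-1}\,dr}{(r^2 + |\mathbf{r}|^2)^{1/2}}, \tag{11}$$

which on using (9) immediately gives

$$A_0(x) = \frac{e}{|\mathbf{r}|} \quad \text{for } |\mathbf{r}| > 0,$$

and $A_0(x) = 0$ for $|\mathbf{r}| = 0$.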

(The analytic continuation of $A_\mu^\alpha(x)$ for $\alpha = 0$ is written simply as $A_\mu(x)$.) Thus at
every accessible point the Riesz potential for an electron at rest reduces to the
Coulomb value, but at the electron itself its value sinks to zero. The divergence at
$|\mathbf{r}| = 0$ is automatically eliminated. In the case of the field we find by first differentiating
(11) with respect to $|\mathbf{r}|$ and then performing analytic continuation that
it has the Coulomb value everywhere except at $|\mathbf{r}| = 0$, where it vanishes. Similarly,
the energy of the electrostatic field outside a sphere of finite radius $|\mathbf{r}|$ is $e^2/2|\mathbf{r}|$,
but the total energy of the field (the electrostatic self-energy) instead of diverging
has the value zero. It should be noted that the self-energy vanishes in spite of the
fact that the field and the energy density have the Coulomb value at all points
(except $|\mathbf{r}| = 0$).*
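The passage to the Coulomb value can be checked numerically. The sketch below is an illustration under assumptions that are not quoted from the paper: the scalar potential of the charge at rest is taken in the form $e\alpha \int_0^\infty r^{\alpha-1}(r^2+\rho^2)^{-1/2}\,dr$ with $\rho = |\mathbf{r}| > 0$, and for $0 < \alpha < 1$ the integral is evaluated by the standard Beta-function result $\tfrac12 \rho^{\alpha-1} B(\alpha/2, (1-\alpha)/2)$. The function name is hypothetical.

```python
import math


def riesz_potential_at_rest(alpha, rho, e=1.0):
    """e * alpha * integral_0^inf r**(alpha-1) / sqrt(r**2 + rho**2) dr.

    Closed form (valid for 0 < alpha < 1, rho > 0):
        integral = 0.5 * rho**(alpha-1) * B(alpha/2, (1-alpha)/2),
    with B(a, b) = Gamma(a) Gamma(b) / Gamma(a + b) and a + b = 1/2.
    """
    if not (0.0 < alpha < 1.0) or rho <= 0.0:
        raise ValueError("requires 0 < alpha < 1 and rho > 0")
    beta = math.gamma(alpha / 2.0) * math.gamma((1.0 - alpha) / 2.0) / math.gamma(0.5)
    return e * alpha * 0.5 * rho ** (alpha - 1.0) * beta


if __name__ == "__main__":
    rho = 2.0
    for alpha in (0.5, 0.1, 0.01, 1e-6):
        # As alpha tends to zero the value approaches e / rho.
        print(alpha, riesz_potential_at_rest(alpha, rho), 1.0 / rho)
```

The factor $\alpha$ cancels the pole of $\Gamma(\alpha/2)$ at $\alpha = 0$, which is the mechanism by which the continuation picks out the finite Coulomb value.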

The Riesz potential thus gives the retarded potential for all points not on the 
world line, and on the world line it reduces to half the difference of the retarded and 
the advanced potentials. Thus, we see that the analytic continuation of (8) to a = 0 
gives the same result as the Dirac-Wentzel method. Ma (1947) has also established 
the equivalence of the two methods for a general point source. 

3. We now turn to the method of residues (Frenkel 1926). For a point source,
describing the world line $z_\mu(\tau)$, consider in the $\tau$-plane the contour integral

$$I = \oint_C \ldots\, d\tau, \qquad s^2 = [x - z(\tau),\, x - z(\tau)]. \tag{12}$$

$s^2$ vanishes for $\tau = \tau_-$ (retarded proper time) and for $\tau = \tau_+$ (advanced proper time).
(In (12) $\tau$ is a complex quantity; when $\tau$ is real we write, as in § 2, $r^2$ in place of $s^2$.)
For the contour $C$ we have the following four cases:

(i) $C$ does not include $\tau_-$ and $\tau_+$: $I = 0$.

(ii) $C$ includes $\tau_-$ but not $\tau_+$: $I$ gives the retarded potential.

(iii) $C$ includes $\tau_+$ but not $\tau_-$: $I$ gives the advanced potential (with a negative
sign).

(iv) $C$ includes both $\tau_-$ and $\tau_+$: $I$ gives the difference of the retarded and the
advanced potentials.

We therefore define $A_\mu(x)$ by

[...] (13)

where [...] $= 1$ for $x \neq z$, and $\tfrac12$ for $x = z$. The contour $C$ is to be taken about the
retarded point only for $x \neq z$; when $x = z$ the retarded and the advanced points
coincide and $C$ includes, therefore, the advanced point as well. It is readily shown
that when $x \neq z$, (13) gives the retarded potential, and for $x = z$ it reduces to half the
difference of the retarded and the advanced potentials.

* It is interesting to compare this result with that of Pryce (1938). The energy-momentum
tensor defined by Pryce makes the energy density for an electron at rest vanish throughout
space, and the self-energy vanishes as a consequence of it.

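We shall show that equation (13) is equivalent to (8) of Riesz's method. Consider
the case when $x$ lies on the world line ($x = z(\tau_0)$). We split the range of integration
in (8) from $-\infty$ to $(\tau_0 - \epsilon)$ and from $(\tau_0 - \epsilon)$ to $\tau_0$ ($\epsilon$ is a small quantity). The integral
from $-\infty$ to $(\tau_0 - \epsilon)$ is convergent for $\alpha < 1$, and, therefore, it vanishes for $\alpha = 0$.
We expand the integrand in the second integral in a power series of $\sigma = (\tau_0 - \tau)$,
and we obtain

$$r^2 = \sigma^2 - \tfrac{1}{12}[\ddot z, \ddot z]\,\sigma^4 + O(\sigma^5), \tag{14}$$

and hence we have

$$A_\mu^\alpha(x) = e\alpha \int_0^\epsilon \dot z_\mu(\tau_0 - \sigma)\, \sigma^{\alpha-2} \left\{ 1 - \tfrac{\alpha-2}{24}[\ddot z, \ddot z]\,\sigma^2 + \ldots \right\} d\sigma. \tag{15}$$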

When analytic continuation to $\alpha = 0$ is performed, (15), according to (9), reduces
to the 'finite part' of the expression

-O') ^ ez^jTQ - cr) (T ^ ez/r) (r - T q) 

ofil— ’ 
which in terms of a contour integral is 


1 r 

2i7rJc 

the contour $C$ being round the point $\tau = \tau_0$. For the case when $x$ does not lie on the
world line, (14) gets replaced by (in the neighbourhood of the retarded point $\tau_-$)

$$r^2 = -2\kappa(\tau_-)(\tau - \tau_-) + (1 - \kappa'(\tau_-))(\tau - \tau_-)^2 - \tfrac13 \kappa''(\tau_-)(\tau - \tau_-)^3 + O((\tau - \tau_-)^4), \tag{16}$$

where

$$\kappa(\tau) = [\dot z,\, x - z], \qquad \kappa'(\tau) = [\ddot z,\, x - z], \qquad \kappa''(\tau) = [\dddot z,\, x - z].$$

Proceeding as before it can be shown that on analytic continuation to $\alpha = 0$, (8)
reduces to (13). The equivalence for a general point source for an electromagnetic
as well as meson field can be similarly established.


References 

Fremberg, N. F. 1946 Proc. Roy. Soc. A, 188, 18.
Frenkel, J. 1926 Lehrbuch der Elektrodynamik, p. 177.
Ma, S. T. 1947 Phys. Rev. 71, 787.
Pryce, M. H. L. 1938 Proc. Roy. Soc. A, 168, 389.





Hot-wire investigation of the wake behind cylinders
at low Reynolds numbers

By L. S. G. Kovasznay

Department of Aeronautics, The Johns Hopkins University,
Baltimore, Md., U.S.A.

(Communicated by Sir Geoffrey Taylor, F.R.S. — Received 17 December 1948)


The hot-wire technique has been used to measure the regular vortex street pattern behind 
a cylinder at low Reynolds number. Measurements of mean velocity distribution were made 
both below and above the critical Reynolds number at which the periodic motion appears. 

Amplitude and phase measurements gave sufficient information for computation of the 
instantaneous flow pattern of the vortex system. The important points resulting from the 
investigation are that (i) the critical Reynolds number at which vortices are shed is 40,
(ii) in the range of Reynolds numbers investigated the vortices are not shed directly from
the cylinder but appear some distance downstream as an instability of the laminar wake. 


Introduction

The regular pattern of the vortex street behind cylinders at low Reynolds numbers
has attracted considerable attention in the past (Kármán 1912). The theoretical
treatment of the problem, since it is based on the assumption of a perfect fluid, does 
not account for the Reynolds number effect. The time frequency of the vortex 
pattern varies with Reynolds number, and this variation has been measured by 
several authors (Fage & Johansen 1927; Relf & Ower 1923; Richardson 1923; 
Rosenhead & Schwabe 1930). 

The sudden appearance of the periodic wake at a certain Reynolds number with 
a definite pattern indicates a laminar instability phenomenon. 

The hot-wire anemometer seemed to be a suitable tool for detailed measurements 
at the low speeds and small linear dimensions involved. 

The work was started at the Department of Aerodynamics, Royal Hungarian 
University for Technical Sciences, in 1943; it was interrupted during the war, and 
continued in the Cavendish Laboratory, Cambridge, England. 


Equipment 


The equipment used in Budapest and that used in Cambridge were quite similar:

                               Budapest (1943)          Cambridge (1946)
wind tunnel working section    20 by 20 in.             16 by 15 in.
speed range                    1.5 to 80 ft./sec.       1 to 60 ft./sec.
background turbulence level    0.18 %                   0.06 %
hot wires (platinum)           0.0002 in. diam.         0.0001 in. diam.
hot-wire equipment             see Kovasznay (1947)     see Townsend (1947)


Measurement of the velocity of the undisturbed flow was made in Budapest by 
Pitot head and high-sensitivity Reichart type torsion balance micromanometer. 
This gave an accuracy of 1 % down to velocities as low as 50 cm./sec. 


In Cambridge no micromanometer of this low range was available, so a special 
low-speed meter was built. 

The need for measuring very low mean speed first suggested the use of hot-wire 
technique. But since the calibration of a thin hot wire is reliable for only a limited 
interval of time, a more rugged instrument was desirable. The instrument finally 
built also worked on the cooling effect of the air stream. 

A steel hypodermic needle (0.7 mm. outside diameter) was heated by 5 to 10 amp.
a.c., from the secondary windings of a special transformer. A copper-constantan
thermocouple was placed in a glass capillary inside the hypodermic needle.

Two identical units were made, and the balanced output of the two thermocouples,
oppositely connected, was fed into a millivoltmeter (Weston, 5 mV range,
5 Ω resistance). One unit was placed in the air stream, the other was kept at constant
mean temperature of the room, and served as reference for maintaining constant 
heating current. 

The instrument was calibrated in the wind tunnel in the following way: a thin
(0.060 in.) resistance wire was mounted across the wind tunnel and heated by d.c.
with a superimposed a.c. (50 c./sec.), resulting in a periodic temperature fluctuation
in the wake. The heat ‘patches’ could be identified in the laminar wake of the wire
quite well by comparing the signal of an exploring hot-wire anemometer with this
a.c. heating voltage, in particular, by noting the phase relation. From the wavelength
of the heat pattern in the wake, the mean speed was determined. (The
instrument was usable in the speed range of 1 to 10 ft./sec.)
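The calibration step can be illustrated with a minimal Python sketch. It assumes that the heat patches are carried along at the mean speed and that the temperature pattern repeats at the 50 c./sec. heating frequency, so that U = f λ. The function names and the probe displacement in the example are hypothetical and are not taken from the paper.

```python
FT_PER_M = 1.0 / 0.3048  # feet per metre (exact, by definition of the foot)


def wavelength_from_phase(separation_m, phase_lag_cycles):
    """Wavelength of the heat pattern from the phase lag (in cycles)
    observed when the exploring wire is moved downstream by separation_m."""
    if separation_m <= 0 or phase_lag_cycles <= 0:
        raise ValueError("separation and phase lag must be positive")
    return separation_m / phase_lag_cycles


def mean_speed(wavelength_m, heating_frequency_hz=50.0):
    """Convection speed of the heat patches, U = f * wavelength.
    Assumes the pattern repeats at the a.c. heating frequency."""
    if wavelength_m <= 0 or heating_frequency_hz <= 0:
        raise ValueError("wavelength and frequency must be positive")
    return heating_frequency_hz * wavelength_m


if __name__ == "__main__":
    # Hypothetical reading: two full cycles of phase shift over 12.192 mm.
    lam = wavelength_from_phase(0.012192, 2.0)
    u = mean_speed(lam)
    print(f"wavelength = {lam * 1000:.3f} mm, U = {u:.4f} m/s = {u * FT_PER_M:.2f} ft/s")
```

With these illustrative numbers the wavelength is about 6.1 mm and the speed about 1 ft./sec., the lower end of the range quoted for the instrument.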

Measurement of frequency and of critical Reynolds number

The frequency of the periodic wake was observed with the hot-wire anemometer.
The maximum signal was obtained when the hot wire was located 5 to 10 diameters
behind the wire and slightly off the centre of the wake.

The hot-wire signal was amplified and fed into the horizontal deflexion system 
of a cathode-ray oscilloscope; an audio-frequency oscillator was used for the vertical 
deflexion. Thus when the two frequencies were equal, the Lissajous figure became 
an ellipse. 

This method gave sufficient accuracy at relatively low Reynolds numbers where 
the motion was extremely regular; but at higher Reynolds numbers, decreasing 
regularity reduced the accuracy. 

The results are expressed in the form of the Strouhal number:

$$ S(R) = \frac{f d}{U_0}, \qquad R = \frac{U_0 d}{\nu}, $$

where $f$ is the observed frequency, $d$ the diameter of the wire, $U_0$ the undisturbed
mean speed, and $\nu$ the kinematic viscosity. Figure 1 is a logarithmic plot of the
results using wires of various diameters.
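As a worked illustration of the two dimensionless groups, the following Python sketch evaluates them for one set of readings. The kinematic viscosity of air (1.5e-5 m²/s), the speed and the frequency are assumed values chosen for the example; they are not measurements from the paper.

```python
def reynolds_number(u0, d, nu):
    """R = U0 * d / nu (all quantities in consistent units)."""
    if nu <= 0:
        raise ValueError("kinematic viscosity must be positive")
    return u0 * d / nu


def strouhal_number(f, d, u0):
    """S = f * d / U0 (all quantities in consistent units)."""
    if u0 <= 0:
        raise ValueError("mean speed must be positive")
    return f * d / u0


if __name__ == "__main__":
    d = 1.0e-3     # wire diameter, m (a 1 mm wire is used in the paper)
    nu = 1.5e-5    # kinematic viscosity of air, m^2/s (assumed)
    u0 = 0.84      # undisturbed speed, m/s (assumed, chosen to give R = 56)
    f = 100.0      # observed frequency, c./sec. (hypothetical)
    print(f"R = {reynolds_number(u0, d, nu):.1f}, S = {strouhal_number(f, d, u0):.3f}")
```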

Careful measurements were made to determine the Reynolds number at which
the periodic wake appears. The critical Reynolds number was found to be R_crit =

This value was obtained with wires of very high length/diameter ratio (the length 


Figure 1


Table 1. Mean velocity distribution in the laminar wake, R = 34

     x = ?               x = ?               x = ?
    y/d   U/U0          y/d   U/U0          y/d   U/U0
  -4.76  1.018        -3.95  1.035        -3.40  1.012
  -4.13  1.037        -2.66  1.053        -2.76  1.012
  -3.65  1.046        -1.98  1.035        -2.17  1.012
  -2.87  1.073        -1.40  0.910        -1.60  0.867
  -2.25  1.110        -1.07  0.687        -0.81  0.468
  -1.65  1.165        -0.76  0.425        -0.50  0.280
  -1.26  1.090        -0.42  0.216         0     0.180
  -0.98  0.823         0     0.153        +0.29  0.243
  -0.81  0.550        +0.26  0.193        +0.78  0.506
  -0.64  0.300        +0.58  0.348        +1.18  0.784
  -0.33  0.094        +0.83  0.538        +1.79  0.950
  -0.02  0.065        +1.12  0.738        +2.41  0.988
  +0.32  0.102        +1.46  0.884        +3.02  0.988
  +0.65  0.326        +2.11  0.960
  +0.81  0.578        +3.34  0.973
  +0.99  0.833
  +1.50  1.018
  +1.93  1.095
  +4.88  0.990

     x = 10d             x = 20d
    y/d   U/U0          y/d   U/U0
  -3.79  1.000        -3.79  1.000
  -2.55  0.946        -3.15  1.000
  -1.90  0.915        -2.54  0.891
  -1.27  0.742        -1.89  0.861
  -0.81  0.596        -1.26  0.755
  -0.48  0.454        -0.65  0.630
  +0.10  0.355        +0.02  0.596
  +0.53  0.482        +0.65  0.630
  +0.93  0.672        +1.30  0.865
  +1.26  0.808        +1.96  0.983
  +1.89  0.974        +2.56  1.015
  +2.54  1.035
  +3.85  1.005





of the wire was always 50 cm.). The transition does not seem to have any
hysteresis; the critical value is the same for both increasing and decreasing mean
velocity.

If the velocity was set very slightly below the critical value, a slight disturbance,
e.g. ‘plucking’ the wire, started the development of periodicity, but it decayed
slowly as the external disturbance ceased. This phenomenon gives strong evidence
for the existence of laminar oscillations in the viscous wake at a certain critical
Reynolds number. In earlier measurements the wall effects seemed to modify the
results (Thom 1933).

Figure 2




Table 2. Mean velocity and r.m.s. velocity fluctuation in the periodic wake, R = 56

         x = 2d                     x = 3.5d                    x = 6d
    y/d   U/U0  100u'/U0       y/d   U/U0  100u'/U0       y/d   U/U0  100u'/U0
  -3.64  1.000      0.17     -5.23  1.000      0.26     -5.04  1.000      0.43
  -3.02  1.045      0.38     -2.82  1.070      1.01     -3.64  1.010      0.76
  -1.73  1.120      0.72     -1.93  1.080      2.47     -3.04  1.020      1.34
  -1.41  1.120      0.88     -1.16  0.930      6.78     -2.39  1.020      2.63
  -1.07  1.000      1.02     -0.84  0.660      8.70     -1.78  1.025      5.00
  -0.86  0.773      1.06     -0.53  0.289      6.97     -1.14  0.880      9.11
  -0.51  0.152      0.42     -0.22  0.080      1.94     -0.79  0.639     12.18
  -0.24  0.048      0.12     -0.10  0.068      1.13     -0.51  0.472     12.05
   0     0.039      0.07      0     0.062      0.70     -0.39  0.403      9.95
  +0.25  0.048      0.15     +0.13  0.068      1.09     -0.19  0.300      6.96
                                                        -0.09  0.305      5.58
                                                         0     0.281      3.96
                                                        +0.11  0.276      4.88
                                                        +0.25  0.291      7.65
                                                        +0.25  0.351      7.63
                                                        +0.42  0.443     10.63

         x = 8d                     x = 12d                     x = 20d
    y/d   U/U0  100u'/U0       y/d   U/U0  100u'/U0       y/d   U/U0  100u'/U0
  -6.36  1.000      0.09    -10.32   1.00      0.19    -13.30   1.00      0.24
  -5.65  1.020      0.32     -6.72   1.00      0.40     -6.36   1.00      0.32
  -4.36  1.040      0.95     -5.08   1.00      0.74     -5.21   1.00      0.91
  -3.73  1.030      1.64     -4.45   1.00      1.20     -4.67   1.00      1.46
  -3.10  1.030      2.87     -3.81   0.99      2.16     -3.94   1.00      2.52
  -2.50  1.020      5.38     -3.18   0.99      3.95     -3.30   1.00      4.24
  -1.84  0.990      8.70     -2.64   0.97      6.66     -2.67   0.96      6.60
  -1.54  0.960     11.32     -1.90   0.92      9.10     -2.03   0.92      5.96
  -1.23  0.870     13.50     -1.46   0.85      9.55     -1.40   0.86
  -0.86  0.805     13.38     -0.83   0.77      7.40     -0.94   0.81      3.61
  -0.47  0.635      9.13     -0.61   0.74      6.04      0      0.79      2.30
   0     0.590      5.18     -0.29   0.73      5.00     +1.11   0.85      4.61
  +0.51  0.647      9.32      0      0.69      4.25
  +0.84  0.720     11.85      0      0.69      4.04
  +1.12  0.800     11.95     +0.33   0.72      8.42
  +1.66  0.945      9.40     +1.41   0.86      8.42
  +2.26  0.980      5.62
  +3.19  0.980      2.14
  +4.63  0.980      0.69

         x = 40d                    x = 100d
    y/d   U/U0  100u'/U0       y/d   U/U0  100u'/U0
 -11.6    1.00      0.14     -4.07   1.00      0.43
  -6.3    1.00      0.37     -2.54   0.96      0.90
  -5.4    1.00      1.00     -1.91   0.91      0.88
  -4.12   1.00      2.30     -1.27   0.91      0.59
  -2.86   0.96      3.35     -0.64   0.87      0.38
  -2.22   0.89      2.63      0      0.83      0
  -1.59   0.86      1.86     +2.03   0.88      0.76
  -0.95   0.83      1.27
   0      0.80      1.00








Measurements of mean flow

Experiments were carried out to obtain the velocity distribution behind a cylinder 
both slightly below and slightly above the critical Reynolds numbers. The cylinder 
used was a 1 mm. steel wire. The measurement was difficult because of the low speed 
and the necessarily small traversing steps. 



Figure 3


The hot wire could not be supported by long prongs parallel to the flow, because
this caused an instability in the wake that propagated upstream. Therefore the probe
had to enter the wake from the side. This caused slight asymmetry in the readings,
but it was assumed that the half-distribution measured before the centre was reached
was unaffected. The data are given in table 1. Figure 2 shows the velocity distribution
for the laminar case at R = 34.







The measurement of the mean velocity in the periodic wake is more difficult. As
the velocity fluctuates, the mean readings of the hot wire are affected by the non-linear
response, so that a ‘rectifying effect’ appears. Correction of the mean velocity
readings was made on the basis of the non-linear hot-wire characteristics with the use
of the velocity fluctuation measurements at the same point. The treatment is identical
with the computation of the distortion of a sine wave by a non-linear amplifier stage.
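The rectifying effect can be sketched numerically. The Python example below is only an illustration under stated assumptions: the hot-wire characteristic is taken to follow a King's-law form E² = A + B√U with made-up constants A and B (the paper does not give its calibration), and the correction is the usual second-order expansion of the mean of a non-linear response, not necessarily the author's exact procedure.

```python
import math

A, B = 1.0, 1.0  # hypothetical King's-law constants, E^2 = A + B*sqrt(U)


def response(u):
    """Assumed hot-wire characteristic E(U)."""
    return math.sqrt(A + B * math.sqrt(u))


def inverse_response(e):
    """Velocity that a steady reading e would indicate."""
    return ((e * e - A) / B) ** 2


def apparent_mean_velocity(u0, amplitude, n=2000):
    """Velocity indicated by the time-averaged reading when the true velocity
    is u0 * (1 + amplitude * cos(phase)), averaged over one full period."""
    mean_e = sum(
        response(u0 * (1.0 + amplitude * math.cos(2.0 * math.pi * k / n)))
        for k in range(n)
    ) / n
    return inverse_response(mean_e)


def corrected_mean_velocity(u_apparent, u_rms):
    """Second-order correction: U0 ~ U_app - 0.5 * (E''/E') * u_rms^2,
    with the derivatives estimated by central differences at U_app."""
    h = 1e-4 * u_apparent
    e1 = (response(u_apparent + h) - response(u_apparent - h)) / (2.0 * h)
    e2 = (response(u_apparent + h) - 2.0 * response(u_apparent)
          + response(u_apparent - h)) / (h * h)
    return u_apparent - 0.5 * (e2 / e1) * u_rms ** 2


if __name__ == "__main__":
    u0, a = 1.0, 0.2                      # true mean and relative amplitude
    u_app = apparent_mean_velocity(u0, a)
    u_cor = corrected_mean_velocity(u_app, a * u0 / math.sqrt(2.0))
    print(f"apparent = {u_app:.5f}, corrected = {u_cor:.5f}, true = {u0:.5f}")
```

Because the assumed characteristic is concave, the uncorrected reading falls below the true mean; the r.m.s. fluctuation measured at the same point supplies the correction.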

The fluctuations made detection of zero mean velocity almost impossible, since
a hot wire set parallel to the z-axis measures the absolute value of the velocity vector
in a two-dimensional (x, y) flow. The data of mean speed and fluctuation are given in
table 2.

Figure 3 shows the mean velocity distribution behind the 1 mm. diameter wire at
Reynolds number R = 56. The vortex street is fully developed.



Figure 5 (R = 56)


Mean flow pattern 


The two-dimensionality of the flow was experimentally verified. Thus the hot-wire
measurement gives the absolute magnitude of the velocity, and therefore
a stream function exists:

$$ U_x = \frac{\partial\psi_0}{\partial y}, \qquad U_y = -\frac{\partial\psi_0}{\partial x}, \qquad U = \frac{\partial\psi_0}{\partial n}, $$

where $U_x$, $U_y$ are the components of mean velocity and $n$ is the direction
perpendicular to the streamlines. If $U(x, y)$ is known from measurement, the
‘$\psi_0$ = constant’ lines (streamlines) can be constructed graphically by using small
circles with diameter $1/U$, and drawing the ‘$\psi_0$ = constant’ lines tangential to the
circles.

The data given in figures 2 and 3 were used to construct figures 4 and 5. 
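The graphical construction rests on the fact that neighbouring streamlines are spaced inversely as the local speed. A minimal numerical counterpart, offered only as a sketch, integrates U/U_0 across one traverse and interpolates the positions of equally spaced stream-function levels. It assumes that the velocity is nearly parallel to the x-axis on the traverse, so that the stream function in units of U_0 d is the integral of U/U_0 over y/d. The data are the x = 20d traverse of table 1; the contour step of 0.5 is arbitrary.

```python
Y_OVER_D = [-3.79, -3.15, -2.54, -1.89, -1.26, -0.65, 0.02, 0.65, 1.30, 1.96, 2.56]
U_OVER_U0 = [1.000, 1.000, 0.891, 0.861, 0.755, 0.630, 0.596, 0.630, 0.865, 0.983, 1.015]


def stream_function(y, u):
    """Trapezoidal integral of u over y, starting from zero at y[0]."""
    psi = [0.0]
    for i in range(1, len(y)):
        psi.append(psi[-1] + 0.5 * (u[i] + u[i - 1]) * (y[i] - y[i - 1]))
    return psi


def contour_positions(y, psi, step):
    """Positions y at which psi reaches step, 2*step, ... (linear interpolation).
    Requires psi to increase along the traverse, i.e. no reversed flow."""
    positions = []
    level = step
    i = 1
    while i < len(y):
        if psi[i] >= level:
            frac = (level - psi[i - 1]) / (psi[i] - psi[i - 1])
            positions.append(y[i - 1] + frac * (y[i] - y[i - 1]))
            level += step
        else:
            i += 1
    return positions


if __name__ == "__main__":
    psi = stream_function(Y_OVER_D, U_OVER_U0)
    lines = contour_positions(Y_OVER_D, psi, 0.5)
    gaps = [b - a for a, b in zip(lines, lines[1:])]
    print("streamline positions y/d:", [round(p, 2) for p in lines])
    print("spacings:", [round(g, 2) for g in gaps])
```

The spacings come out widest near the centre of the wake, where the speed is lowest, which is what the circle construction expresses.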




Of course reversed flow cannot be detected with the standard type of single hot- 
wire anemometer. The scale of the flow phenomenon was so small that the use of 
direction meters was impracticable. Therefore it can only be stated that the shaded 
areas are probably regions of reversed flow. In these regions no substantial change 
in stream function could be detected. 



Figure 6



Figure 7 

The main difference between the mean flows below (figure 4) and above (figure 5)
the critical Reynolds number is that in the simple laminar flow the wake is more
extensive. In the vortex street, the higher shearing stresses even out the differences
more rapidly. In these two figures the spacing of the contour lines corresponds to




Figure 8

Measurement of ‘root-mean-square’ fluctuations

Since the velocity fluctuations in the vortex street were almost one order of
magnitude smaller than the mean velocities, it was possible to determine the amplitude
of this periodic motion by a very convenient method: the root-mean-square velocity
fluctuation, as determined by a simple hot-wire ‘u′-meter’, gave a good measure of
this amplitude. Just as in the case of measurement of low turbulence levels, the
second-order contribution from the v′ component of velocity fluctuation can be
neglected.

The data for the mean flow and the fluctuations are given in tables 1 and 2. It is
remarkable that the fluctuations close to the cylinder are very small and that they
develop a maximum of intensity only in the vicinity of x = 7d. The distribution of
fluctuation in traverses at x/d = 2, 3.5, 6, 8, 12, 20 and 40 is given in figure 8. The
individual diagrams are staggered, and the ordinate scales alternate between left
and right sides of the figure.



Figure 9 (R = 56)


The closest traverse, at x/d = 2, shows that very near to the cylinder there is
practically no fluctuation present. Thus, the vortices are not ‘shed’ from the cylinder
at this low Reynolds number, but develop several diameters downstream. Measurements
were made on one side of the wake only, because, as mentioned earlier, the
hot-wire holder interfered with the wake when it crossed the middle portion. The
oscillograph records showed that the instantaneous pattern then became slightly
unstable.

The maxima of the fluctuations are plotted against downstream distance in
figure 9. The maximum value of 14 % occurs at x/d = 7.

In general the fluctuations studied here are of pure periodic type; the pattern 
remained unchanged on the oscilloscope screen for hours, showing a surprising 






stability. As was to be expected for such a flow pattern, the fluctuation at the middle 
of the wake was always pure double frequency. The double-frequency component 
decreased rapidly with distance from the wake centre, and disappeared almost 
completely before the point of maximum fluctuation was reached. 

A contour diagram of the r.m.s. fluctuations is given in figure 10. The
difference between contour lines is 2 % of the undisturbed mean speed (Δu′/U_0 = 0.02).


Instantaneous-flow pattern

Since the flow pattern is completely regular and stable at Reynolds numbers not
exceeding R = 160, phase measurements were possible so that the complete velocity
distribution could be obtained.



Figure 11


Figure 11 shows the block diagram of the electronic set-up used in point-to-point
... was a reference for timing, and the other ... mechanism. The reference hot wire
was used to ... the time base (i.e. the sweep frequency) of the cathode-ray oscilloscope.
... reproducible oscillogram could be obtained, ... wire was moved along, the pattern
on the screen of the oscilloscope ... and therefore phase relations could be easily
... the spacing ... pattern. Typical oscillograph records are shown in figure 12.






For these records, x/d = 6, and the value of y/d is given on each record. The records
were taken from synchronized steady patterns with exposures ^ sec. Slight intensity 
variations are noticeable due to stroboscopic effect. 

The relative phase of second and first harmonics varies with distance downstream.
The first harmonic is asymmetric with respect to the x-axis, the second is symmetric.
The phase relations are shown in table 3.


Figure 12


The velocity fluctuation distribution can be represented at any instant by

$$ U = U_1(x, y)\cos 2\pi[\zeta_1(x) + ft] + U_2(x, y)\cos 4\pi[\zeta_2(x) + ft], $$

where $U_1$ and $U_2$ vary slowly with x but rapidly with y. $U_1$ is the odd function of y;
$U_2$ is the even function. $\zeta_1(x)$ and $\zeta_2(x)$ are almost linear functions of x, taking care
of non-uniform spacing. The time frequency $f$ is constant because the sequence of
eddies is continuous.





Table 3. Phase of the first and second components of the periodic wake at R = 56

   ζ1      x/d         ζ2      x/d
   0       2           0       2
   0.5     7.5         0.25    5.2
   1.0    12.0         0.50    8.0
   1.5    16.2         0.75   10.5
   2.0    20.0         1.00   13.0
   2.5    23.7         1.25   15.5
   3.0    27.4         1.50   17.5
   3.5    31.1         1.75   19.3
   4.0    34.8         2.00   20.9
   4.5    38.5         2.25   23.0
                       2.50   25.1
                       2.75   27.2
                       3.00   29.5
                       3.25   31.3
                       3.50   33.0
                       3.75   34.9
                       4.00   36.7
                       4.25   38.4
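The statement that the phase is an almost linear function of x can be checked from table 3 with a short Python sketch. It assumes that the first column gives the phase of the first harmonic in cycles and the second column the corresponding x/d, so that the local spacing of the pattern is the change in x/d per cycle; the entry 31.1 is a reading of a damaged figure in the source and should be treated with caution.

```python
ZETA_1 = [0.0, 0.5, 1.0, 1.5, 2.0, 2.5, 3.0, 3.5, 4.0, 4.5]            # phase, cycles
X_OVER_D = [2.0, 7.5, 12.0, 16.2, 20.0, 23.7, 27.4, 31.1, 34.8, 38.5]  # station, x/d


def local_wavelengths(phase, x):
    """Spacing of the pattern (in diameters) between successive entries:
    change in x/d divided by change in phase."""
    return [
        (x[i + 1] - x[i]) / (phase[i + 1] - phase[i])
        for i in range(len(x) - 1)
    ]


if __name__ == "__main__":
    for x_mid, lam in zip(
        [0.5 * (a + b) for a, b in zip(X_OVER_D, X_OVER_D[1:])],
        local_wavelengths(ZETA_1, X_OVER_D),
    ):
        print(f"x/d ~ {x_mid:5.2f}: spacing ~ {lam:4.1f} d")
```

On this reading the spacing is largest close to the cylinder and settles to a nearly constant value farther downstream, which is where the phase becomes linear in x.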


The instantaneous total stream function $\psi(x, y)$ was constructed by using the
following simplifications.

(1) $U_1$ was computed as $\sqrt{2}\,u'$ in the region where no double frequency was visible.

(2) $U_2$ was computed on the assumption that it vanished parabolically in the
±y directions:

where $y_1$ is the value of y for which the total fluctuation is a maximum.

(3) The velocity fluctuation measured by the hot wire was taken as identical with
the velocity fluctuation in the x-direction.

The flow was constructed in the co-ordinate system fixed to the mean motion of the
fluid; the velocity is 0 at infinity. The computation of $\psi$ was made in three steps:
(a) the stream function of the mean flow ($\psi_0$), (b) the first harmonic of the fluctuation
($\psi_1$), (c) the second harmonic ($\psi_2$). The resulting pattern is given in figure 13, and
the lines correspond to differences in the stream function $\Delta\psi = 0.1\,U_0 d$; the dotted
lines are half-values between two full lines. The streamline pattern in figure 13 is
viewed relative to the undisturbed flow at infinity. The diagram clearly shows the
development and decay of this motion.


Conclusions

The experiments have established the following facts: 

(1) The critical Reynolds number, i.e. that at which the vortex street sets in, is 
a well-defined value for infinitely long circular cylinders, and the pattern that appears 
is unique and stable. 

(2) The ‘vortices’ develop some distance downstream within the Reynolds number
range 40 to 160, and are not shed directly from the cylinder. Consequently the
phenomenon can properly be considered as an instability of the laminar wake, that
develops up to an amplitude limit, but dies out before becoming turbulent, at least
in this low Reynolds number range.

The major part of the present work was done in the Cavendish Laboratory, and 
the author wishes to express his indebtedness to Sir Geoffrey Taylor for his super- 
vision and criticism. Thanks are due also to A. A. Townsend for making available 
his hot-wire apparatus. 

The author was holding a British Council research scholarship when carrying out 
the research. 


References

Fage, A. & Johansen, F. C. 1927 Proc. Roy. Soc. A, 116, 170.
Kármán, Th. v. 1912 Nachr. Ges. Wiss. Göttingen, p. 547.
Kovasznay, L. S. G. 1947 N.A.C.A. Tech. Mem. no. 1130.
Relf, E. F. & Ower, E. 1923 A.R.C. Reports & Memoranda, no. 825.
Richardson, E. G. 1923 Proc. Phys. Soc. 36, 153.
Rosenhead, L. & Schwabe, M. 1930 Proc. Roy. Soc. A, 129, 115.
Thom, A. 1933 Proc. Roy. Soc. A, 141, 651.
Townsend, A. A. 1947 Proc. Camb. Phil. Soc. 43, 560.


Plastic deformation of silver chloride 
I. Internal stresses and the glide mechanism 

By J. F. Nye, Cavendish Laboratory, University of Cambridge

{Communicated by E. Orowan, F.R.S. — Received 20 December 1948 — 
Revised 4 April 1949 — Read 19 May 1949) 

[Plates 5 to 8] 

The experiments of Obreimow & Schubnikoff (1927) on the birefringence produced by the
plastic deformation of single crystals of rock salt have been extended to a polycrystalline
material. Rolled sheets of silver chloride have been recrystallized and then deformed
plastically in various ways, by simple extension and by bending, for example. The sheets are
transparent and very ductile and, since silver chloride is cubic in structure, the birefringence
patterns observed under the microscope provide a picture of the distribution of the internal
stresses uncomplicated by natural double refractions. It is suggested that results obtained
with this optical method are applicable to metals.

Silver chloride appears to deform by glide, and when the glide packets are observed on edge
a characteristic pattern of parallel birefringent bands is visible. The relation of the glide plane
and glide direction to the crystal structure has been studied by making observations upon
these bands and upon the glide lines formed on the surfaces of bars of square cross-section
consisting effectively of chains of single crystals. The orientations of the fifteen sets of glide
bands examined in this way were all consistent with glide movements in a <110> direction;
the glide plane, however, was not always a crystallographic plane of low indices. In the six
cases in which the measurement was possible, it lay within 9° of the plane in the <110> zone
on which the maximum shear stress, resolved in the <110> direction, acted. It is concluded
that silver chloride deforms by ‘pencil glide’, the mechanism postulated by Taylor & Elam
in 1926 to explain the plastic behaviour of α-iron. The transmission of pencil glide across
grain boundaries is discussed.





Glide in silver chloride 

The residual stresses observed by the optical method in polycrystalline sheets may be
divided into three groups: (1) A system of stresses set up between the glide zones of each
grain and alternating with a period equal to the spacing of the glide zones. A detailed analysis
of these is given in the second paper (part II). (2) Alternating stresses produced when a
system of glide zones meets a grain boundary. (3) ‘Heyn stresses’ produced by the
non-uniformity of plastic deformation from grain to grain.


1. Introduction

If one deforms a crystalline body plastically and removes the external forces
it is found that stresses remain locked up in the interior. Internal stresses of 
this type may be present both in single crystals and in polycrystalline aggregates, 
and much attention has been paid in recent years to their occurrence in cold-worked, 
polycrystalline metals. The X-ray diffraction method, which has been the principal 
means of investigation, unfortunately suffers at present from a serious limitation: 
the stresses that are investigated are often distributed in space in patterns of micro- 
scopic scale, and they may vary considerably in distances comparable with the 
width of the X-ray beam. For this reason, an X-ray beam, while often capable of 
indicating the presence and the magnitude of internal stresses, is too coarse an 
instrument to use for a detailed exploration of their distribution in space. 

Another method of approach to the problem, which is both powerful and direct, 
is to make use of the photoelastic effect in transparent crystals. Cubic crystals are 
particularly suitable, for, since they are not naturally doubly refracting, their
birefringence gives a direct picture, subject to certain restrictions, of the magnitude, 
direction and spatial distribution of the internal stresses. 

In the present state of the theory of plastic deformation of crystalline materials 
the restriction to transparent materials, inherent in this method, is not serious. 
Since there is no apparent connexion between the plastic properties of a crystalline
substance and its transparency or opacity, it is reasonable to assume that results 
derived from experiments on a transparent material will be applicable to other 
substances whose mechanism of plastic deformation is essentially similar. In 
particular, the results should be applicable to metals. 

The first observation of artificial birefringence caused by plastic deformation of
a crystal seems to have been made in 1815 by Brewster (1818, 1853). The ‘lamellar
polarisation’ that Brewster noticed in rock salt, diamond and certain other cubic
crystals was later studied by Biot, but neither experimenter appears to have
appreciated that in some of these crystals it was due to plastic deformation (Reusch
1867). More recently, Obreimow & Schubnikoff (1927) have made careful observations
of the way the birefringent bands in rock salt appear during the course of
plastic deformation, and the nature of the deformation has been further investigated
by Brilliantow & Obreimow (1934, 1937) by the use of an ingenious combination of
X-ray and optical methods. The photoelastic method does not seem to have been
used to explore, in detail, the plastic deformation of any other crystals, nor has it
been applied to the internal stresses arising in polycrystalline materials. To approach
this problem a substance is needed that is transparent, polycrystalline, ductile, and
preferably optically isotropic in the unstressed state. Silver chloride possesses all
the required properties and, in particular, is very ductile at room temperature. Its
exceptional qualities have earned it the name of ‘transparent metal’.

In this paper we try to show how the characteristic appearance, under the polarizing
microscope, of deformed sheets of silver chloride may be explained as the
result of internal stresses. There follows a description of experiments to determine
the mechanism by which silver chloride deforms plastically, from which we conclude
that the process is ‘pencil glide’. In part II of this series, with the same general title
(Nye 1949), which may be read independently, we give a detailed qualitative explanation
of the photoelastic pattern produced between the glide zones in terms of the
stresses caused by dislocations.

In addition to the effects of plastic deformation, various optical phenomena make
their appearance in the sheets either without stress being applied, or in the elastic
stress range. They are interesting as curiosities of the behaviour of crystals in
polarized light, but, as most of them have only an indirect connexion with the plastic
properties, they are being dealt with in a separate paper.

Short accounts of the experiments have been given in Nature (Nye 1948a, b) and
some of the phenomena were demonstrated at a Conversazione held by the Royal
Society on 29 May 1947.


2. Preparation of the sheets

The silver chloride used in the experiments was obtained from the Harshaw
Chemical Company of Cleveland, Ohio, U.S.A. It is understood that large blocks of
the material are formed from the melt under carefully controlled conditions of
solidification and cooling, and are then hot rolled. Sheets of thicknesses from 0.3 to
1.0 mm., measuring about 5 × 6 cm., were supplied. A Leitz Polarizing Microscope,
Model CM, was used for the observations. Under ordinary illumination these sheets
appeared transparent and colourless, the only marks being scratches on the surface
caused by the rolling; but when viewed in white light between crossed nicols, at low
magnification, they showed many light areas, for the rolling treatment, as was to be
expected, had left them with residual stresses. The sheets were so highly distorted
that only rarely could one grain be distinguished from another; the grain boundaries
could be made visible, however, by etching for a few seconds in a strong solution
of sodium thiosulphate. Most of the experiments to be described were made on strips
measuring 30 × 5 mm. cut from these sheets with a razor blade. The strips were first
recrystallized by heating in air for 2 hr. at 400° C (the melting-point of silver chloride
is 455° C). After cooling in a furnace (this point will be returned to later in § 9), they
were again examined under the microscope between crossed nicols. All the light areas
had disappeared and the strips were therefore judged to be free from all but the
smallest internal stresses.

The boundaries between the new, recrystallized grains could be seen on the
surfaces of the strips in unpolarized light, when they appeared as fine black
lines. The grain size was such that in most places the strips were only one grain
thick.



3. The effect of plastic deformation

With sheets of the thicknesses used, the path differences caused by stress birefringence
were always less than the wave-length of light. The colour of the restored
light was usually white; occasionally, with high loads and thick strips, the first order
yellow was reached.

The stress-strain curve of an annealed single crystal of silver chloride, as measured
by Stepanow (1934), is similar in shape to that of annealed copper in that the elastic
limit (6.9 bars)* is small compared with the stresses that can be withstood by the
material after it has been hardened by strain (of the order of 150 bars). Thus, quite
a small force in an annealed specimen is sufficient to cause the first trace of plastic
deformation; a strip after recrystallization is about as rigid as a piece of cardboard
of the same thickness. That a permanent change has taken place can be detected by
the fact that, when the external deforming load is taken away, the strip no longer
appears completely dark in a polariscope. Figures 1a and b, plate 5 show a plastically
deformed sheet between crossed nicols. Figure 1a was taken while the external
deforming load was still acting; figure 1b was taken after it had been removed.

The light patches that are observed after removal of the load fall broadly into three
classes. The majority are irregular in shape and, in general, smaller (though not
much smaller) in area than the grains. Some grains, however, are seen to be crossed
by straight bands of light. It is possible to see these bands forming while the external
stress is being applied. At first they are widely spaced but, on further deformation,
new ones form between the old ones and, in addition, grains and parts of grains which
before were still in the elastic range are suddenly traversed by a series of these bands.
We shall call them ‘birefringent bands’. The third type of residual birefringence is
only seen at grain boundaries. The deformed strips contain residual bending moments
and, while such a stress system is invisible in the grains themselves, it shows up, for
reasons described elsewhere (Nye 1948a), at grain boundaries. This bending may
consist, in general, of couples acting about two perpendicular axes lying in the plane
of the sheet. (If the couples are equal and opposite we have the special case of torsion
about an axis in the plane of the sheet.)

Examples of all three types of residual birefringence can be seen in figures 1b, 2a,
plate 5 and 3, plate 6. There are several sets of birefringent bands in these photographs,
and, in figure 3, more than one set in the same grain. Some of the irregular
patches of light have a streaky appearance. Sharp sets of bands are illustrated in
figures 4, 5, plate 7 and 6b, plate 8.

* 1 bar = 10^6 dynes/cm.^2

If the deformation is continued, fine lines begin to appear on the surfaces of the
sheet. In distinction to the birefringent bands and the residual stress patterns
referred to above, these lines are visible without the aid of polarized light. They
resemble the slip bands seen on metallic surfaces which have been polished and then
deformed. Lines of this type are seen in figure 6a, plate 8, which shows the same
area as figure 6b but without the analyzer. They show up in reflected as well as in
transmitted light and examination under oblique illumination makes it clear that
they are fine steps on the surface of the sheet. To estimate their height the
‘shadow-casting’ technique developed for electron microscopy was used. When a beam of
gold was made to strike a sheet at a glancing angle of 6°, subsequent measurements
under an optical microscope of the widths of the ‘shadows’ showed that the most
prominent steps had a height of about 4000 Å.
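The geometry of the shadow-casting estimate can be sketched in a few lines of Python. The sketch assumes a flat surface and a sharp vertical step, so that a beam arriving at glancing angle α leaves a shadow of width w behind a step of height h = w tan α; the function names are illustrative only.

```python
import math


def step_height(shadow_width, glancing_angle_deg):
    """Height of a surface step from the width of its shadow: h = w * tan(alpha)."""
    return shadow_width * math.tan(math.radians(glancing_angle_deg))


def shadow_width(height, glancing_angle_deg):
    """Width of the shadow cast by a step of given height: w = h / tan(alpha)."""
    return height / math.tan(math.radians(glancing_angle_deg))


if __name__ == "__main__":
    w = shadow_width(4000.0, 6.0)  # angstrom units, values quoted in the text
    print(f"a 4000 A step at 6 deg casts a shadow about {w / 1.0e4:.1f} microns wide")
```

A shadow of a few microns is wide enough to measure under an optical microscope, which is why so small a glancing angle is useful.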

It would appear that these steps are due to glide, for, although both the other
fundamental mechanisms of plastic deformation, twinning and kinking, can cause
a surface to be broken up into facets making angles with each other, they cannot, 
by their nature, give rise to steps. The plastic behaviour of silver chloride is thus 
different from that of sodium chloride, which, at room temperature, deforms mainly 
or entirely by kinking (Brilliantow & Obreimow 1937; Orowan 1942). 

If the deformation is by glide it is natural to ask whether the birefringent bands
represent the systems of stress set up in the glide packets between successive glide
zones. If this is so, the fact that the bands vary considerably in sharpness has a
natural explanation; for they should appear sharp only if the glide plane is observed
edgeways on. When it is inclined to the direction of the light the different parts of the
birefringence pattern in each glide packet will overlap and the details will be obscured.
If the angle of inclination is further increased, the stress patterns in different glide
packets will overlap and, although the streakiness in the pattern may not be completely
obliterated, the details will be quite lost. In agreement with this explanation,
it was possible to increase the sharpness of some of the less distinct systems of bands
by rotating the specimen under the microscope about an axis parallel to the bands.
A Leitz graduated tilting stage was used for the purpose. It was not possible to
make the bands in all the grains sharp by rotation, but, since the critical angle for the
refraction of light passing from silver chloride into air is only 28° 52', this is not
surprising. In general, when air is used as the surrounding medium, we could only
expect to be able to rotate to sharpness those bands in which the planes are inclined
to the normal at angles within the range -28° 52' to 28° 52'. In practice, owing to
mechanical difficulties, this range is considerably reduced.
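The limit on tilting follows from Snell's law, and a minimal Python sketch makes the numbers concrete. It takes a critical angle of about 28.9° (28° 52') for silver chloride against air; the refractive index is derived from that angle, not quoted from the paper, and the function names are illustrative.

```python
import math


def index_from_critical_angle(degrees, minutes=0.0, n_outside=1.0):
    """Refractive index implied by a critical angle: n = n_outside / sin(theta_c)."""
    theta = math.radians(degrees + minutes / 60.0)
    return n_outside / math.sin(theta)


def internal_angle_deg(external_tilt_deg, n_inside, n_outside=1.0):
    """Angle to the sheet normal of the ray inside the crystal when the sheet
    is tilted by external_tilt_deg to the light (Snell's law)."""
    s = n_outside * math.sin(math.radians(external_tilt_deg)) / n_inside
    return math.degrees(math.asin(s))


if __name__ == "__main__":
    n = index_from_critical_angle(28, 52)
    print(f"implied refractive index: {n:.3f}")
    for tilt in (15, 30, 45, 60, 89.9):
        print(f"tilt {tilt:5.1f} deg -> internal angle {internal_angle_deg(tilt, n):5.2f} deg")
```

Even at grazing incidence the ray inside the sheet cannot be inclined to the normal by more than the critical angle, so glide planes tilted more steeply than this can never be viewed edge-on in air.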

The connexion between the birefringent bands and the glide lines on the surface 
must now be discussed. Some grains contain a system of birefringent bands with 
a series of glide lines running parallel to them. However, bands are frequently seen 
that are not accompanied by a parallel series of glide lines — this is especially the 
case in stretched or bent sheets — and, conversely, most glide lines do not run parallel 
either to a system of bands or to any streakiness in the birefringence pattern. It can 
be observed, however, that the irregular patches of birefringence which show up 
most brightly in a circular polariscope are those crossed by the most prominent glide 
lines. Figures 5, 6a and 6b provide examples of these facts. The set of bands in the
upper grain of figure 6b has the same direction as the vertical set of glide lines, but
another set of glide lines crosses the same grain in a horizontal direction and does not
appear to affect the birefringence pattern to any great extent. On the other hand,
no sign whatever could be found of any glide lines following the directions of the sets
of bands in figure 5 (although a set of glide lines can be seen crossing them at right
angles). In such cases as this, plastic deformation has taken place in a way that 
leaves no trace on the surfaces; a surface examination of the usual metallographic 
type would have failed to reveal it. 





The explanation of these observations is simple. Birefringent bands without
glide lines occur when the glide direction is parallel, and the glide plane is approximately
perpendicular, to the surface of the sheet. Glide lines without birefringent
bands occur when the glide planes make such large angles with the normal to the
sheet that the birefringence pattern is obscured by overlapping. Experiments on
rectangular bars (see §4(b)), which could be viewed through two surfaces approximately
at right angles, confirmed this.

4. Experiments to determine the glide elements
(a) Experiments on sheets

The next step in the investigation was to find the crystallographic indices of the
glide plane and glide direction. In the experiments now to be described the crystallographic
orientations of six grains showing particularly sharp birefringent band
systems were found by taking X-ray Laue photographs. These grains were part of 
deformed polycrystalline sheets. Two of them are illustrated in figure 5 (strip no. 4) 
and the other four in figure 4 (strip no. 19). It is unfortunate that the stress system 
which produced them is not known. The sheets had already been bent between the 
fingers and then the bands appeared quite suddenly as the strips were being mani- 
pulated on a microscope slide. They were possibly the result of a combination of 
bending and torsion. Band systems of this sharpness rarely occur in strips pulled 
in tension. 

The camera used was a Unicam universal instrument arranged for back-reflexion.
The X-ray beam was made sufficiently narrow (~0.5 mm.) to fall on only one grain
at a time, and this made it possible to take separate back-reflexion Laue photographs
of each of the six chosen grains. The X-ray reflexions from some of the grains were
sharp, being about 0.6 mm. in width with a crystal to film distance of 26 mm., but
often they were drawn out to lengths of 1 to 2 mm. Occasionally the spots were
noticeably double, with separations of from 0.6 to 1 mm.; this corresponds to
rotations of about 0.6 to 1°. The indices of the spots were found by the method
described by Greninger (1935), and a stereographic projection was plotted for each
grain showing the positions of the poles of the crystallographic planes in relation to
the faces and edges of the strip in which it was situated. Each grain in turn was then
rotated under the microscope by means of the tilting stage until the bands in it
appeared to be most clearly resolved. By noting the angle of tilt and allowing for
refraction at the surface the inclination of the birefringent lamellae was calculated.
In this way the pole, P, of the glide plane was plotted on the stereographic projection.
Table 1 gives the various angular relationships for each set of bands, derived with the
aid of a stereographic net of radius 15 cm. Each line of the table refers to one set of
birefringent bands, specified in column 3. In column 4 is entered the crystallographic
plane of low indices having its pole nearest to P. The agreement between glide plane
and crystal plane can best be judged from the angles α and β in columns 5 and 6.
The figures given for α show that the glide planes are in each case (except for the
doubtful set C in grain 1 of strip 19) close to either a {111} plane or a {112} plane.
If the planes were really coincident with these crystallographic planes, however, the
accuracy of the experiments would lead to angles of 1 or 2° at the most for α. The lack
of close agreement between the angles γ and δ also shows that the glide plane is not
exactly crystallographic.
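
The correspondence between the separation of a doubled spot and the lattice rotation can be checked roughly. The sketch below assumes the simplest geometry, which the text does not spell out: a spot near the centre of the back-reflexion film, and the usual rule that rotating a reflecting plane by δ turns the reflected beam by 2δ, so that the separation is about 2Dδ for a film distance D. The function name is hypothetical.

```python
import math


def lattice_rotation_deg(spot_separation_mm: float, film_distance_mm: float) -> float:
    """Small-angle estimate of the rotation between two parts of a grain.

    Assumes separation = 2 * D * delta (delta in radians), which holds only for
    spots close to the centre of a back-reflexion photograph.
    """
    return math.degrees(spot_separation_mm / (2.0 * film_distance_mm))


if __name__ == "__main__":
    for separation in (0.6, 1.0):
        print(separation, "mm ->", round(lattice_rotation_deg(separation, 26.0), 2), "deg")
```

Under these assumptions the separations quoted give rotations of the same order as those stated in the text.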


Table 1. The crystallographic orientations of the glide planes giving
birefringent bands in plastically deformed sheets of silver chloride

1      2      3                  4         5      6      7      8      9      10     11
                                                                                     surface lines
              set of             nearest   angles in degrees                         in same
strip  grain  birefringent       crystal                                             direction
no.    no.    bands              plane     α      β      γ      δ      ζ      ε      as bands

4      A      -                  (111)     8.1    0.0    8.3    0.2    0.6    6.5    no
4      B      -                  (111)     3.0    0.3    3.1    0.1    0.0    5.8    no
19     1      A (short bands)    (111)     2.2    1.0    4.2    6.2    0.5    42.0   yes
19     1      B (long bands)     (112)     2.3    2.0    2.2    4.1    1.0    28.0   yes
19     1      C*                 (100)     4      0.7    9.1    5      0.5    11.0   yes
19     2      A                  (111)     2.0    2.0    2.0    2.1    2.0    4.2    no
19     2      B†                 (112)     4.6†   4.3    14     14.8†  5.0†   16.2   faint trace
19     3      -                  (111)     2.7    0.0    7.5    4.8    0.5    4.9    no
19     4      -                  (111)     4.5    1.3    3.5    1      1.2    3.4    no

α = angle between P and the pole of the plane given in column 4.
β = angle between bands and the trace which the plane given in column 4 makes on the surface.
γ = angle between normal to the surface and the plane given in column 4.
δ = angle between normal to the surface and the glide plane.
ζ = angle between glide plane and the nearest <110> direction.
ε = angle between this <110> direction and the surface.

* This set faint and barely measurable.

† Measurements of δ, and therefore of α and ζ, were rough for this set.

An interesting light is thrown on this fact by the values of the angle ζ given in
column 9. ζ is the measured angle between the glide plane and the <110> direction
lying closest to it. It will be noticed that the unusually high value of 5.0° occurs for
a set of bands on which only rough measurements could be made, while the rest of
the figures in this column are less than, or equal to, the experimental error. We
conclude that, although the glide planes do not coincide with crystallographic planes,
they always contain a <110> direction.

Figure 8. Stereographic projection showing the poles of the glide plane
in the grains of silver chloride sheets.

Proc. Roy. Soc. A, volume 198, plate 5

Figure 1a. Silver chloride deformed plastically by tension applied in horizontal direction.
Crossed nicols. Vibration directions parallel to cross-wires. (Magn. × 23.)

Figure 1b. The same but with external tensile load removed. (Magn. × 23.)

Figure 2a. Silver chloride sheet after plastic deformation. Crossed nicols. Vibration directions
at 45° to cross-wires. (Magn. × 19.)

Figure 2b. The same after heating the sheet to 355° C and allowing to cool in the furnace.
(Magn. × 19.)

Proc. Roy. Soc. A, volume 198, plate 6

Figure 7. Birefringent bands. Crossed nicols.

Figure 5. Strip no. 4. Measurements on the bands in grains A and B
are given in table 1. (Magn. × 147.)

Figures 4, 5. Birefringent bands in sheets of silver chloride seen between crossed nicols.
Vibration direction of nicols parallel to cross-wires.

Figure 6a. Showing glide lines and grain boundaries on the surface of a silver chloride sheet.
Portions of four grains are visible. Polarizer in position but no analyzer. (Magn. × 97.)

Figure 6b. The same as figure 6a but seen in a circular polariscope. (Magn. × 97.)

This is shown graphically by the stereographic projection in figure 8. The plotted
points are the poles of the glide planes. The <110> direction lying closest to the glide
plane has in each case been assigned the indices [1̄10] and all the points have been
arranged to fall within one octant of the primitive circle. Such a procedure amounts
only to a suitable choice of x, y and z axes in each grain. (When a grain contains
more than one system of glide bands the x, y and z axes have to be relabelled to deal
with each set.) It will be seen that the points do not cluster in one position but are
distributed near the great circle (straight line) OA, which is the locus of the poles
of planes lying in the [1̄10] zone.
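
The zone condition used here is easy to verify numerically: a plane contains the direction [1̄10] when its pole is perpendicular to that direction. The sketch below is an illustration only, with hypothetical function names; it computes the angle between a plane and a direction (the quantity tabulated as ζ) for some low-index planes, and the angular distance between two poles in a cubic crystal.

```python
import math


def angle_between_deg(u, v):
    """Angle between two vectors given in cubic crystal axes."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return math.degrees(math.acos(max(-1.0, min(1.0, dot / norm))))


def plane_to_direction_deg(pole, direction):
    """Angle between a plane (specified by its pole) and a direction."""
    return abs(90.0 - angle_between_deg(pole, direction))


GLIDE_DIRECTION = (-1, 1, 0)  # the [1̄10] direction

if __name__ == "__main__":
    for pole in [(0, 0, 1), (1, 1, 2), (1, 1, 1), (2, 2, 1), (1, 1, 0)]:
        print(pole, round(plane_to_direction_deg(pole, GLIDE_DIRECTION), 3))
    print("(111) to (221):", round(angle_between_deg((1, 1, 1), (2, 2, 1)), 2))
    print("(111) to (112):", round(angle_between_deg((1, 1, 1), (1, 1, 2)), 2))
```

All five planes listed contain [1̄10], so their poles lie on one great circle; {111} lies between {112} and {221} on it, roughly 19° and 16° away respectively.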

The results shown in the last two columns of table 1 will be referred to later.

(b) Experiments on rectangular bars

The experiments of which details are given in table 1 had necessarily to be confined
to grains containing band systems that could be made sharp by tilting the
sheet, for only in these cases was it possible to determine the true orientation of the
glide plane. Experiments have also been made on specimens in which the true
orientation of the glide planes could be found by measuring the direction of their
traces on two non-parallel surfaces, regardless of whether any birefringent bands
could be made visible or not. These specimens were in the form of rectangular bars
of nearly square cross-section (measuring approximately 26 × 1 × 1 mm.) cut from
a rolled sheet. After the four main surfaces had been polished by a microtome method,
the bars were recrystallized by heating for 2 hr. at 400° C, cooled slowly in the
furnace, and then the scratches left by the microtome were removed by dipping the
bars for 15 sec. in a strong solution of sodium thiosulphate. This treatment revealed
the boundaries of the recrystallized grains, and considerable portions of the specimens
were seen, under the microscope, to consist of single chains of large grains. That is
to say, each bar was equivalent to a succession of single crystals joined up end to end.
A further series of back-reflexion Laue photographs was taken to find the crystallographic
orientation of some of the glide planes produced by extension of these bars.
Since the direction of the tension was known, such experiments also gave the possibility
of studying the relation between the glide plane and the stress system. The
results are shown in tables 2 and 3.

Six sets of active glide planes were examined. The four grains in which they lay
were all from two to three times as long as they were broad, the lengths being measured
parallel to the axes of the bars. The tension in the central parts of these grains
where the measurements were made would therefore have been reasonably uniform.
The pole of each set of glide planes was found, with the aid of a stereographic net,
by measuring the direction of the traces on at least two surfaces (non-parallel). In
some cases the directions of the traces on all four surfaces were measured. The X-ray
photographs were used, as before, to plot the positions of the poles of the crystallographic
planes on the same stereographic projection. For the first four sets of glide
planes given in table 2 the nearest crystallographic plane of low indices was of the
form {111}; values of the angle, α, between this plane and the observed glide plane
are given in column 5. The last two sets were nearer to {221} than {111} planes, but
the angles with both planes are given in the table for comparison. The experimental
error in the determination was due mainly to variation in the directions of some of
the slip lines across the grains and to the distortion of the faces and edges of the grains;
the latter caused uncertainty in the setting of the specimens in the X-ray camera
and on the rotating and tilting stage of the microscope. The errors from these sources
would be expected to be about ± 1.5°. The angles given in column 5, therefore, make
it certain that the glide planes, although near to crystallographic planes, are again
not actually coincident with them. Nevertheless, in agreement with the experiments
on the sheets, the values of ζ in column 6 are all much smaller than those of α and it
seems permissible to conclude that the glide plane, although not accurately {111}
or {221}, always contains a <110> direction.

Table 2. The crystallographic orientation of glide bands in tensile
bars of approximately square cross-section

1      2       3        4          5             6      7
               set of   nearest    angles in degrees
bar    grain   glide    crystal
no.    no.     bands    plane      α             ζ      η

3      1       A        (111)      6.0           0.0    3.0
3      2       A        (111)      4.0           0.6    2.0
3      2       B        (111)      6.3           1.6    1.8
4      1       A        (111)      4.0           0.2    8.9
4      2       A        (111)      [missing]     0.0    0.7
                        (221)      4.6
4      2       B        (111)      [illegible]   2.3    6.7
                        (221)      4.7

α = angle between glide plane and the nearest crystal plane (column 4).
ζ = angle between glide plane and the <110> direction lying closest to it.
η = angle between glide plane and the plane of maximum resolved shear stress containing the
glide direction.

Figure 9. Stereographic projection showing the orientation of the glide planes in the
rectangular bars. ○ Pole, P, of glide plane. ● Pole, P', of plane containing the glide direction
and subject to the maximum resolved shear stress.

Figure 9 is plotted on the same basis as figure 8. The open circles, which show the 
poles of the glide planes, are again distributed close to OA. 
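
The two-surface trace method used for the bars amounts to a vector product: the glide plane contains both trace directions, so its pole is perpendicular to each of them. The sketch below illustrates this with a made-up example; the function names and the choice of bar faces (z = 0 and y = 0) are assumptions for illustration, and in the experiments the construction was done graphically on a stereographic net.

```python
import math


def cross(u, v):
    """Vector product of two 3-vectors."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


def glide_plane_pole(trace_a, trace_b):
    """Unit normal of the plane containing two non-parallel trace directions."""
    n = cross(trace_a, trace_b)
    length = math.sqrt(sum(c * c for c in n))
    if length < 1e-12:
        raise ValueError("traces are parallel; the plane is not determined")
    return tuple(c / length for c in n)


if __name__ == "__main__":
    # Hypothetical traces of one plane on the faces z = 0 and y = 0 of a bar.
    trace_on_top = (1, -1, 0)
    trace_on_side = (-1, 0, 1)
    print(glide_plane_pole(trace_on_top, trace_on_side))
```

For the traces chosen the pole is parallel to (1, 1, 1) apart from sign, which is immaterial for a plane. Traces measured on two parallel faces would give no information, which is why non-parallel surfaces are required.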

(c) Pencil glide 

These results suggest a comparison between the plastic behaviour of silver
chloride and that of α-iron. Taylor & Elam (1926) found that in α-iron at room
temperature, although the direction of glide was crystallographically determined, the
glide plane was not; it depended on the state of stress. This peculiar behaviour of
iron has been studied by other investigators, including Barrett, Ansel & Mehl (1937),
and appears to be connected with the fact that the [111] direction is common to
three different glide planes, not crystallographically equivalent, but each having the
same critical shear stress at room temperature. Glide takes place in the <111>
direction with a movement like that obtained by shearing a bundle of pencils in
the direction of their length.

In view of this the natural course was to find whether the <110> direction which
lay in the glide plane of silver chloride was in fact the glide direction. We find indications
of this in the experiments on the sheets. Column 10 of table 1 gives the angle,
ε, which this <110> direction makes with the surface of the sheet. If it is the glide
direction there should be no surface steps when it lies in the plane of the sheet and,
other things being equal, the steps would be expected to increase in height as it
makes increasingly large angles with the surface. It can be seen, by comparing
columns 10 and 11, that whenever ε is small there are no surface lines running in
the same direction as the birefringent bands and, conversely, whenever ε is large
surface lines are observed. The hypothesis that <110> is the glide direction is thus
strengthened.

It is worth remarking that, since the <110> direction in a crystal of silver chloride
is parallel to rows of like ions, it would be surprising if it were not the glide direction.

A satisfactory theory of pencil glide has still to be found and this is not the place
for speculation on the subject. We will content ourselves by citing the work of
Elam (1936) on β-brass and that of Andrade and his collaborators on slip in the
body-centred cubic metals (Andrade 1938; Andrade & Chow 1940).


5. The relation between the glide elements and the applied stress

We now come to the relation between the direction of the tension and the glide
elements in the rectangular bars. We first investigate the connexion between the
axis of tension and the <110> direction lying in the glide plane. (A plane of maximum
shear stress makes an angle of 45° with the tension and the direction of the shear
stress in such a plane also makes 45° with the tension axis.) Table 3 gives the values
of |θ − 45°|, where θ is the angle between a <110> direction and the axis of tension,
for all the crystallographically equivalent <110> directions in the four grains whose
orientations were known from the X-ray photographs. The values referring to the
<110> directions that lie in the observed glide planes, and which we have tentatively
assumed to be the operative glide directions, are shown in heavy type; in each case
they are seen to be the lowest values in their column. (For the purposes of table 3
an arbitrary choice of x, y and z axes was made in each grain. The axes are therefore
not those used in plotting the poles in figure 9, where the axes were always arranged
so as to make [1̄10] the glide direction.)

Table 3. Showing how the six <110> directions are related
to the direction of the tension

                     values of |θ − 45°| (in degrees)
                 bar no. 3                     bar no. 4
direction    grain 1      grain 2          grain 1      grain 2

[110]        13.7         23.8             29.0         10.4 B
[11̄0]        6.0 A        23.4             22.7         6.5 A
[101]        23.5         [illegible] A    6.8 A        15.2
[101̄]        38.9         7.5 B            20.8         13.4
[011]        20.6         [illegible]      37.2         28.5
[011̄]        17.8         26.5             14.3         41.2

Angles corresponding to active glide directions are shown in heavy type and labelled with
the letter corresponding to the glide band system.


We are thus led to the view that glide takes place in those <110> directions which
lie most nearly at 45° to the axis of tension.

The glide plane must, of course, contain the glide direction, and in Taylor & Elam's
work on iron it was found to be close to that one, out of the zone of planes containing
the glide direction, on which the maximum shear stress, resolved in the glide direction,
acted. In a crystal under uniaxial tensile stress σ the shear stress, τ, on a glide
plane resolved in the glide direction is given by

        τ = σ cos φ cos λ,

where φ is the angle between the normal to the glide plane and the axis of tension,
and λ is the angle between the glide direction and the axis of tension.

If λ and σ are fixed, τ is a maximum when φ is a minimum. But φ is least when the
normal to the glide plane lies in the same plane as the glide direction and the axis of
tension. This determines the position of the glide plane in an ideal case in which the
above criterion is exactly satisfied.
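
The criterion above can be written out with vectors. The sketch below is an illustration under stated assumptions: the glide direction and tension axis are given in the same Cartesian (cubic crystal) axes, and the function names are hypothetical. The ideal pole is the component of the tension axis perpendicular to the glide direction, which lies in the plane of the two and makes φ as small as possible.

```python
import math


def unit(v):
    length = math.sqrt(sum(c * c for c in v))
    return tuple(c / length for c in v)


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def ideal_pole(glide_direction, tension_axis):
    """Pole of the plane that contains the glide direction and carries the
    greatest resolved shear stress in that direction."""
    d, t = unit(glide_direction), unit(tension_axis)
    projection = dot(t, d)
    n = tuple(tc - projection * dc for tc, dc in zip(t, d))
    if math.sqrt(dot(n, n)) < 1e-12:
        raise ValueError("tension axis is parallel to the glide direction")
    return unit(n)


def resolved_shear_stress(sigma, pole, glide_direction, tension_axis):
    """tau = sigma * cos(phi) * cos(lambda) for uniaxial tension."""
    t = unit(tension_axis)
    cos_phi = abs(dot(unit(pole), t))
    cos_lambda = abs(dot(unit(glide_direction), t))
    return sigma * cos_phi * cos_lambda


if __name__ == "__main__":
    pole = ideal_pole((1, 1, 0), (1, 0, 0))
    print(pole, resolved_shear_stress(1.0, pole, (1, 1, 0), (1, 0, 0)))
```

In this made-up example (tension along a cube axis, glide direction [110]) the ideal plane is of the form {110} and the resolved shear stress is half the tensile stress; any other plane of the same zone gives a smaller value.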

We may test how closely the condition is satisfied by constructing the pole, P',
of this ideal plane. The positions of P' for each of the six glide systems are shown
by the filled-in circles in figure 9. The results of measuring the angle, η, between
P' and P, the pole of the observed glide plane, are tabulated in column 7 of
table 2. It will be seen that the correspondence between P and P' is not exact,
but nevertheless quite close, the maximum divergence being 8.9°. It is apparent
from figure 9 that the observed glide plane tends to deviate from the position of
maximum resolved shear stress towards a position lying between the (221) and
(111) planes. It is therefore likely that the critical resolved shear stress for glide
is not quite the same for all the planes in the [1̄10] zone, and that it is a minimum
for a plane somewhere between (221) and (111). This may explain why the points
in figure 9 are only found near the central portion of the line OA; for the grains
with orientations that would have given glide planes at other points of OA were
'harder' than those that gave poles in the central region; and, since only grains with
well-developed glide lines were chosen for measurement, these hard grains, which
may not have slipped at all, were missed from the sample.

The only other determination of the glide elements in silver chloride known to me
is that of Stepanow (1934, 1935). When single crystals were extended by simple
tension he found glide in a <110> direction on planes within 6° of {110}. This result
is in agreement with the conclusions reached here, for in Stepanow's experiments
the crystals happened to be prepared with a {100} plane perpendicular to the axis
of tension and, with this particular orientation, it is easy to see that the possible
glide plane on which the maximum resolved shear stress acts is crystallographic:
{110} in fact.


6. The corrugated nature of the glide surfaces

It has been mentioned that in flat strips of silver chloride either bent or extended
by tension birefringent band systems frequently occurred unaccompanied by any
glide lines. The reason for this is now clear. The planes of maximum shear stress
envelop a cone of semi-vertical angle 45° whose axis is the direction of the tension
(or compression). Out of this group of planes, those which lie perpendicular to the
plane of the sheet will tend to glide in a direction parallel to the surface of the sheet.






Figure 10. Deformation of silver chloride bar by simple shear.

There will thus be a tendency for sharp band systems without surface steps to form
at 45° to the axis of tension. When glide lines do occur with such band systems they
are invariably straight. This is in marked contrast with the wavy glide lines seen in
grains in which the slip direction makes a considerable angle with the intersection of
glide plane and surface. These observations indicate that the glide surface is not
quite plane, but is corrugated in such a way that sections parallel to the glide direction
are straight lines, while sections perpendicular to this are lines of a fixed general
direction, but wavy in detail. Similar irregularities in the glide surfaces have been
observed in other materials, notably in α-iron (Taylor & Elam 1926), although the
phenomenon in this metal may depend on the impurities (Andrade 1938), and under
certain conditions in mercury (Greenland 1937).

Some other experiments confirmed this corrugated nature of the glide surfaces.
When rectangular bars of the type already described were deformed by simple
shear by holding them in tweezers in the way shown in figure 10, they sometimes
gave glide planes perpendicular to the axis of the bar. This only happened in favourably
oriented crystals, for one would not expect that a <110> direction would always
be found in the direction of shear. However, the experiment could be performed on
many different crystals in the bar, and when nine different bars were tested in this
way a number of such glide systems were produced. In all cases the glide lines in
the direction of slip were quite straight, while those on the face perpendicular to the
direction of slip were wavy and in many cases divided into several branches.

It may be noted at this point that, on the theory of pencil glide presented by Taylor
& Elam and adopted here, the observation of glide surfaces that approximate to
planes is a consequence of a uniform plastic strain in the crystals. (The small scale
non-uniformity introduced by the fact that glide does not take place on every atomic
plane, but is concentrated in zones, is a much smaller effect than those we are now
considering.) If glide took place by equal amounts on a series of equally spaced
cylindrical surfaces of more complicated type than planes, the resulting large-scale
distortion of the crystal would not be a uniform simple shear. Conversely, a non-uniform
distribution of stress would be expected to produce curved glide lines on
the surface of the crystal. Such curved glide lines could always be seen in a silver
chloride sheet near places where the stress had been inhomogeneous, near an indentation,
for instance, or in the distorted region left by the razor blade used for
cutting the sheet. Each glide line corresponds, at all points of its length, to one
glide direction, and it terminates at the points in the crystal where this direction
ceases to be the one corresponding to the most favourable plane for slip.


7. Effect of grain boundaries on slip propagation

On the hypothesis of pencil glide, the progress of plastic deformation in a polycrystalline
specimen of silver chloride (or α-iron) might be expected to occur as
follows. At first all grains are within their elastic limits. When the stress is increased,
the critical resolved shearing stress for glide is reached first on a particular set of
planes in one particular grain, which we denote by A. This grain will slip and, since
its surface is now stepped, it will exert non-uniform stresses on its neighbours B, C, D,
etc. If the applied stress is further increased, the part of B, for instance, which is
situated near the surface of contact with A will either remain purely elastically
strained, or will slip plastically in such a way as to conform to the stepped surface
of A. In a material which was capable of glide only on crystallographic planes this
could not be achieved in a simple way for the well-known reason that no potential
glide plane of B would intersect an operative glide plane of A in a line that lay in
the grain boundary. In crystals capable of pencil glide, however, conformity at the
boundaries can be achieved without difficulty. The glide surface in the grain B, for
instance, would be determined by the following construction. If XY is the line of
intersection of one of the operative glide planes in A with the boundary between
A and B, draw through each point of XY lines whose directions are the possible
glide directions in B, six in number if the glide direction is <110> (and four if it is
<111>). These lines will be the generators of six (or four) surfaces on which glide is
geometrically possible. The surfaces will be planes if the grain boundary is itself
a plane and glide in A has taken place on a plane, but, in general, they will be
cylindrical surfaces of complicated form. In the immediate neighbourhood of the
boundary the resolved shear stress acting on each of these glide systems will be
largely due to the alternating stresses caused by contact with the stepped surface
of A. That one will operate on which the resolved shearing stress is greatest, but it
may, of course, be necessary to raise the external stress before this happens. In the
centre of B the stress system will be disturbed from the configuration it had before
grain A slipped not so much by these alternating stresses as by the macroscopic
change of shape which grain A has suffered during the slip process. In general, then,
a different slip system will tend to operate in the centre of the grain from that needed
near its surface. The same considerations apply to A's other neighbours C, D, etc.,
and similar processes will occur throughout the whole specimen.
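
For the simple case in which the boundary and the glide surface in A are both plane, so that XY is a straight line, the construction described above reduces to finding the planes that contain XY and one of the glide directions of B. The sketch below is an illustration only: it assumes that the direction of XY has already been expressed in the crystal axes of grain B (the input `xy_in_b` is hypothetical), and the function names are not from the paper.

```python
import math

# The six <110> glide directions of grain B, in the crystal axes of B.
GLIDE_DIRECTIONS_110 = [(1, 1, 0), (1, -1, 0), (1, 0, 1), (1, 0, -1), (0, 1, 1), (0, 1, -1)]


def cross(u, v):
    """Vector product of two 3-vectors."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


def candidate_glide_planes(xy_in_b, glide_directions=GLIDE_DIRECTIONS_110):
    """Map each glide direction to the unit pole of the plane generated by
    drawing that direction through every point of the straight line XY.
    A direction parallel to XY generates no surface and is omitted."""
    planes = {}
    for d in glide_directions:
        n = cross(xy_in_b, d)
        length = math.sqrt(sum(c * c for c in n))
        if length > 1e-12:
            planes[d] = tuple(c / length for c in n)
    return planes


if __name__ == "__main__":
    for direction, pole in candidate_glide_planes((0, 0, 1)).items():
        print(direction, tuple(round(c, 3) for c in pole))
```

The selection among these candidates would then follow the rule stated in the text: that system operates on which the resolved shearing stress is greatest. With a curved boundary or a curved glide surface in A the candidates become cylindrical surfaces and this planar sketch no longer applies.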

These views, while partly conjectural, find support in the present experiments.
For instance, the grain immediately below the intersection of the cross-wires in
figure 2a, plate 5 contains a sharp set of birefringent bands and at the same time its
neighbour on the left is stressed non-uniformly down its right-hand edge by the
translations that have taken place parallel to the bands. The same effect is seen in
figure 7, plate 6 at the boundary marked XX. (For these photographs, the external
stress was removed.) Many other instances of this effect have been seen.

Again, the frequent occurrence of glide lines which do not stop at grain boundaries
but continue across them without a break is a natural consequence of the property
of pencil glide. It is difficult to conceive such an event, except as a coincidence, in
crystals not capable of pencil glide. The effect is well seen in figure 6a, plate 8.
Another instance of a similar event is provided by figure 5, plate 7. Here there are
two grains A and B separated by the boundary XX. The birefringent bands in each
grain are in step as they cross XX, but it can be seen that they turn through a sharp
angle. Referring to the values of δ in lines 1 and 2 of table 1, we see that both sets
of birefringent lamellae are very nearly perpendicular to the surface. (This is also
approximately true of the boundary.) This again would be inexplicable, except as a
coincidence, with the usual glide mechanism.

It may also be noticed that grain B in this photograph contains several glide zones
that end in the centre of the grain. One example is marked with a circle.


8. Types of residual stress within grains

The main types of residual stress seen in the silver chloride sheets may be briefly
listed.

(1) A system of stresses set up between the glide zones of each grain and alternating
with a period equal to the spacing of the glide zones. A detailed analysis of
these is given in part II of this series (Nye 1949). They are produced in both single
crystal and polycrystalline specimens and can reasonably be attributed to the
presence of dislocations trapped in the glide zones.

(2) The alternating stresses produced near the grain boundaries at each end of 
a system of glide zones in a slipped grain. 

(3) ‘ Heyn stresses ’ produced by non-uniformity of plastic deformation from grain 
to grain (see, for example, Seitz 1943). (This category includes the residual bending 
moments referred to in § 3.) 




9. The annealing process

A series of experiments has been performed with the object of studying the
progress of annealing in a cold worked silver chloride sheet. The thermal expansion
of cubic crystals is isotropic and, provided the temperature of the sheet were always
uniform, no stresses due to the different expansions of the grains, such as are found
in polycrystalline specimens of non-cubic materials, would be expected.

The sheet shown in figure 2a was heated to 355° C and allowed to cool in the
furnace. Under these conditions recrystallization did not occur and the sheet
appeared as shown in figure 2b. There was no alteration in the position of the light
places, but only a reduction in their intensity. This shows that annealing does not 
bring about any large-scale redistribution of stress, but only a general reduction of 
its magnitude. After a further period of heating of f hr. at 415° C the pattern had 
almost disappeared. It was essential to cool slowly from the high temperature if 
a strip was to be obtained free from internal stresses. The thermal gradients set up 
by removing a specimen directly from the furnace at 400° C to the air of the room 
caused internal stresses and plastic deformation. It is true that silver chloride is 
a bad conductor of heat, but the photoelastic method is more sensitive than a surface 
examination in revealing plastic deformation. This experiment illustrates the care 
that must be taken in the annealing even of cubic materials if a specimen free from 
internal stresses is to be obtained at room temperature.

The use of silver chloride for the investigations described in this part and part II 
was suggested by Dr E. Orowan. I should like to thank him warmly for allowing me 
to develop his idea and for his help and counsel during the work. 

The experiments were financed by the British Iron and Steel Research Association. 


References

Andrade, E. N. da C. 1938 Proc. Roy. Soc. A, 168, 310.

Andrade, E. N. da C. & Chow, Y. S. 1940 Proc. Roy. Soc. A, 175, 290.

Barrett, C. S., Ansel, G. & Mehl, R. F. 1937 Trans. Amer. Soc. Metals, 25, 702.

Brewster, D. 1818 Trans. Roy. Soc. Edinb. 8, 167.

Brewster, Sir David 1853 A treatise on optics, pp. 280, 281. London: Longman, Brown,
Green and Longmans.

Brilliantow, N. A. & Obreimow, I. W. 1934 Phys. Z. Sowjet. 6, 687 (in English).

Brilliantow, N. A. & Obreimow, I. W. 1937 Phys. Z. Sowjet. 12, 7 (in English).

Elam, C. F. 1936 Proc. Roy. Soc. A, 153, 273.

Greenland, K. M. 1937 Proc. Roy. Soc. A, 163, 28.

Greninger, A. B. 1935 Trans. Amer. Inst. Min. (Metall.) Engrs, 117, 61.

Nye, J. F. 1948a Nature, 161, 367.

Nye, J. F. 1948b Nature, 162, 299.

Nye, J. F. 1949 In course of preparation.

Obreimow, I. W. & Schubnikoff, L. W. 1927 Z. Phys. 41, 907.

Orowan, E. 1942 Nature, 149, 643.

Reusch, E. 1867 Ann. Phys., Lpz., (v), 132, 441; Proc. Roy. Soc. Edinb. 6, 134.

Seitz, F. 1943 The physics of metals, p. 147. New York: McGraw-Hill.

Stepanow, A. W. 1934 Phys. Z. Sowjet. 6, 312 (in English).

Stepanow, A. W. 1935 Phys. Z. Sowjet. 8, 26 (in German).

Taylor, G. I. & Elam, C. F. 1926 Proc. Roy. Soc. A, 112, 337.



One-dimensional dislocations. 

I. Static theory 

By F. C. Frank and J. H. van der Merwe
H. H. Wills Physical Laboratory, University of Bristol

(Communicated by N. F. Mott, F.R.S. - Received 22 December 1948 - Revised 26 March 1949 - Read 19 May 1949)

The theory of a one-dimensional dislocation model is developed. Besides acting as a 
pointer to developments of general dislocation theory, it has a variety of direct physical 
applications, particularly to monolayers on a crystalline substrate and to conditions in the 
edge row of a terrace of molecules in a growing crystal. Allowance is made in the theory for 
a difference in natural lattice-spacing between the surface layer or row and the substrate. 

The form and energy of single dislocations and of regular sequences of dislocations are
calculated. Critical conditions for spontaneous generation (or escape) of dislocations are
determined, and likewise the activation energies for such processes below the critical limits.
Various physical applications of the model are discussed, and the physical parameters are
evaluated with the aid of the Lennard-Jones force law for the above-mentioned principal
applications.

1. Introduction

The theory of dislocations is moderately complex even with extensive approximations
and unreal restrictions such as that to a simple cubic lattice, and there remain
many problems, e.g. of dislocation dynamics, which have only been treated qualitatively
even with these simplifications. It has therefore seemed worth while to us
to make a more extensive survey of the properties of dislocations in a still more
drastically simplified model, the simplest which can display dislocations at all. There
are several different mechanical systems, all leading to the same equations. We choose
one of them to set up the fundamental equation. It consists of a number of identical
balls connected by identical springs and arranged in a straight line to which their
motion is constrained, the balls being at the same time acted on by a force which
varies periodically along the straight line. In the first instance, we take the periodic
field to be sinusoidal. This is realized to a close approximation if the chain of balls
and springs rests in a long horizontal frictionless trough with vertical side walls and
a sinusoidal corrugation of low amplitude on the bottom of the trough, so that the
periodic field is provided by gravity. We may hereafter refer indiscriminately to the
balls as 'atoms' and to the source of the periodic field as the 'substrate', though this
does not express the only application of the equations to the physics of solids. Certain
aspects of this or equivalent systems have been treated by Dehlinger (1939),
Frenkel & Kontorowa (1938) and Lennard-Jones (1940). For brevity we derive
the results ab initio.

2. The difference equation

If $a$ is the 'wave-length' of the potential due to the substrate and $\tfrac12 W$ its amplitude,
$b$ is the spacing between atoms of the chain when their connecting springs are unstrained,
$\mu$ is the force constant of these springs, and $x_n$ is the displacement of the
$n$th atom from the $n$th trough of the substrate potential, each numbered in sequence
from an arbitrary commencement, then the potential energy corresponding to $N$
atoms of the chain is

$$V = \tfrac12\mu\sum_n (x_{n+1} - x_n + a - b)^2 + \tfrac12 W\sum_n (1 - \cos 2\pi x_n/a).$$

This may be written

$$V = W l_0^2 \sum_n (\xi_{n+1} - \xi_n - 1/P_0)^2 + \tfrac12 W \sum_n (1 - \cos 2\pi\xi_n), \tag{1}$$

if we introduce the abbreviations

$$\xi_n = x_n/a, \qquad P_0 = a/(b-a), \qquad l_0 = (\mu a^2/2W)^{1/2}.$$

$\xi$ is here the displacement measured in units of the substrate spacing; $P_0$ may be
called the vernier period of misfit; the significance of $l_0$ will become evident later.
The condition for equilibrium of the $n$th atom is $\partial V/\partial\xi_n = 0$, whence by differentiation
of (1)

$$\Delta^2\xi_n = (\pi/2l_0^2)\sin 2\pi\xi_n. \tag{2}$$

Here we employ central difference notation

$$\Delta^2\xi_n = \xi_{n-1} - 2\xi_n + \xi_{n+1}.$$

Solutions of equation (2) can be constructed numerically from specified displacements
of any two successive atoms. Some simple symmetrical solutions can be found
by elementary algebra, e.g. that of figure 1 corresponding to

1111 = { 1 - 8“ 27rCi (0 < Cl < i) • 



Figure 1
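The statement that solutions of equation (2) can be built up from the displacements of any two successive atoms can be illustrated by rearranging (2) as a forward recurrence. The following is a minimal sketch only; the function name `iterate_chain` and the starting values are assumptions made for illustration and do not come from the paper.

```python
import math

def iterate_chain(xi_prev, xi_curr, l0, steps):
    """Build a solution of the difference equation (2) by forward recurrence.

    Equation (2), xi[n-1] - 2*xi[n] + xi[n+1] = (pi / (2*l0**2)) * sin(2*pi*xi[n]),
    is solved for xi[n+1] in terms of the two preceding displacements.
    """
    coeff = math.pi / (2.0 * l0 ** 2)
    xis = [xi_prev, xi_curr]
    for _ in range(steps):
        nxt = 2.0 * xis[-1] - xis[-2] + coeff * math.sin(2.0 * math.pi * xis[-1])
        xis.append(nxt)
    return xis

if __name__ == "__main__":
    # Illustrative (assumed) start: one atom in a trough, the next displaced by 0.1.
    for n, xi in enumerate(iterate_chain(0.0, 0.1, l0=2.0, steps=6)):
        print(n, round(xi, 4))
```

Because the recurrence is only a rearrangement of (2), any sequence it produces satisfies the equilibrium condition at every interior atom; whether the ends are in equilibrium is a separate question, taken up in § 4.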


3. The differential equation and its solutions

If $\xi_{n+1}$ and $\xi_{n-1}$ are expanded in Taylor series about the point $n$ one obtains

$$\Delta^2\xi_n = \frac{d^2\xi}{dn^2} + \frac{2}{4!}\frac{d^4\xi}{dn^4} + \dots$$

When $l_0$ is large, i.e. the springs are relatively strong, $\Delta^2\xi_n = (\pi/2l_0^2)\sin 2\pi\xi_n$ will be
small, and one is justified to a first approximation in neglecting differential coefficients
of higher orders. Hereafter we shall use, for the condition of equilibrium, the
differential equation

$$d^2\xi/dn^2 = (\pi/2l_0^2)\sin 2\pi\xi, \tag{3}$$

and keep in mind that the greater $l_0$ the better the approximation.

Integration of (3) gives

$$(d\xi/dn)^2 = \epsilon^2 + (1 - \cos 2\pi\xi)/2l_0^2, \tag{4}$$

$\epsilon$ being an integration constant. $d\xi/dn$ is the amount by which the distance between
successive atoms, measured in units of $a$, exceeds unity. $\epsilon$ is evidently the value of
this excess where $\xi = 0$, i.e. where successive atoms are in or close to troughs of the
substrate potential.

Let us write

$$k^2 = 1/(1 + \epsilon^2 l_0^2) \quad\text{and}\quad \phi = \pi(\xi - \tfrac12),$$

and choose the zero for $n$ where $\xi = \tfrac12$, i.e. $\phi = 0$, then (4) becomes

$$d\phi/dn = \pm(\pi/l_0k)(1 - k^2\sin^2\phi)^{1/2}, \tag{5}$$

where the +ve sign refers to expansions of the chain of atoms with respect to the
substrate, producing what we shall call negative dislocations. The negative sign
corresponds to compression relative to the substrate, and positive dislocations. If
we introduce the convention that $l_0 > 0$ for negative dislocations and $l_0 < 0$ for positive
ones, we may discard the ± sign in (5) without loss of generality. It is convenient
to consider negative dislocations, which we shall do hereafter, and keep in mind that
the results obtained are also valid for positive dislocations.
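The steps from the differential equation (3) to its first integral and to the form in $\phi$ are brief, and may be set out as follows. This is a sketch of the intermediate algebra under the definitions of $\epsilon$, $k$ and $\phi$ given in the text, not a quotation from the paper.

```latex
\begin{align*}
2\frac{d\xi}{dn}\frac{d^2\xi}{dn^2} &= \frac{\pi}{l_0^2}\sin 2\pi\xi\,\frac{d\xi}{dn}
\;\Longrightarrow\;
\left(\frac{d\xi}{dn}\right)^2 = \epsilon^2 + \frac{1-\cos 2\pi\xi}{2l_0^2}
= \epsilon^2 + \frac{\sin^2\pi\xi}{l_0^2},\\
\sin^2\pi\xi &= \cos^2\phi = 1-\sin^2\phi
\qquad \bigl(\phi = \pi(\xi-\tfrac12)\bigr),\\
\left(\frac{d\phi}{dn}\right)^2 &= \pi^2\left(\frac{d\xi}{dn}\right)^2
= \frac{\pi^2}{l_0^2}\left(1+\epsilon^2 l_0^2-\sin^2\phi\right)
= \frac{\pi^2}{l_0^2k^2}\left(1-k^2\sin^2\phi\right),
\qquad k^2 = \frac{1}{1+\epsilon^2 l_0^2}.
\end{align*}
```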


(i) Dislocations far apart ($\epsilon = 0$; $k = 1$)

If $\epsilon = 0$, then $k = 1$. Then from (4)

$$d\xi/dn = (1/l_0)\sin\pi\xi,$$

whence

$$\pi n/l_0 = \pi\int_{1/2}^{\xi}(1/\sin\pi\xi)\,d\xi = \ln\tan(\pi\xi/2).$$

This can be written

$$\xi = (2/\pi)\arctan e^{\pi n/l_0}, \tag{6}$$

where $n = 0$ when $\xi = \tfrac12$, and

$$d\xi/dn = (1/l_0)\sin\pi\xi. \tag{6a}$$

Figure 2 gives a graphical illustration of (6). It represents a single dislocation
infinitely far from any other in the chain. From (6a) and figure 2 one sees that
approximately in a distance corresponding to $l_0$ atoms the displacement $\xi$ changes
by 1. In a relatively small region $l_0$, therefore, there exists a state of misfit between
the atoms and the substrate, while everywhere else there is a nearly perfect degree
of fit. In a positive dislocation there is one more and in a negative dislocation there
is one less atom than there are troughs of the substrate potential. $l_0$, which is a
measure of the magnitude of the region over which misfit extends, may be called
the effective length of a dislocation.


Figure 2


(ii) General solution: $k < 1$

From (5) it follows that

$$\pi n/l_0k = \int_0^{\phi}(1 - k^2\sin^2\psi)^{-1/2}\,d\psi = F(k, \phi) = F(k, \pi\xi - \tfrac12\pi) \tag{7}$$

in the standard notation for elliptic integrals. This can be written

$$\xi = \tfrac12 + (1/\pi)\,\mathrm{am}(\pi n/l_0k)$$

and

$$d\xi/dn = (1/l_0k)\,\mathrm{dn}(\pi n/l_0k) = (1/l_0k)(1 - k^2\cos^2\pi\xi)^{1/2}, \tag{8}$$

employing the notation of elliptic functions (Jahnke & Emde 1938).

Figure 3 illustrates a typical example.

Figure 3
We now have a regular sequence of similar dislocations.

From (8) and figure 3 one sees that the effective length of a dislocation is now
reduced to $kl_0$, and the number of atoms per dislocation is

$$P = 2l_0kK(k)/\pi, \tag{9}$$

where $K(k)$ is the complete elliptic integral $F(k, \tfrac12\pi)$.

Referring back to figure 1, it is to be observed that even for small values of $l_0$, so
long as $l_0$ exceeds unity, there is good qualitative correspondence between solutions
of the difference equation (2) and the differential equation (3).
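Equation (9) can be evaluated without tables by computing the complete elliptic integral $K(k)$ from the arithmetic-geometric mean. This is a sketch under that assumption (the paper itself refers to tabulated values); the function names are hypothetical.

```python
import math

def ellip_K(k):
    """Complete elliptic integral of the first kind, K(k) = F(k, pi/2), for 0 <= k < 1.

    Uses the arithmetic-geometric mean: K(k) = pi / (2 * AGM(1, sqrt(1 - k**2))).
    """
    a, b = 1.0, math.sqrt(1.0 - k * k)
    while abs(a - b) > 1e-15 * a:
        a, b = 0.5 * (a + b), math.sqrt(a * b)
    return math.pi / (2.0 * a)

def atoms_per_dislocation(l0, k):
    """Number of atoms per dislocation, P = 2*l0*k*K(k)/pi, equation (9)."""
    return 2.0 * l0 * k * ellip_K(k) / math.pi

if __name__ == "__main__":
    for k in (0.5, 0.9, 0.99, 0.999):
        print(k, round(atoms_per_dislocation(7.0, k), 3))
```

As $k$ approaches 1 the value of $K(k)$ grows without limit, so $P$ becomes large and the sequence goes over into the widely spaced dislocations of case (i).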

4. Equilibrium with free ends

On differentiating (1) the term in $1/P_0$, and hence all reference to the natural spring-length
$b$, disappeared. This is only significant in connexion with the boundary
conditions. We consider the case in which the chain has a free end whereas the
substrate continues.

For simplicity we speak of tension, which is appropriate for negative dislocations
($l_0 > 0$). The results for positive dislocations are identical.

The tensions in the springs are given by

$$t_n = \mu a(\xi_{n+1} - \xi_n - 1/P_0) = \mu a\{d\xi/dn + (1/2!)\,d^2\xi/dn^2 + \dots - 1/P_0\}.$$

Neglecting higher terms in the Taylor series, and substituting for $d\xi/dn$ from (8),
we find

$$t = (2Wl_0^2/a)\{(1 - k^2\cos^2\pi\xi)^{1/2}/kl_0 - 1/P_0\}, \tag{10}$$

and in the special case where $\epsilon = 0$, $k = 1$, $P = \infty$,

$$t = (2Wl_0^2/a)\{(1/l_0)\sin\pi\xi - 1/P_0\}. \tag{10a}$$

A graphical representation of (10) is shown in figure 4.

If the springs are cut at the points represented by A, B and C in figures 3 and 4,
then the separate parts of the chain will remain in equilibrium with their ends free;
cutting it at B, for instance, will leave that part to the right of B in stable and that
to the left in unstable equilibrium. The displacements $\xi_0$ of the free ends ($t = 0$)
follow from (10), i.e.

$$\cos\pi\xi_0 = \pm(1/k)(1 - k^2l_0^2/P_0^2)^{1/2}, \tag{11}$$

and from (10a), at a free end far from any dislocation,

$$\sin\pi\xi_0 = l_0/P_0. \tag{11a}$$
5. Energies

5.1. Potential energy per dislocation $V_0$

From (1) the potential energy per dislocation is

$$V_0 = Wl_0^2\sum_{n=0}^{P}(\xi_{n+1} - \xi_n - 1/P_0)^2 + \tfrac12 W\sum_{n=0}^{P}(1 - \cos 2\pi\xi_n)$$

$$\phantom{V_0} = Wl_0^2\int_0^P(d\xi/dn - 1/P_0)^2\,dn + W\int_0^P\sin^2\pi\xi\,dn,$$

where $\xi_{n+1}$ is expanded in a Taylor series, terms of higher order than the first are
neglected, and the summation replaced by an integral. Hence, using (7), (8) and (9),
and carrying out the integration, this gives

$$V_0 = Wl_0^2\{4E(k)/\pi kl_0 - 2(1 - k^2)K(k)/\pi kl_0 - 2/P_0 + 2kl_0K(k)/\pi P_0^2\}, \tag{12}$$

where $E(k) = \int_0^{\pi/2}(1 - k^2\sin^2\psi)^{1/2}\,d\psi$ is the complete elliptic integral of the second
kind.

$V_0$, thus defined, includes strain energy which is present in the chain (unless $1/P_0$
is zero) in the absence of dislocations. This energy is represented by the final term
in (12). We may omit this term if we prefer to regard the undislocated state as the
zero of potential energy.


5.2. Generation of a dislocation by force on the free end

A dislocation can be generated by pulling the end atom reversibly from its equilibrium
displacement $\xi_0$ (A, figures 3 and 4) to its next stable equilibrium displacement
$1 + \xi_0$ (C, figures 3 and 4). The work done during this process is

$$W_0 = a\int_{\xi_0}^{1+\xi_0} t\,d\xi \quad\text{(as follows from figure 4)}$$

$$\phantom{W_0} = 2Wl_0^2\{2E(k)/\pi kl_0 - 1/P_0\}. \tag{13}$$

The corresponding value for single dislocations then becomes (limit $k = 1$)

$$W_0 = 2Wl_0^2(2/\pi l_0 - 1/P_0). \tag{13a}$$

It may be noted that if $Q$ is the total potential energy of a long chain of atoms
containing $N$ dislocations, $V_0$ is the mean potential energy per dislocation, $Q/N$,
while $W_0$ is the differential energy per dislocation, $dQ/dN$. This relationship may be
verified by integration.



5.3. Activation energies

Consideration of this process, the application of force to the end atom of the chain,
enables us to define an activation energy for the introduction of a dislocation, i.e.
the work done in pulling the end atom reversibly from its stable equilibrium displacement
(A, figures 3 and 4) to its unstable equilibrium displacement (B, figures
3 and 4), given by

$$U_g = a\int_{\xi_0}^{1-\xi_0} t\,d\xi = 4Wl_0^2\{E(k, \tfrac12\pi - \pi\xi_0)/\pi kl_0 - (\tfrac12 - \xi_0)/P_0\}, \tag{14}$$

where $\xi_0$ is given by (11). $E(k, \tfrac12\pi - \pi\xi_0)$ is the incomplete elliptic integral of the
second kind.

For dislocations far apart (limit $k = 1$)

$$U_g = 4Wl_0^2\{\cos\pi\xi_0/\pi l_0 - (\tfrac12 - \xi_0)/P_0\}. \tag{14a}$$

Similarly, the activation energy for the escape of dislocations is

$$U_e = -a\int_{1-\xi_0}^{1+\xi_0} t\,d\xi = U_g - W_0. \tag{15}$$

6. Stability limits

The necessary and sufficient condition that a given solution of equation (3) can
correspond to equilibrium when the ends of the chain of atoms are free is that there
shall exist a real solution $\xi_0$ of the equation $t(\xi_0) = 0$, i.e. equation (11). Hence

$$1 - k^2l_0^2/P_0^2 < k^2 \quad\text{and}\quad 1 - k^2l_0^2/P_0^2 > 0,$$

i.e.

$$(1/k^2 - 1)^{1/2} < l_0/P_0 < 1/k. \tag{16}$$

If $1/k < l_0/P_0$ dislocations will be spontaneously generated at the ends, while if
$(1/k^2 - 1)^{1/2} > l_0/P_0$ they will spontaneously escape. When dislocations are absent or
far apart (16) reduces to

$$0 \le l_0/P_0 < 1. \tag{16a}$$

The state of lowest energy will be that for which the energy remains stationary for
generation or escape of dislocations, i.e. that for which $W_0 = 0$, so that

$$l_0/P_0 = 2E(k)/\pi k, \tag{17}$$

and the corresponding equation for dislocations far apart is then

$$l_0/P_0 = 2/\pi. \tag{17a}$$

This defines the critical value of misfit, above which the lowest energy state of the
system is one containing dislocations. A somewhat larger misfit, as given by (16a),
is needed for them to be developed spontaneously (at the absolute zero of temperature).
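The two critical misfits for dislocations absent or far apart, (17a) and the upper limit of (16a), divide the range of $l_0/P_0$ into three regimes. A minimal sketch of that classification follows; the function name and the returned wording are assumptions, and the sign conventions for $l_0$ and $P_0$ are set aside by taking the magnitude of the ratio.

```python
import math

def regime(l0, P0):
    """Classify the misfit l0/P0 for dislocations absent or far apart (k = 1)."""
    misfit = abs(l0 / P0)  # assumption: only the magnitude of the misfit matters here
    if misfit < 2.0 / math.pi:  # below the critical value (17a)
        return "undislocated state lowest in energy"
    if misfit < 1.0:  # between (17a) and the upper limit of (16a)
        return "dislocated state lowest in energy; generation needs activation"
    return "dislocations generated spontaneously at free ends"

if __name__ == "__main__":
    for l0, P0 in ((7.35, 144.0), (7.0, 10.0), (7.0, 5.0)):
        print(round(l0 / P0, 3), regime(l0, P0))
```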



7. Applications of the model

7.1. Imagine a solid crystal with a simple cubic lattice containing one or more
straight parallel Taylor (1934) dislocations (all positive or all negative) in a single
glide-plane. Suppose the material to be shaved away, parallel to this glide-plane,
till there remains only one layer (or a very small number of layers) of atoms above
the glide-plane, while the crystal is still thick below it. In the new elastic equilibrium
nearly all the strain associated with the dislocation(s) will be in the thin
layer above the glide-plane, and the crystal below can be regarded as effectively
rigid. Then in each row of atoms immediately above the glide-plane and normal to
the dislocation line(s) the conditions are substantially those of our first model, and
the equations developed may be applied to the system with reasonably good
accuracy. The system may be treated one-dimensionally so long as the dislocation
lines remain straight and parallel. In this case $b$ is approximately equal to $a$, but not
exactly, since in general a surface layer of atoms in a crystal has a tendency to expand
slightly if held together by homopolar forces, or to contract if they are ionic. Larger
differences between $b$ and $a$ arise when we are considering overgrowth or adsorption
of one substance on a crystal of another.

7.2. Along the edge of an outer layer, or at a place where it is incomplete, ending
in a straight terrace edge, we have a row of atoms attracting and repelling each other
so as to tend to maintain a natural spacing $b$ and subject to a substantially sinusoidal
potential field of wave-length $a$ (not in general exactly equal to $b$) arising from their
neighbours in the rest of the crystal. This is a truly one-dimensional problem, corresponding
closely to our first model. We estimate the physical parameters of the
model for correspondence with this case from a Lennard-Jones law of forces (1924)
between the atoms in § 8, and find that the appropriate value of $l_0$, the effective spread
of the dislocation, is about 7.

Values of $l_0$ of similar order of magnitude will apply in case (7.1) when we consider
dislocations between a surface monolayer and the remainder of the crystal.

7.3. If in case (7.1) we start with Burgers (1939) 'screw' dislocations in place of
the Taylor dislocations, we shall finish with a situation which is again well described
by our equations, in which, however, the displacements are transverse to the
chain of atoms enumerated by $n$. The atomic positions will in fact correspond
directly to the graphs of $\xi$ versus $n$ shown in figures 2 and 3. These dislocations,
corresponding to screw dislocations in three dimensions, may be called 'transverse':
those of case (7.1) corresponding to Taylor dislocations in three dimensions are
'longitudinal'. The longitudinal and transverse dislocations of a surface layer can
be of importance in connexion with adsorbed monolayers, oxide layers, and so on.
These, and the longitudinal edge dislocations (7.2) all deserve consideration
in connexion with the original formation of dislocations in the growth of a
crystal.

7.4. A long-chain molecule, say of a normal aliphatic compound, which has some
torsional flexibility, and is restrained in twisting by the forces from parallel neighbour
chains in the crystal, provides another system to which the same equations
are applicable, wherein the displacements are angles of rotation.



7.5. A somewhat different physical application is shown in figure 5 which gives
a formal two-dimensional description of forced twinning (e.g. as observed in calcite).
For the case pictured, an unstrained (two-dimensional) unit cell is supposed to be
a 60° rhombus. The state shown, with three or four badly strained unit cells near the
centre and around that a rapid asymptotic approach to a strain-free condition,
should by its motion translate the twinning surface, causing one twin to grow into
the other: and an applied stress should cause it to move. This configuration, however,
is not a simple dislocation; its presence does not disturb the continuity of lattice
planes. In so far as the twinning plane may be looked upon as a close grid of parallel
Taylor dislocations with displacement vectors normal to the plane (Wooster 1940)
(which exert forces on each other to hold their neighbours in the same plane, and
are moreover subject to a periodic 'substrate potential' because they are in a crystal),
the figure represents a transverse dislocation of the system of dislocations. We may
call it a dislocation of second order. Qualitatively, we may describe it by the equations
of a one-dimensional dislocatable system developed above, with $\xi$ representing
the displacement of an individual Taylor dislocation. However, at a distance from
this second-order dislocation there exists a state of strain similar to that around
a single Taylor dislocation; there are $(n + 1)$ unit cells in the same distance, measured
parallel to the twinning plane, above it, as contains $n$ below it. On this account
these second-order dislocations exert long-range repulsive forces on each other.
This agrees with observation; Garber's (1947) elastic twin wedges are only about
1 μ thick for a length of the order 1 cm., so that $1/P$, the inclination of the twinning
surface to the ideal twinning plane, probably does not exceed about 1/10,000 to
1/1000.

There are other second-order dislocations, a sigmoid bend of an ordinary dislocation
line from one lattice line on to the next parallel one being the simplest type.
These are required to describe the inclined twinning surfaces in a Reusch 'rectangle'
or Garber 'leaf' elastic twin of calcite, as contrasted with the Garber wedges. Being
dislocations of a one-dimensional system, they also are qualitatively illustrated by
the equations investigated.

8. In order to apply the equations obtained for our model to actual physical
systems, it is necessary to assign values to the physical parameters $l_0$, $P_0$. The
most important applications are to a surface 'island' layer of atoms on a crystal,
in which we may have dislocations of the edge row with respect to the rest of the
island (§ 7.2) or of the layer with respect to the substrate crystal (§ 7.1). In particular,
in the latter case the atoms of the surface layer need not be of the same kind as those
of the substrate. In this case $P_0$ can be estimated approximately from the lattice
spacings of the bulk materials. To this approximation it becomes infinite when the
atoms are of the same kind, but this is not correct. The values both of $P_0$ and of $l_0$ may
be estimated in this case by the methods of Lennard-Jones. Let the suffixes
X = I, II, III, refer to close-packed one-dimensional, two-dimensional, and three-dimensional
lattices (the latter in cubic close packing). Then the potential energy
per atom may be written

$$V_X = -A\,{}^mS_X\,\alpha^{-m} + B\,{}^nS_X\,\alpha^{-n},$$

where $A$ and $B$ are constants, $\alpha$ the interatomic distance, and ${}^mS_X$ signifies the
lattice sum for the exponent $-m$ and the lattice X. We shall actually take $n$ to be
12 and $m$ to be 6.

The equilibrium spacing $\alpha_X$ is defined by the condition $\partial V_X/\partial\alpha = 0$. Thus

$$\alpha_X = (nB\,{}^nS_X/mA\,{}^mS_X)^{1/(n-m)}.$$

The latent heat per atom, $L_X$, is minus the potential energy per atom at equilibrium
spacing. Thus

$$L_X = A\,{}^mS_X\,\alpha_X^{-m}(n - m)/n.$$

The force constant in homogeneous compression is

$$\mu_X = \partial^2V_X/\partial\alpha^2 = m(n - m)A\,{}^mS_X\,\alpha_X^{-m-2} = mnL_X\alpha_X^{-2}.$$

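The lattice sums required are

$${}^6S_{\rm I} = 1.0173, \qquad {}^6S_{\rm II} = 3 \times 1.063_0, \qquad {}^6S_{\rm III} = 6 \times 1.20433,$$

$${}^{12}S_{\rm I} = 1.00025, \qquad {}^{12}S_{\rm II} = 3 \times 1.001_6, \qquad {}^{12}S_{\rm III} = 6 \times 1.01097,$$

the values for ${}^6S_{\rm III}$ and ${}^{12}S_{\rm III}$ being those of Jones & Ingham (1925).

If $\alpha_0 = (2B/A)^{1/6}$ represents the equilibrium spacing of a pair (or triangle or tetrahedron)
of atoms, we have

$$\alpha_0/\alpha_{\rm I} = 1.0027, \qquad \alpha_0/\alpha_{\rm II} = 1.01_0, \qquad \alpha_0/\alpha_{\rm III} = 1.029,$$

$$\alpha_{\rm I}/\alpha_{\rm II} = 1.007, \qquad \alpha_{\rm I}/\alpha_{\rm III} = 1.026, \qquad \alpha_{\rm II}/\alpha_{\rm III} = 1.019.$$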
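The one-dimensional lattice sums and the spacing ratios can be checked directly from the 6-12 law, for which $\alpha_X^6 = 2B\,{}^{12}S_X/A\,{}^6S_X$. The sketch below is illustrative only: the function names are hypothetical, the linear-chain sums are computed by direct summation, and the two-dimensional sums are taken as the rounded values $3 \times 1.063$ and $3 \times 1.0016$.

```python
def zeta_sum(s, terms=2000):
    """Sum of n**(-s) for n = 1..terms: the pair sum per atom of a linear chain."""
    return sum(n ** (-s) for n in range(terms, 0, -1))

def spacing_ratio(S6_x, S12_x, S6_y, S12_y):
    """alpha_X / alpha_Y for the 6-12 law, using alpha_X**6 = 2*B*S12_X / (A*S6_X)."""
    return ((S12_x / S6_x) / (S12_y / S6_y)) ** (1.0 / 6.0)

if __name__ == "__main__":
    S6_I, S12_I = zeta_sum(6), zeta_sum(12)
    S6_II, S12_II = 3 * 1.063, 3 * 1.0016  # rounded values, assumed from the text
    ratio = spacing_ratio(S6_I, S12_I, S6_II, S12_II)
    print(round(S6_I, 4), round(S12_I, 5), round(ratio, 4))
    print("vernier period of edge row:", round(1.0 / (ratio - 1.0), 1))
```

Because the vernier period is the reciprocal of a small difference, it is sensitive to the last retained digit of the spacing ratio; rounded inputs of this kind give a value of the same size as, but not identical with, the figure quoted in the text.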


Accordingly, the natural misfit of a row at the edge of an island layer is given by
its vernier period

$$P_0({\rm I, II}) = 144,$$

and correspondingly the natural misfit of a surface layer on the three-dimensional
crystal is given by

$$P_0({\rm II, III}) = 53.5.$$

To estimate $l_0$ for the case of the edge row of a two-dimensional lattice we consider
the potential energy of a single atom rolling in contact with the 'straight' edge of
a semi-infinite two-dimensional lattice. We take the lattice spacing to be uniform,
as though the atoms were hard spheres of diameter $\alpha_{\rm II}$, and the path of the additional
atom to be the same as it would be if it were a hard sphere of the same diameter.
Then the highest and lowest energies are obtained at the two symmetrical
positions where it makes respectively one and two contacts. These energies are

$$1.019A\alpha_{\rm II}^{-6} = 0.639L_{\rm II} = 0.274L_{\rm III},$$

$$0.784A\alpha_{\rm II}^{-6} = 0.492L_{\rm II} = 0.193L_{\rm III},$$

respectively. Taking the fluctuation to be sinusoidal as an approximation (which we
shall refine in a later paper in this series), the appropriate value of $W$ in equation (1)
is the difference between these two energies:

$$W_{\rm II} = 0.3254A\alpha_{\rm II}^{-6} = 0.204L_{\rm II} = 0.0807L_{\rm III}.$$

We likewise have the force constant

$$\mu_{\rm I} = 36A\alpha_{\rm I}^{-8} \times 1.0173.$$

And hence

II = /tia|i/2Tfii = 36 X l-0173a|i/2 x 0-326a|. 

Thus $l_0({\rm I, II}) = 7.35$.

$l_0$ is naturally larger the harder the atoms. Since $W$ is approximately proportional
to $L$, $l_0$ is approximately proportional to $(mn)^{1/2}$.

We may now find the energy of such an edge dislocation, from equation (13a):

$$W_0 = W(4l_0/\pi - 2l_0^2/P_0) = 0.0807L_{\rm III}(9.37 - 0.75) = 0.695L_{\rm III}.$$

Thus it has just the same order of magnitude as the latent heat of evaporation of
a single atom. Such edge dislocations should therefore be fairly frequently formed
by thermal agitation.

The value of $l_0$ should be approximately the same for dislocations of the surface
layer with respect to the underlying three-dimensional crystal; for the force constant
$\mu$ should be about the same in both cases (we are not dealing with homogeneous compression
in this second case), and the value of $W$ corresponds to the breaking of one
atomic contact per atom in both cases. In this case the dislocation is a line and its
energy proportional to the line length, of the order of magnitude of an atomic latent
heat per atomic spacing of length. It is doubtful whether any appreciable number of
dislocations will be formed in thermal equilibrium in this case, but it may be noted
that the misfit parameter $l_0/P_0 \approx 0.15$ is sufficiently large to give an activation energy
hindering the escape of any dislocations which are formed, except when they are
very close together (about 16 atomic spacings apart according to equations (16)
and (9)).
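The spacing at which the escape barrier disappears can be estimated by setting the lower limit of (16) equal to the misfit, which fixes $k$, and then using (9). The following sketch assumes that reading of the text; the function names are hypothetical and $K(k)$ is computed from the arithmetic-geometric mean rather than from tables.

```python
import math

def ellip_K(k):
    """Complete elliptic integral K(k) = F(k, pi/2) via the arithmetic-geometric mean."""
    a, b = 1.0, math.sqrt(1.0 - k * k)
    while abs(a - b) > 1e-15 * a:
        a, b = 0.5 * (a + b), math.sqrt(a * b)
    return math.pi / (2.0 * a)

def escape_limit_spacing(l0, misfit):
    """Atoms per dislocation at which the lower limit of (16) equals the misfit l0/P0.

    (1/k**2 - 1)**0.5 = misfit gives k; equation (9) then gives P.
    """
    k = 1.0 / math.sqrt(1.0 + misfit ** 2)
    return 2.0 * l0 * k * ellip_K(k) / math.pi

if __name__ == "__main__":
    print(round(escape_limit_spacing(7.35, 0.15), 2))
```

With $l_0 = 7.35$ and $l_0/P_0 = 0.15$ this gives a spacing between 15 and 16 atoms, in line with the figure of about 16 quoted in the text.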





We have to thank Professor N. F. Mott for suggesting this problem, and one of us 
(J. H. van der Merwe) has to thank the South African Council for Scientific and 
Industrial Research for a grant, and special leave which rendered it possible to 
perform this research. 


References

Burgers, J. M. 1939 Proc. Acad. Sci. Amst. 42, 293.

Dehlinger, U. 1939 Ann. Phys., Lpz., (5), 2, 749.

Frenkel, J. & Kontorowa, T. 1938 Phys. Z. Sowjet. 13, 1.

Garber, R. 1947 Russian J. Phys. 11, 55.

Jahnke & Emde 1938 Funktionentafeln, pp. 52-90.

Jones, J. E. 1924 Proc. Roy. Soc. A, 106, 463.

Jones, J. E. & Ingham, A. E. 1925 Proc. Roy. Soc. A, 107, 636.

Lennard-Jones, J. E. 1940 Proc. Phys. Soc. 52, 38.

Taylor, G. I. 1934 Proc. Roy. Soc. A, 145, 362.

Wooster, W. A. 1940 Proc. Phys. Soc. 52, 55.


One-dimensional dislocations. II. Misfitting 
monolayers and oriented overgrowth 

By F. C. Frank and J. H. van der Merwe
H. H. Wills Physical Laboratory, University of Bristol

(Communicated by N. F. Mott, F.R.S. - Received 22 December 1948 - Revised 25 March 1949 - Read 19 May 1949)


The equations derived in part I of this series for a one-dimensional dislocation model are
applied in this paper to the case of a monolayer on the surface of a crystalline substrate,
particularly when the natural lattice spacing of the monolayer differs from that of the
substrate. Justification is given for this extension of the equations to the two-dimensional
case. It is shown that the theory predicts a certain critical amount of misfit (9 % difference in
lattice spacing in a simple case) below which the monolayer in its lowest energy state is
deformed into exact fit with the substrate, and above which it is only slightly deformed in
the mean, having many dislocations between it and the substrate. The energy of adsorption
as a function of misfit is also calculated, becoming almost constant above the critical limit. Up
to a larger critical misfit (about 14 % in the same simple case) the monolayer can be deposited
metastably in exact fit on the substrate, at sufficiently low temperature. Since the dislocated
layer is mobile on the surface, completely oriented overgrowth of one crystal on another can
only be expected if the first monolayer can be formed over the complete surface under
subcritical conditions. This is in general agreement with observation.


1. Preliminary investigation

We propose to apply the equations derived in part I for one-dimensional dislocations
to misfitting monolayers on crystalline substrates. Before doing so we examine
a little more closely the validity of these equations for a two-dimensional problem.
Let us take as a simple first-order representation of the potential energy of a two-dimensional
square array of atoms, enumerated by $n$ in the $x$ direction, $m$ in the
$y$ direction, on a crystal face on which the minima of potential energy for single
atoms likewise make a square array (e.g. (100) faces of simple cubic or close-packed
cubic lattices):

F = s ilF[l - cos 2nZn.m. + 1 - cos J 

n,m 

n.m 

n,m 

-f- S ^IIo)iVn,m+l~ Vn.m (1) 

n.m 

Here $a(n + \xi_{n,m}) = x_{n,m}$ and $a(m + \eta_{n,m}) = y_{n,m}$ are the co-ordinates of the atom
enumerated $(n, m)$. The first summation expresses the substrate potential, being
(for different axes in the two cases) the commencement of a two-dimensional Fourier
series representation of the field at the surface of either of the two crystals mentioned
above. The second summation expresses the elastic resistance of the monolayer to
change of lattice spacing from its natural value $b$, not necessarily equal to that $a$ of
the substrate, as indicated by the vernier period of misfit, $P_0 = a/(b - a)$. The third
summation expresses the elastic resistance of the monolayer to shear, the expression
in brackets being (for small differences of the displacements) equal to the amount by
which the angle made by atoms $(n + 1, m)$, $(n, m)$, $(n, m + 1)$ differs from its natural
value $\tfrac12\pi$. Thus $\mu$ and $\nu$ are respectively force constants for compression and shear of
the monolayer. $\lambda$ is a force constant for shear at 45° to the former one.

The conditions for equilibrium of the atom $(n, m)$ are:

0 = 0F/ac„,« 

— nW sin 

(7n,9n— 

X<P {ifn, mr¥L ~Vn,m~~ Vn-1, m+1 Vn—1, m) 

snW sin 2n^-fia^i\%^- (2«) 
and 0 = ttIF sm27r9/-/{a*AOT5/ — va^A|9;-Fva®A„i+iA„_jf-l-Au*A„i_jA,i+jC- (20) 

Solutions of equations (2a, b) include a number of simple cases. Let us anticipate
the existence of solutions in which the cross terms of (2a) are zero. Then (i) if $\xi$ is
independent of $m$, the third term vanishing, or (ii) if $\xi$ is independent of $n$, the second
term vanishing, equation (2a) reduces to the one-dimensional form, being equivalent
to equation I (2). In either of these cases the final terms of (2b) vanish, so that (2b)
has corresponding simple cases with 'one-dimensional' solutions introducing no
cross-term in (2a). Cases (i) and (ii) represent the forms assumed by Taylor 'edge'
and Burgers 'screw' dislocations respectively when they lie in the outermost glide-plane
of a crystal. Thus surface dislocations of type (1, 0) (displacement vector
$(a, 0)$) and type (0, 1) (displacement vector $(0, a)$) are mutually independent, even
when intersecting or superposed, provided that they are of pure Taylor or pure
Burgers form, that is to say, that the dislocation lines run either normal or parallel
to their displacement vectors.

In the more general case of a straight dislocation inclined at an angle $\alpha$ to the
$x$-axis, so that

$$\xi(n, m) = \xi(p), \qquad \eta(n, m) = \eta(p), \qquad p = m\cos\alpha - n\sin\alpha,$$

equations (2a, b) reduce, when the differences are approximated as differentials, to

$$0 = \pi W\sin 2\pi\xi - a^2(\mu\cos^2\alpha + \nu\sin^2\alpha)(d^2\xi/dp^2) - a^2(\nu + \lambda)\sin\alpha\cos\alpha\,(d^2\eta/dp^2), \tag{3a}$$

and

$$0 = \pi W\sin 2\pi\eta - a^2(\mu\cos^2\alpha + \nu\sin^2\alpha)(d^2\eta/dp^2) - a^2(\nu + \lambda)\sin\alpha\cos\alpha\,(d^2\xi/dp^2). \tag{3b}$$

Neglecting the final term of (3a), it reduces to the one-dimensional form equivalent
to I (3) with

$$l_0 = a\{(\mu\cos^2\alpha + \nu\sin^2\alpha)/2W\}^{1/2}.$$

However, unless $(\nu + \lambda)\sin\alpha\cos\alpha$ is zero, the presence of $\xi$ displacements then
demands $\eta$ displacements also, to satisfy (3b), which feed back a final perturbing
term into (3a). The perturbing term is very small if $(\nu + \lambda)\sin\alpha\cos\alpha$ is small and
there are no (0, 1) dislocations. Thus the one-dimensional equations give a good
approximate representation of skew dislocations as well as those of normal form,
but skew dislocations of (1, 0) and (0, 1) type interfere with each other where they
intersect or overlap.

As a special case (1, 1) dislocations of any inclination reduce simply to the one-dimensional
form. For if $\eta = \pm\xi$ equations (3a) and (3b) both reduce to

$$0 = \pi W\sin 2\pi\xi - a^2\{\mu\cos^2\alpha + \nu\sin^2\alpha \pm (\nu+\lambda)\sin\alpha\cos\alpha\}\,\frac{d^2\xi}{dp^2}, \tag{4}$$

which is of one-dimensional form with

$$l_0 = a\{(\mu\cos^2\alpha + \nu\sin^2\alpha \pm (\nu+\lambda)\sin\alpha\cos\alpha)/2W\}^{1/2},$$

e.g. if $\alpha = \tfrac{1}{4}\pi$ we have for the (1, 1) Taylor dislocation

$$l_0 = a\{(\mu + 2\nu + \lambda)/4W\}^{1/2},$$

and for the $(1, \bar{1})$ Burgers dislocation

$$l_0 = a\{(\mu - \lambda)/4W\}^{1/2},$$

contrasted with

$$l_0 = a(\mu/2W)^{1/2} \quad\text{and}\quad l_0 = a(\nu/2W)^{1/2}$$

for the (1, 0) and (0, 1) Taylor and Burgers surface dislocations respectively.
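
The special cases quoted above can be checked numerically. The following is a minimal sketch, assuming the form of $l_0$ given with equation (4); the function name `dislocation_width` and the numerical constants in the usage note are hypothetical and serve only as illustration.

```python
import math


def dislocation_width(a, mu, nu, lam, W, alpha, coupling=0):
    """Return l0 for a straight dislocation inclined at angle alpha.

    coupling = 0  : cross term neglected (the form following equation (3a));
    coupling = +1 : eta = +xi in equation (4);
    coupling = -1 : eta = -xi in equation (4).
    The argument names are illustrative assumptions, not the authors' notation.
    """
    s, c = math.sin(alpha), math.cos(alpha)
    stiffness = mu * c * c + nu * s * s + coupling * (nu + lam) * s * c
    if stiffness <= 0.0:
        raise ValueError("effective stiffness must be positive")
    return a * math.sqrt(stiffness / (2.0 * W))
```

For example, with the hypothetical values `a = 1`, `mu = 3`, `nu = 1`, `lam = 1`, `W = 1`, the call `dislocation_width(1, 3, 1, 1, 1, math.pi / 4, +1)` should agree with $a\{(\mu+2\nu+\lambda)/4W\}^{1/2}$, and the same call with `coupling=-1` with $a\{(\mu-\lambda)/4W\}^{1/2}$.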

We omit further exploration of the mutual interaction of dislocations of different 
type, having established the general applicability of the one-dimensional equations 
to the two-dimensional problem, and the mutual independence of crossed disloca- 
tions in important cases. 

2. Introduction

Having confirmed the relevance of the one-dimensional dislocation equations to 
the two-dimensional monolayer, we shall use these equations to calculate the energy 
and stability of dislocated and non-dislocated states of the monolayer, for various 





degrees of misfit. Mathematically, it will be convenient to express the dislocation 
density indirectly, by means of the parameter k (the argument of the complete 
elliptic integrals). On this account we may advantageously preface the mathe- 
matical treatment by a qualitative explanation. Supposing there are no dislocations 
between monolayer and substrate, the monolayer therefore being homogeneously 
deformed to fit exactly on to the substrate, the strain energy involved depends 
quadratically upon the misfit. On the other hand, if there is no misfit, the formation 
of dislocations stores an amount of strain energy proportional to their number, 
whether they are positive or negative, until their density becomes high when there 
is additional energy due to their interaction. Thus, initially, the energy increases 
quadratically with the misfit and linearly with the dislocation density. When there
is a finite amount of misfit (let us say, the monolayer is in compression) the energy 
required to form a positive dislocation is increased, and the energy required to form 
a negative dislocation is reduced. This reduction increases proportionally with the 
amount of misfit, and at a certain critical amount of misfit the net amount of energy 
required to make a dislocation becomes zero. Since the dislocations do not interact 
appreciably until they are close together, the dislocation density in the equilibrium 
state increases almost abruptly from zero to a large value on passing this critical 
amount of misfit. Thereafter, the dislocation density approaches asymptotically to 
the reciprocal of the vernier period of misfit, and the mean lattice spacing of the 
monolayer approaches asymptotically to its natural, unstrained, spacing. Numeri- 
cally, the critical amount of misfit is shown to correspond to a difference of 9 % in the 
lattice spacings for a simple case, though there may be rather wide variation about 
this value, depending on the relative forces exerted by monolayer atoms on each 
other and on the substrate respectively. The energy of adsorption varies parabolic- 
ally with the amount of misfit up to the critical value, and then rapidly approaches 
a constant asymptotic value. 

This is not the only critical condition of importance, for there still remains an 
activation energy for generation of dislocations, which only falls to zero with a larger 
degree of misfit (about 14 % in the same simple case). Unless the misfit exceeds this
second critical value, it is possible for the monolayer to be deposited, at low tem- 
perature, in exact fit on the substrate. This represents a metastable state which 
could eventually be destroyed by thermal activation. 


3. Misfit-dislocation density relationships and stability

It is clear from the foregoing that the energy and hence stability of a deposited
monolayer on a crystalline substrate depends upon the degree of misfit and 
dislocation density. To every energy state, e.g. the lowest energy state, there 
corresponds a definite relationship between these two quantities. 

Referring back to part I, equation I (9) provides us with an expression for the
dislocation density $1/P$ in terms of $k$:

$$\frac{1}{P} = \frac{\pi}{2l_0kK(k)}. \tag{5}$$





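Similarly equation I (12) gives us the expression for the potential energy per dislocation
of the deposit

$$V_D = Wl_0\left\{\frac{4E(k)}{\pi k} + \frac{2(k^2-1)K(k)}{\pi k} - \frac{2l_0}{P_0} + \frac{Pl_0}{P_0^2}\right\}.$$

Hence the mean potential energy per atom of the deposit is

$$V = \frac{V_D}{P} = W\left\{\frac{2E(k)}{k^2K(k)} + 1 - \frac{1}{k^2} - \frac{\pi l_0}{kK(k)P_0} + \frac{l_0^2}{P_0^2}\right\}. \tag{6}$$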


For a given monolayer and substrate the misfit is fixed, so that the energy of the
system depends further only on the concentration of dislocations, which is measured
by $(1/P)$. Since $1/P$ is defined in terms of $k$ it is convenient to make use of the
relationships

$$\frac{\partial V}{\partial(1/P)} = \frac{\partial V}{\partial k}\,\frac{dk}{d(1/P)}, \qquad \frac{d}{dk}\{kK(k)\} = \frac{E(k)}{1-k^2}, \qquad \frac{d}{dk}\left\{\frac{E(k)}{k}\right\} = -\frac{K(k)}{k^2}.$$

Hence

$$\frac{\partial V}{\partial(1/P)} = 2Wl_0\{2E(k)/\pi k - l_0/P_0\}.$$

For values of $l_0/P_0 > 2/\pi$ the lowest energy state is given by $\partial V/\partial(1/P) = 0$, and
therefore

$$l_0/P_0 = 2E(k)/\pi k. \tag{7}$$

For lower values of $l_0/P_0$ the expression for $\partial V/\partial(1/P)$ is always positive, and the
lowest energy state is always given by $1/P = 0$ up to the critical condition $l_0/P_0 = 2/\pi$,
corresponding to $k = 1$ (see figure 1).

In part I we have also derived the condition I (16), namely,

$$(1/k^2 - 1)^{1/2} < l_0/P_0 < 1/k, \tag{8}$$

which determines for a given value of $k$, and hence given $1/P$, the limits of $1/P_0$,
between which the system is in a metastable state, when its boundaries are free.
The limiting condition for there being any dislocations at all is $k = 1$, and hence
corresponds to

$$0 < l_0/P_0 < 1.$$

The limit $l_0/P_0 = 1/k$ corresponds to spontaneous dislocation, i.e. the state in which
the generation of another dislocation needs no activation energy, whereas the limit
$l_0/P_0 = (1/k^2 - 1)^{1/2}$ signifies spontaneous escape of dislocations, i.e. the state in which
one of the existing dislocations can escape without activation energy. (For graph
see figure 1.)

The graphs refer to negative dislocations (Pq > 0, Zq > 0). The corresponding graphs 
for positive dislocations (Pq < 0, Zq < 0) are the reflexions of these curves in the Zq/I^ 
axis. This symmetrical character arises from our use of Hooke’s law for forces between 
atoms of the monolayer and cannot accurately correspond to reality. 

It may also be noted that $1/P = \bar{b}/a - 1$, where $\bar{b}$ is the average lattice spacing of
the dislocated system, and $1/P_0 = b/a - 1$, where $b$ is its natural spacing. Thus the
graph of $l_0/P$ against $l_0/P_0$ is likewise a graph of $l_0(\bar{b}/a - 1)$ against $l_0(b/a - 1)$.



Figure 1. Dependence of relative dislocation density $l_0/P$ or $l_0(\bar{b}/a - 1)$ on relative misfit $l_0/P_0$
or $l_0(b/a - 1)$. A, lowest energy state (note $l_0/P = 0$ when $0 \le l_0/P_0 \le 2/\pi$). B, spontaneous
dislocation. C, spontaneous escape. In an average case in which $l_0 \simeq 7$, the critical condition
$l_0/P_0 = 2/\pi$ corresponds to a misfit of about 9 %.

4. Variation of mean potential energy per atom with dislocation density

For a given monolayer and substrate, i.e. for given values of $l_0/P_0$, the mean
potential energy per atom as a function of relative dislocation density, $V(l_0/P)$, is
determined by equations (5) and (6). (See figure 2 for graphs.)

The variation of $V$ against $l_0/P$, in the state of lowest energy of the system, is
naturally one of great importance. The necessary relationship $V_0(l_0/P)$ is easily
obtained by elimination of $l_0/P_0$ between (6) and (7), giving

$$V_0(l_0/P) = W\{4E(k)^2/\pi^2k^2 + 1 - 1/k^2\}. \tag{9}$$

In the limit when there are no dislocations ($k = 1$, $l_0/P_0 = 2/\pi$) this is equal to $4W/\pi^2$.

Elimination of $l_0/P_0$ between (6) and (8) gives the limiting values of $V(l_0/P)$ for
which dislocations are statically persistent. Substitution of $l_0/P_0 = 1/k$ renders the
function $V_s(l_0/P)$, which defines the relationship between relative dislocation density
$l_0/P$ and the limits of the mean energy per atom beyond which the chain will dislocate
spontaneously. Hence

$$V_s(l_0/P) = W\{(2E(k) - \pi)/k^2K(k) + 1\}. \tag{10}$$




Whence it follows that the limiting value of the mean energy per atom, at which
a chain containing no dislocations will start to dislocate spontaneously, is given by

$$[V_s(l_0/P)]_{k=1} = W.$$

The limiting misfit concerned is $(1/P_0)_{k=1} = 1/l_0$.



Figure 2. Dependence of mean potential energy per atom (in units of $W$) on relative dislocation
density, $V(l_0/P)$ or $V(l_0[\bar{b}/a - 1])$, with (a) $l_0/P_0 = 0$, (b) $l_0/P_0 = 0.6$, (c) $l_0/P_0 = 1.0$,
(d) $l_0/P_0 = 1.1$, shown over region of dislocation persistence. A, $V_0(l_0/P)$, lowest energy
state; B, $V_s(l_0/P)$, spontaneous dislocation; C, $V_e(l_0/P)$, spontaneous escape.

Similarly, substitution of $l_0/P_0 = (1/k^2 - 1)^{1/2}$ in (6) gives us the corresponding
function for the spontaneous escape of dislocations

$$V_e(l_0/P) = W\{2E(k) - \pi(1 - k^2)^{1/2}\}/k^2K(k). \tag{11}$$

Again, in the limiting case of no dislocations, this is zero, as is $l_0/P_0$.
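
The relations (5) to (11) are easily tabulated once $K(k)$ and $E(k)$ are available. The sketch below is an illustration only, not the authors' procedure: it evaluates the complete elliptic integrals by the arithmetic-geometric mean and expresses all energies in units of $W$. The function names are hypothetical.

```python
import math


def ellip_ke(k):
    """Return (K(k), E(k)) for modulus 0 <= k < 1 by the arithmetic-geometric mean."""
    if not 0.0 <= k < 1.0:
        raise ValueError("modulus k must satisfy 0 <= k < 1")
    a, b, c = 1.0, math.sqrt(1.0 - k * k), k
    weight, total = 0.5, 0.5 * k * k          # running sum of 2**(n-1) * c_n**2
    for _ in range(60):
        if abs(c) < 1e-16:
            break
        a, b, c = 0.5 * (a + b), math.sqrt(a * b), 0.5 * (a - b)
        weight *= 2.0
        total += weight * c * c
    big_k = math.pi / (2.0 * a)
    return big_k, big_k * (1.0 - total)


def relative_density(k):
    """l0/P from equation (5)."""
    big_k, _ = ellip_ke(k)
    return math.pi / (2.0 * k * big_k)


def misfit_lowest_energy(k):
    """l0/P0 in the lowest energy state, equation (7)."""
    _, big_e = ellip_ke(k)
    return 2.0 * big_e / (math.pi * k)


def energy(k, misfit):
    """V/W from equation (6); misfit stands for l0/P0."""
    big_k, big_e = ellip_ke(k)
    return (2.0 * big_e / (k * k * big_k) + 1.0 - 1.0 / (k * k)
            - math.pi * misfit / (k * big_k) + misfit * misfit)


def energy_lowest(k):
    """V0/W from equation (9)."""
    _, big_e = ellip_ke(k)
    return 4.0 * big_e ** 2 / (math.pi ** 2 * k * k) + 1.0 - 1.0 / (k * k)


def energy_spontaneous_dislocation(k):
    """Vs/W from equation (10)."""
    big_k, big_e = ellip_ke(k)
    return (2.0 * big_e - math.pi) / (k * k * big_k) + 1.0


def energy_spontaneous_escape(k):
    """Ve/W from equation (11)."""
    big_k, big_e = ellip_ke(k)
    return (2.0 * big_e - math.pi * math.sqrt(1.0 - k * k)) / (k * k * big_k)
```

Stepping $k$ from just below 1 towards 0 and pairing `misfit_lowest_energy(k)` with `relative_density(k)` traces a curve of the kind labelled A in figure 1; substituting the limits of (8) into `energy` reproduces (10) and (11).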

5. Variation of mean potential energy per atom with misfit

The functional relationship of mean potential energy per atom with dislocation
density becomes clear, when one writes (6) in the form

$$V(l_0/P_0) = W\left\{\left(\frac{l_0}{P_0} - \frac{\pi}{2kK(k)}\right)^2 + \frac{2E(k)}{k^2K(k)} + 1 - \frac{1}{k^2} - \frac{\pi^2}{4k^2K(k)^2}\right\}, \tag{12}$$

which displays the parabolic relationship of $V(l_0/P_0)$, when $1/P$ and hence $k$ is fixed.
Apart from the shift of the vertex

$$\left(\frac{\pi}{2kK(k)},\ \frac{2E(k)}{k^2K(k)} + 1 - \frac{1}{k^2} - \frac{\pi^2}{4k^2K(k)^2}\right)$$

of the parabola by a change in dislocation density, it further remains unaltered, as 
shown by the graphs in figure 3, where there are also the corresponding graphs for 
the three most important states of the system. 

When $\bar{b} = b = 2a$ say, the mean energy per atom will obviously be zero. Hence
there are also ordinates of symmetry at $1/P_0 = 1, 2, \ldots$. The mathematical expressions
do not reveal this property, because of the approximation of a difference equation
by a differential equation.



Figure 3. Dependence of mean potential energy per atom (in units of $W$) on relative misfit,
$V(l_0/P_0)$ or $V(l_0[b/a - 1])$. Parabolas: (a) $l_0/P = 0$, (b) $l_0/P = 0.506$, (c) $l_0/P = 1.069$,
shown over region of dislocation persistence. A, $V_0(l_0/P_0)$, lowest energy state, quadratic
dependence up to critical point $l_0/P_0 = 2/\pi$, henceforth slow asymptotic increase to $W/2$;
B, $V_s(l_0/P_0)$, spontaneous dislocation; C, $V_e(l_0/P_0)$, spontaneous escape. Points on A, B
and C determined from corresponding curves in figures 1 and 2.

6. Applications to misfitting monolayers

Application of the foregoing to misfitting monolayers involves knowledge of the
physical parameter $l_0 = (\mu a^2/2W)^{1/2}$. An estimate of $l_0$ was already made in part I, § 8,
for a monolayer on a crystalline substrate, both consisting of atoms of the same kind 


and assuming Lennard-Jones forces between atoms. The value obtained, namely,
$l_0 \simeq 7$, depends on the rigidity of the monolayer and the binding energy between
atoms of the monolayer and substrate, and can therefore be appreciably different 
for different combinations. It will be larger when the atoms of deposit are relatively 
incompressible, and weakly bound to the substrate, smaller when they are relatively 
soft and strongly bound. 

The degree of misfit $1/P_0$ was defined for our one-dimensional model as $(b - a)/a$,
where b is the equilibrium separation of atoms in the chain and a the substrate 
potential wave-length. In our application of this theory to monolayers on crystalline 
substrates, b and a will generally be the nearest neighbour distances of atoms of the 
natural lattices of deposit and substrate in the junction plane. 

One may therefore expect that up to the critical misfit $2/\pi l_0 \simeq 9$ %, the deposited
monolayer will attain the same lattice spacing as the crystalline substrate, and will
be free from dislocations and fixed with respect to the substrate. For degrees of
misfit exceeding this critical value but less than the limiting value $1/l_0 \simeq 14$ %, the
generation of dislocations still needs an activation energy, and at sufficiently low 
temperatures monolayers can still be deposited undergoing deformation to a state 
of fit with the corresponding substrate. 
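
The two critical misfits quoted above follow directly from $l_0$. A minimal sketch of the arithmetic, assuming only the expressions $2/\pi l_0$ and $1/l_0$ given in the text (the function name is hypothetical):

```python
import math


def critical_misfits(l0):
    """Return (equilibrium critical misfit, limiting misfit) for a given l0.

    The first value, 2/(pi*l0), is the misfit above which the dislocated state
    has the lower energy; the second, 1/l0, is the misfit at which dislocations
    are generated without activation energy.
    """
    if l0 <= 0.0:
        raise ValueError("l0 must be positive")
    return 2.0 / (math.pi * l0), 1.0 / l0


if __name__ == "__main__":
    lower, upper = critical_misfits(7.0)
    print(f"critical misfit {100 * lower:.1f} %, limiting misfit {100 * upper:.1f} %")
```

With $l_0 = 7$ the two values round to the 9 % and 14 % of the text; a stiffer or more weakly bound deposit (larger $l_0$) lowers both.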

7. Oriented overgrowth

Considerations on misfitting monolayers further lead to predictions for some of 
the necessary conditions under which the orientations of thin films grown on crystal- 
line substrates are related to those of the substrates. For degrees of misfit less than 
the critical values obtained in the previous section, the first monolayer that goes 
down takes up the same lattice spacings as the substrate, contains no dislocations, 
and is therefore rigidly fixed to the substrate. If the natural lattices of deposit and 
substrate are isomorphic, the pattern of atoms in this first monolayer is identical 
with that in a plane of the natural deposit lattice, and hence the next monolayer 
can grow on to this under conditions similar to those in which the first has grown 
on to the substrate. Evidently this growth process can also take place for
non-isomorphic lattices, provided the lattice of the deposit contains planes whose atomic
configuration resembles that of the exposed surface of the substrate: this condition
can only be satisfied on specific crystallographic surfaces of the substrate. 

One may, however, expect the critical conditions at the free boundaries to change 
with film thickness, e.g. thickening will certainly cause the generation of dislocations 
at the free boundaries of an initially undislocated film, since the energy required to 
compress the thick film is much greater. Thus, a layer of two atoms thick will have 
a critical misfit of the order of $2^{-1/2}$ that of a monolayer, since $l_0$ is proportional to
$\mu^{1/2}$ (taking the elastic constant for a double layer to be twice that of a monolayer).
On the other hand, once the first monolayer is completed it does not contain ‘free
boundaries on a flat substrate’, since the film will also be completed around corners
and edges and hence the boundaries will be pinned to the substrate. Its roughness 
further stabilizes the hold of the substrate on the film. A stable oriented film, pseudo- 
morphic with the substrate, can therefore grow in this way, the essential condition 



being that the first monolayer initially fits on to the substrate, has a similar atomic
configuration to a plane in the natural deposit lattice, and covers any flat region of 
the surface completely. 

The large strains permissible in the thin films cannot, of course, persist into layers 
of indefinite thickness, a fact well established by experiments showing that pseudo- 
morphic growth is no longer observed in sufficiently thick films. It will obviously 
be impossible to grow macroscopically thick layers with more than, say, 0.1 % of
strain, corresponding to the yield stress of the bulk material. Thickening of films is 
therefore accompanied by processes which allow the bulk of thick deposits to be 
free from the large strains of the initial thin films. This transition from thin to thick
films is worth more detailed theoretical and experimental investigation. 

The general problem of oriented overgrowth on various surfaces of a substrate 
non-isomorphic with the deposit is exceedingly complicated, but from similar 
theoretical considerations to those above, one may anticipate that a two-degree 
orientation can exist when there is a similarity in spacing in one row of closely packed 
atoms in each lattice, as remarked by Wilman (1940). 

We have already pointed out that one and the same critical misfit is not to be 
expected for all sorts of substances on all sorts of substrates, but various critical
misfits for various classes of case, viz. large critical misfit when there is particularly 
strong binding between deposit and substrate and low rigidity of deposit, and conversely.
For example, the large critical misfit characteristic of the oxides and iodides
can be partly accounted for by the greater deformability of oxide and iodide ions 
respectively. 

These theoretical considerations are in good general agreement with the experi- 
mental observations which are summarized by van der Merwe (1949). 

We should like to thank F. R. N. Nabarro for discussion and constructive
criticism. 

References

van der Merwe, J. H. 1949 To be submitted to the Faraday Society Discussion on Crystal
Growth, April 1949.

Wilman, H. 1940 Proc. Phys. Soc. 52, 323.



The rate of evaporation of droplets. 

II. The influence of changes of temperature and of the 
surrounding gas on the rate of evaporation of 
drops of di-w-butyl phthalate 

By J. Birks and R. S. Bradley

Department of Inorganic and Physical Chemistry, University of Leeds
(Communicated by M. G. Evans, F.R.S. Received 28 December 1948)


The vapour pressure of di-n-butyl phthalate has been determined by Knudsen’s method at
15 to 40° C and is given by $\log_{10}p$ (in microns) $= -4790/T + 14.502$. The latent heat of
evaporation is 21,910 cal. per mole. Rates of effusion through a small hole in the presence
of pressures of air 2 to 0.01 cm. have also been determined and shown to conform to the
equation rate of loss of mass $= A/(B + P)$, where $A$ and $B$ are constants. The rate of evaporation
of drops of di-n-butyl phthalate about 0.5 mm. in radius has been determined at 15 to
40° C, in air at pressures 20 to 0.01 cm. and in hydrogen at 19.90° C and Freon 12 (CCl₂F₂)
at 19.90 and 30.00° C and has been accounted for theoretically. Within the experimental
error the evaporation coefficient is unity at all temperatures. The diffusion coefficient $D$ in
air at 76 cm. varies from 0.0341 to 0.0473 for a temperature variation 15 to 40° C, $D$ being
proportional to a power of $T$. The diffusion coefficient in H₂ at 19.90° C is 0.163 for 76 cm., and for
diffusion in Freon 12 at 76 cm. is 0.0126 at 19.90° C and 0.0140 at 30.00° C. The collision
radius of di-n-butyl phthalate is 4.46 Å as determined from the experiments in air at 19.90° C
and the sum of the radii of di-n-butyl phthalate and air molecules is proportional to a power of $T$.
From the experiments in hydrogen the collision radius of di-n-butyl phthalate is 4.68 Å at
19.90° C and from the experiments using Freon 12, 6.24 Å at 19.90° C. The shape of the drops
has been studied, and errors introduced by assuming a spherical shape are shown to be
negligible.


Introduction

In part I (Bradley, Evans & Whytlaw-Gray 1946) experiments on the rate of
evaporation of drops of di-n-butyl phthalate at 20° C were described, and were
shown to be adequately accounted for theoretically. This work has been extended
to cover the temperature range 15 to 40° C, and, in addition to air, hydrogen and
Freon (CCl₂F₂) have also been used as the diffusion medium. In a manner similar to
that described in part I the vapour pressures of di-n-butyl phthalate were also
determined over the same temperature range, with the results described in the next
section.


The vapour pressure of di-n-butyl phthalate at 15 to 40° C

The di-n-butyl phthalate was shaken with 10 % sodium carbonate solution to
remove phthalic acid, and after washing and drying was vacuum-distilled at 215° C;
it was then subjected to molecular distillation at less than 10~®cm. of mercury 
at 90° C. The middle fraction, which was used for evaporation, had a refractive index
$n_D$ 1.49292 at 20° C and densities given by the equation

$$\rho = 1.06199 - 8.15\times10^{-4}\,t',$$

where $t'$ is the centigrade temperature. Other materials used, such as mercury and
benzophenone, were also purified.

A differential Knudsen method was used, as described in part I, the loss in weight
of tubes containing the liquid, and closed by platinum caps pierced by small holes,
being determined over periods of from 5 to 10 hr. In this work and in the droplet
evaporation work described later the air thermostat used in part I was replaced by
a water thermostat constant to 0.01° C, and fitted with a removable plate-glass
front to facilitate manipulations. The tubes were calibrated using mercury and
benzophenone, the vapour pressure of which is given by Neumann & Völker
(1932): stable solid benzophenone $\log_{10}p = -4966/T + 17.46$, liquid benzophenone
$\log_{10}p = -4087/T + 14.75$, $p$ in microns of mercury. An additional check was
provided by comparing the results with liquid and solid benzophenone, good agreement
between the calibration constants being obtained. The absence of self-cooling
was revealed by the agreement between the results for hole sizes 0.0096 to 0.0185 cm.²
in area; the diameter of the hole was less than the mean free path of vapour molecules,
a necessary condition for Knudsen’s formula to apply. Results are given in table 1.


Table 1. Vapour pressure of di-n-butyl phthalate, using tubes

temp. (°C)    p × 10⁵ (mm.)    p × 10⁵ (mm.), smoothed value
19.90         1.43             1.42
23.00         2.06             2.11
25.00         2.7?             2.72
30.00         5.0              5.00
35.00         8.85             9.00
40.00         15.8             15.96
43.00         22.0             22.3
43.50         22.2             23.6
44.00         26.1             25.0


The smoothed values were taken from the equation

$$\log_{10}p = -4790/T + 14.502 \quad (p\ \text{in microns of mercury}), \tag{1}$$

which gave a latent heat of evaporation of 21,910 cal. ± 5 % per mole.
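
Equation (1) and the latent heat derived from it can be checked against the smoothed column of table 1. The sketch below is an illustration under stated assumptions: the ice point is taken as 273.1° K and the gas constant as 1.987 cal. per mole per degree, neither of which is stated in the text, and the function names are hypothetical.

```python
import math

GAS_CONSTANT_CAL = 1.987     # cal/(mole K); assumed value, not given in the text
ICE_POINT_K = 273.1          # assumed conversion from centigrade to absolute temperature


def vapour_pressure_mm(t_centigrade):
    """Vapour pressure of di-n-butyl phthalate in mm. of mercury from equation (1)."""
    T = t_centigrade + ICE_POINT_K
    p_microns = 10.0 ** (-4790.0 / T + 14.502)
    return p_microns * 1.0e-3            # 1 micron of mercury = 1e-3 mm.


def latent_heat_cal_per_mole(slope=4790.0):
    """Latent heat implied by the slope of log10(p) against 1/T."""
    return math.log(10.0) * GAS_CONSTANT_CAL * slope


if __name__ == "__main__":
    for t in (19.90, 25.00, 30.00, 40.00):
        print(f"{t:5.2f} C  p = {vapour_pressure_mm(t) * 1e5:.2f} x 10^-5 mm.")
    print(f"latent heat = {latent_heat_cal_per_mole():.0f} cal. per mole")
```

The latent heat follows from the Clausius-Clapeyron form $L = 2.303\,R \times 4790$; small differences from the tabulated smoothed values reflect the assumed ice point and rounding.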

The disadvantage of the method described, using a micro-analytical balance, is 
that it is a slow process, and errors are introduced in timing, since the time when 
effusion starts on reducing the pressure is uncertain. These errors are reduced by 
using long times, but weighing errors may arise from condensation of moisture, etc. 
It was therefore decided to use the quartz microbalance to weigh the effusion 
apparatus, so that the weight could be continuously recorded over short periods. 
This necessitated a new design for the effusion vessel, which had to be light enough
to be carried by the quartz microbalance. 

Pyrex tubing was drawn into a capillary, and a bulb 1 cm. in diameter was blown 
at the end and two small holes on opposite sides of the bulb were then blown by
means of point flames. This effusion pot carrying 0-02 g. of liquid was hung from the 
quartz microbalance in the neighbourhood of a suitable absorbent, and deflexion- 
time readings were taken with a hard vacuum in the apparatus. The liquid was 




injected into the pot by means of micro-syringe, care being taken not to wet the sides 
of the holes. 

Some difficulty was encountered over the choice of absorbent, tricresyl phosphate
being finally used. It will be recalled that in part I charcoal gave good results for 
the absorption of vapours from droplets down to air pressures of about 0-1 mm. of 
mercury. It was not found possible, however, to obtain a hard vacuum round a 
hanging pot with charcoal as absorbent, as was revealed by low effusion rates. This
is not surprising considering that the charcoal cage had to be assembled round the
hanging pot in the presence of moist air from the empty thermostat. The best results
with charcoal were obtained by successive introduction and removal of small
quantities of air. Similar difficulties were encountered using silica gel. A surface near
the hanging pot and cooled by liquid air was also used, but the cooling had to be
started when the system was evacuated, to prevent cooling the pot, and this cooling
set up oscillations in the balance. No doubt these could have been removed by electromagnetic
damping, but this was considered an unnecessary elaboration, especially
as the hanging pot showed signs of cooling, i.e. lower rates of effusion, even in the
hardest vacuum. With liquid air around an external trap the initial effusion rates 
were in agreement with the results using tricresyl phosphate, but decreased with 
time, owing possibly to build up of vapour. 

A pool of degassed tricresyl phosphate gave reproducible results. This liquid has 
an extremely low vapour pressure, since a drop hung on the microbalance and 
subjected to a hard vacuum for 3 hr. at 20° C gave no measurable deflexion, i.e. the 
vapour pressure was less than 2 x 10"® mm. of mercury. 

The difficulty with hanging pots, which is almost absent with the tubes embedded 
in metal, is the self-cooling. Since the vacuum round the pots was hard, effusion 
resulted in continuous cooling (contrast the self-cooling of drops discussed later) 
and runs were therefore made as short as possible. If $Q$ is the rate of loss of mass,
in g./sec., $t$ the time in sec., $S_1$ the specific heat of the substance studied and $W_1$ its
weight, $S_2$ and $W_2$ the corresponding quantities for Pyrex glass, $L/M$ the latent heat
of evaporation per g., $M$ the molecular weight, $\Delta T$ the self-cooling, then

$$\Delta T = \frac{L}{M}\,\frac{tQ}{S_1W_1 + S_2W_2}.$$

With di-n-butyl phthalate this gives self-cooling corresponding to a decrease in rate
of less than 3 %, but with benzophenone errors of 8 % are possible; this implies that 
the vapour pressures in the most unfavourable case are in error to the extent of 
5 %. However, no allowance is made in this calculation for conduction and radiation. 
Results are given in table 2. 
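
The self-cooling estimate is a simple heat balance: the latent heat carried off in time $t$ is shared between the liquid and the glass pot, with conduction and radiation ignored as in the text. A minimal sketch follows; the function name is hypothetical, and every number in the usage example other than the latent heat (21,910 cal. per mole) and the 0.02 g. charge of liquid is an assumed placeholder, not a value from the paper.

```python
def self_cooling(latent_heat_per_mole, molecular_weight, loss_rate, time,
                 s_liquid, w_liquid, s_glass, w_glass):
    """Temperature drop (degrees) of pot plus liquid after effusion for `time` sec.

    latent_heat_per_mole : L, cal. per mole
    molecular_weight     : M, g. per mole
    loss_rate            : Q, g./sec.
    s_*, w_*             : specific heats (cal./g./deg.) and weights (g.)
    """
    heat_capacity = s_liquid * w_liquid + s_glass * w_glass
    if heat_capacity <= 0.0:
        raise ValueError("total heat capacity must be positive")
    return (latent_heat_per_mole / molecular_weight) * loss_rate * time / heat_capacity


if __name__ == "__main__":
    # Placeholder inputs: loss rate, run time, specific heats and glass weight are assumptions.
    drop = self_cooling(21910.0, 278.3, loss_rate=1.0e-8, time=600.0,
                        s_liquid=0.4, w_liquid=0.02, s_glass=0.2, w_glass=0.3)
    print(f"estimated self-cooling: {drop:.4f} deg.")
```

Since the vapour pressure varies steeply with temperature, even a small $\Delta T$ translates into a percentage error in the effusion rate, which is why short runs were preferred.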

Owing to self-cooling the hanging pot is less reliable than the tube at high 
temperature, but the reverse is true of the lower temperatures. The final equation 
for vapour pressure is that given above. 

Our results for the vapour pressure and latent heat of di-n-butyl phthalate may
be compared with the published values given in table 3. The results for di-n-butyl
phthalate published in part I ($3.05\times10^{-5}$ mm. at 20° C) are high, possibly owing to
impurity; we believe equation (1) to be accurate to 6 %.



Table 2. Vapour pressures of di-n-butyl phthalate in 10⁻⁵ mm. of mercury

temp. (°C)    pot 4, hole area    pot 5, hole area    differential Knudsen
              0.751 mm.²          0.48 mm.²           method (smoothed)
15.00         0.73                0.75                0.75 (extrapolated)
19.90         1.31                1.38                1.42
25.00         2.46                2.59                2.72
30.00         4.84                4.91                5.00
35.00         8.7                 8.53                9.00
40.00         -                   14.4                16.00

Table 3. Published vapour pressures of di-n-butyl phthalate

author                       vapour pressure   temp.     latent heat of   method
                             (mm. × 10⁵)       (°C)      evaporation
Zabel (1933)                 4.2               23 ± 2    -                ionization gauge
Woodland & Mack (1933)       0.31              25        -                Knudsen effusion
Hickman (1936)               1.9               26        -                ionization gauge
Hickman (1937)               3.27              25        21,400           extrapolated from 50 to 100° C using hanging disk
Verhoek & Marshall (1939)    1.29              25        23,400           do.
Burrows (1946)               1.9               25        -                extrapolated from 150° C
Kapff & Jacobs (1947)        1.0               25        25,500           molecular dew apparatus
this paper                   2.72              25        21,910           Knudsen effusion


When the solid absorbents were used some concern was felt as to the attainment
of high vacuum in the neighbourhood of the hanging pot. A study was therefore made
on the influence of air pressure on effusion rate, in order to determine whether the
rates of effusion in a hard vacuum obtained by extrapolation from high pressures
agreed with the experimental hard-vacuum rates. So far as we know the influence
of air pressure has not been previously studied, and because of their intrinsic interest 
our results are briefly reported in the next section, although they were not used in 
calculations of the vapour pressure. 


Rates of effusion from a hanging pot in the presence of air

Two pots were used, of hole areas 2.11 mm.² (pot 2) and 0.761 mm.² (pot 4) and
rates of effusion studied at air pressures of 2 to 0.01 cm., using di-n-butyl phthalate
and benzophenone. In all cases the rate of loss of mass $Q$ was found to be expressed
by an equation of the form

$$Q = \frac{A}{P(B/P + 1)}, \tag{2}$$

the vacuum rate of evaporation being $A/B$.

Table 4 gives the results with tricresyl phosphate as absorbent, using pot 2, for 
solid benzophenone at 19-9° C. Each value of Q was itself derived from a number of 
points on the line for deflexion of microbalance versus time. By plotting $Q_0/Q$
versus $1/P$ (where $Q_0$ is the rate $A/P$, derived from the first part of the $Q$ versus $1/P$
curve, which is nearly linear) the constant $B$ is easily derived, giving

$$Q = \frac{0.16\times10^{-6}}{P(0.0085/P + 1)}\ \text{g.min.}^{-1} \quad (P\ \text{in cm. of mercury}).$$


Table 4. Effusion of benzophenone at a range of air pressures

P (cm. Hg)    Q × 10⁶ g.min.⁻¹    Q × 10⁶ g.min.⁻¹ (calc.)
2.120*        0.0775              0.0753
1.365         0.125               0.116
1.002*        0.151               0.158
0.601         0.270               0.263
0.564*        0.265               0.280
0.288*        0.514               0.540
0.282         0.516               0.551
0.0740        1.95                1.94
0.0507        2.66                2.70
0.0337        3.81                3.80
0.0226        5.16                5.15
0             -                   18.8

* Charcoal as absorbent.
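
The calculated column of table 4 can be regenerated from the fitted constants quoted for pot 2 and solid benzophenone ($A = 0.16\times10^{-6}$, $B = 0.0085$ with $P$ in cm. of mercury). The following minimal sketch uses a hypothetical function name and works in units of $10^{-6}$ g.min.⁻¹.

```python
def effusion_rate(pressure_cm, a=0.16, b=0.0085):
    """Rate of loss of mass, in units of 1e-6 g./min., at air pressure P (cm. Hg).

    Uses Q = A / (P (B/P + 1)) = A / (P + B), so that the vacuum rate (P = 0) is A / B.
    Default constants are those quoted for pot 2 with solid benzophenone at 19.9 C.
    """
    if pressure_cm < 0.0:
        raise ValueError("pressure must be non-negative")
    return a / (pressure_cm + b)


if __name__ == "__main__":
    for p in (2.120, 1.365, 1.002, 0.601, 0.0740, 0.0226, 0.0):
        print(f"P = {p:6.4f} cm.   Q = {effusion_rate(p):.3f} x 10^-6 g./min.")
```

Writing the law as $A/(P+B)$ makes the two limits plain: at high pressure $Q \simeq A/P$, which gives the linear part of the $Q$ versus $1/P$ curve, while at $P = 0$ the rate tends to the vacuum value $A/B$.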


It should be noted that there is no disadvantage to the use of charcoal as absorbent 
in this region of air pressure; indeed, it functions as well as for the droplet runs. 
For pot 2, using di-n-butyl phthalate at 19.9° C,

$$Q = \frac{8.15\times10^{-9}}{P(0.0080/P + 1)}\ \text{g.min.}^{-1} \quad (P\ \text{in cm. of mercury}).$$

For pot 4 the results were: 

Sohd benzophenone Q = ^lou^P+i) (P in cm. of mercury). 


Di-n-butyl phthalate Q - g.min.-i (P in cm. of mercury). 

These equations enable us to calculate the rate of evaporation in vacuo, whence
the vapour pressure may be derived; for pot 2 the vapour pressure of di-n-butyl
phthalate was $1.62\times10^{-5}$ mm., for pot 4, $1.67\times10^{-5}$ mm., in reasonable agreement
with the results quoted in the previous section, $1.42\times10^{-5}$ mm. at 19.9° C.
Results suggested that conical diffusion from the hole was occurring, the vapour
spreading out fan-wise. An exact treatment is intractable, but an approximate
analysis on lines similar to those used in part I (where, however, the treatment was
exact) is given below.

Suppose that the concentration of vapour in the pot with two holes of radius $a$
is $c_0$, and that $c_1$ is the concentration of vapour just outside the hole. Let the half-angle
of the cone be $a/r$. The rate of loss of mass by conical diffusion from the region
just outside the hole is given by

$$-\frac{dm}{dt} = \frac{\pi a^2 m_1 D c_1}{r(1 - r/r_1)},$$

where $D$ is the diffusion coefficient at the air pressure used, $m_1$ is the mass of diffusing
molecule, and $r_1$ the distance from hole to absorbent. The rate of transfer of mass
across the hole is

$$-\frac{dm}{dt} = \pi a^2 m_1 \nu (c_0 - c_1),$$

where $\nu = (kT/2\pi m_1)^{1/2}$. Equating these values gives the value of $c_1$, whence for two holes,

$$-\frac{dm}{dt} = \frac{2\pi a D m_1 c_0 (a/r)}{(D/a\nu)(a/r) + (1 - r/r_1)},$$

or for $r_1 \gg r$

$$-\frac{dm}{dt} = \frac{2\pi a D m_1 c_0 (a/r)}{(D/a\nu)(a/r) + 1}. \tag{3}$$

Results suggest that $a/r = 0.61$.

Although equation (2) is semi-empirical, it is supported by the following calculations.
Equation (3) may be written

$$-\frac{dm}{dt} = \frac{2\pi a m_1 K Y p_0}{kTP[KY/(a\nu P) + 1]},$$

where $K = a/r$, $p_0$ is the saturation vapour pressure, $P$ the air pressure and $Y = DP$.
When $P$ is large

$$-\frac{dm}{dt} = Q = \frac{2\pi a m_1 K Y p_0}{kTP},$$

and hence $KYp_0$ can be calculated from the slope of $-dm/dt$ versus $1/P$. Moreover,
$KY$ can be calculated from the $B$ term of equation (2), and hence $p_0$ can be calculated.
This gives for 19.9° C, pot 2, di-n-butyl phthalate $1.42\times10^{-5}$ mm., solid benzophenone
$3.22\times10^{-4}$ mm.; pot 4, di-n-butyl phthalate $1.64\times10^{-5}$ mm., solid benzophenone
$3.0\times10^{-4}$ mm. These values are in reasonable agreement with those given
in the previous section, Neumann & Völker’s (1932) value for benzophenone at
19.9° C being $3.26\times10^{-4}$ mm. Moreover, the ratio of the diffusion coefficients for
di-n-butyl phthalate and benzophenone is the ratio of the $KY$ values, $K$ being
a constant, viz. 1.78/2.27 = 0.785, which gives a reasonable value of the diffusion
coefficient of benzophenone, viz. 0.049 at 1 atm., using the value for di-n-butyl
phthalate reported later (0.0386). The agreement with the ratios of diffusion coefficients
as calculated from the high-pressure slope constant $A$ of equation (2) is, however,
not satisfactory for pot 4, pot 2 giving $D$(di-n-butyl phthalate)/$D$(benzophenone)
= 0.765 and pot 4, 0.95.

The general agreement shows that equation (3) adequately represents the rate of 
effusion in air, and gives extrapolated vacuum rates in agreement with experiment. 
It should be noticed that even when the air pressure is approximately 1000 times 
the vapour pressure the rate of effusion has reached approximately 20 % of the 
vacuum rate (cf. table 4). 
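As a rough numerical illustration (a sketch, not part of the original paper), equation (3) can be recast as the fraction of the hard-vacuum effusion rate that survives at air pressure P. Reading (3) as $2\pi a m_2 KYp_0/\{kTP[KY/(a\nu P)+1]\}$ and dividing by the vacuum rate for two holes, $2\pi a^2 m_2 \nu p_0/kT$, gives $x/(1+x)$ with $x = KY/(a\nu P)$. The function name and the numbers in the usage lines are hypothetical placeholders, not values from the paper.

```python
def effusion_fraction(P, a, nu, K, Y):
    """Rate in air at pressure P divided by the hard-vacuum rate.

    Assumes equation (3) in the form described in the lead-in.
    P and Y must use the same pressure unit (Y = D * P);
    a is the hole radius in cm and nu = sqrt(kT / (2 pi m2)) in cm/s.
    """
    x = K * Y / (a * nu * P)  # dimensionless group KY/(a nu P)
    return x / (1.0 + x)


# Hypothetical inputs, chosen only to show the two limiting regimes.
a, nu, K, Y = 0.05, 4000.0, 0.61, 3.0
for P in (1e-6, 1e-2, 1.0, 76.0):
    print(P, effusion_fraction(P, a, nu, K, Y))
```

At very low P the fraction tends to 1 (vacuum rate); at high P it falls off as 1/P, which is the regime from which the paper extracts $KYp_0$.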





J. Birks and R. S. Bradley 


The rate of evaporation of droplets of di-n-butyl phthalate in air

The apparatus was similar to that described in part I. Drops of radius approximately 0.45 mm. were hung from a microbalance of sensitivity 34.5 μg./mm. deflexion. The microbalance was calibrated by means of aluminium wire and also by the buoyancy method using quartz bulbs, with good agreement.

Preliminary results showed that the original sample used in this research gave higher rates of evaporation at the higher pressures and lower rates at the lower pressures than the sample purified by molecular distillation. The $-ds/dt$ versus $1/P$ curve for the pure sample cut the curve given in part I, implying that the sample used in part I was impure. It appeared that the evaporation coefficient α was higher than the value previously published, and in order to check this point a direct determination of the vacuum rate was made.

A drop cannot be used for this purpose, owing to self-cooling in a hard vacuum. Hence a tube of radius 0.85 mm. and length 1 cm. was filled to the brim with di-n-butyl phthalate and the rate of evaporation in a hard vacuum was determined, using charcoal as absorbent. The liquid surface was almost flat. If $A_1$ = area of surface,

$$\alpha = -\left(\frac{dm}{dt}\right)\frac{1}{A_1 p_0}\left(\frac{2\pi kT}{m_2}\right)^{1/2},$$

where $p_0$ is the saturation vapour pressure, $m_2$ is the mass of evaporating molecule and $m$ the mass of the drop. This gave α = 0.69, i.e. greater than the value given in part I, viz. 0.28. This work was not pursued owing to the difficulty in assessing the liquid surface and errors due to wetting the glass; in the light of our more recent work the use of tricresyl phosphate rather than charcoal would have improved the method; however, the method was sufficiently accurate to show that α is near unity.

The experiments on droplets gave results of the same general type as those reported in part I. The rate of change of surface ($s$) with time ($t$) is given by

$$-\frac{ds}{dt} = \frac{8\pi m_2 Y c_0}{\rho P\left[\dfrac{Y}{P a \nu \alpha} + \dfrac{a}{a+\Delta}\right]}, \qquad (5)$$

where $Y = DP$, ρ is the density of liquid, $c_0$ the saturation concentration of vapour in molecules per ml., $D$ the diffusion coefficient at pressure $P$, Δ a magnitude defined in part I, $\nu = [kT/(2\pi m_2)]^{1/2}$, and $a$ is the drop radius. A typical example is given for 19.9° C in table 5. Each value of $ds/dt$ was itself obtained from approximately ten points; $q_0 = 8\pi m_2 D c_0/\rho$.

No significant difference was obtained on repeating runs, or on replacing the absorbent by silica gel, activated alumina or Apiezon oil. This suggests that the effective pressure of di-n-butyl phthalate on the surface of the absorbent is zero. The results of table 5 may be represented by the equation

$$-\frac{ds}{dt} = \frac{0.920 \times 10^{-6}}{P\left[\dfrac{7.65 \times 10^{-4}}{aP} + \dfrac{1}{1 + 2.42 \times 10^{-4}/(aP)}\right]}\ \text{cm.}^2\ \text{min.}^{-1}$$

($a$ in cm., $P$ in cm. of mercury).
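The fitted equation above is easy to evaluate numerically. The following is a minimal sketch (not from the original paper) that reproduces the "q (calc.)" and "q₀" columns of table 5 from the three fitted constants; the function names are illustrative, and the powers of ten on the constants are as read from the fitted equation.

```python
def q_calc(P, a, E=0.920e-6, F=7.65e-4, G=2.42e-4):
    """-ds/dt in cm^2/min from the fitted equation.

    P is the air pressure in cm of mercury, a the drop radius in cm.
    E, F, G are the fitted constants for 19.9 deg C.
    """
    ap = a * P
    return E / (P * (F / ap + 1.0 / (1.0 + G / ap)))


def q_langmuir(P, E=0.920e-6):
    """High-pressure (Langmuir) limit q0 = E / P, in cm^2/min."""
    return E / P


# Two rows of table 5: (P in cm Hg, mean radius a in cm)
for P, a in ((8.456, 0.0491), (0.0150, 0.0467)):
    print(P, a, q_langmuir(P) * 1e6, q_calc(P, a) * 1e6)
```

At the highest pressure the two rates coincide to three figures, while at 0.0150 cm of mercury the calculated rate is roughly half the Langmuir value, which is the departure the evaporation coefficient is extracted from.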





The rate of evaporation of droplets, II 

Table 5. Dibutyl phthalate in air at 19.9° C. Activated charcoal absorbent

P           a (mean)    q (obs.) × 10⁶     q₀ × 10⁶           q₀/q − a/(a+Δ)    q (calc.) × 10⁶
(cm. Hg)    (cm.)       (cm.² min.⁻¹)      (cm.² min.⁻¹)                        (cm.² min.⁻¹)

8.456       0.0491      0.103              0.109              0.06              0.109
3.812       0.0486      0.293              0.241              −0.18             0.241
2.139       0.0455      0.444              0.431              −0.03             0.431
2.101       0.0481      0.398              0.438              0.10              0.438
1.163       0.0480      0.815              0.791              −0.03             0.791
0.989       0.0469      0.941              0.930              −0.01             0.930
0.656       0.0478      1.30               1.40               0.08              1.38
0.495       0.0451      1.80               1.86               0.03              1.82
0.421       0.0475      2.06               2.19               0.06              2.14
0.115       0.0470      7.20               8.00               0.157             7.30
0.0746      0.0445      11.4               12.3               0.148             10.6
0.0509      0.0461      15.3               18.1               0.276             14.7
0.0502      0.0432      14.9               18.3               0.326             14.6
0.0419      0.0438      17.3               22.0               0.386             16.9
0.0352      0.0450      19.6               26.1               0.465             19.3
0.0266      0.0437      24.5               36.0               0.643             23.9
0.0201      0.0440      29.3               46.8               0.774             27.8
0.0198      0.0449      28.7               46.5               0.833             28.3
0.0150      0.0467      32.6               61.4               1.146             33.5
0.0137      0.0422      32.6               67.2               1.342             33.2


Similar results were obtained over the temperature range 14 to 40° C. If

$$-\frac{ds}{dt} = \frac{E}{P\left[\dfrac{F}{aP} + \dfrac{1}{1 + G/(aP)}\right]}\ \text{cm.}^2\ \text{min.}^{-1}\quad (a \text{ in cm., } P \text{ in cm. of mercury}), \qquad (6)$$

the values of $E$, $F$ and $G$ are given in table 6. ($G$ was calculated from the diffusion coefficient obtained from $E$ in the manner described later. At 40° C the rates were too fast to determine $F$ accurately.)


Table 6. Rates of evaporation of drops of di-n-butyl phthalate (charcoal absorbent)

temp.     E × 10⁶    F × 10⁴    G × 10⁴    diffusion coefficient    S₁₂ × 10⁸ cm.
(°C)                                       at 76 cm. Hg

15.00     0.433      7.8        2.15       0.034                    6.40
19.90     0.92       7.85       2.42       0.0386                   6.32
25.00     1.86       8.15       2.57       0.0415                   6.24
30.00     3.55       8.55       2.62       0.0426                   6.17
35.00     6.40       8.62       2.70       0.0442                   6.08
40.00     12.0       –          2.86       0.0473                   6.01


The above values were calculated after a small self-cooling correction had been applied as follows. If α′ is the thermal accommodation coefficient (allowing for internal degrees of freedom), it is readily shown that the heat transfer per second to the drop is

$$4\pi a^2 \alpha' P\,(T_1 - T_0)\left(\frac{2k}{\pi m_1 T_1}\right)^{1/2},$$

where $P$ is the air pressure, $m_1$ the mass of an 'air molecule', $T_1$ the temperature of the air and $T_0$ the temperature of the drop. If all other heat flows are neglected, this will be balanced by the rate of loss of heat by evaporation, once the drop has attained the steady state, i.e.

$$-L\frac{dm}{dt} = 4\pi a^2 \alpha' P\,(T_1 - T_0)\left(\frac{2k}{\pi m_1 T_1}\right)^{1/2},$$

or

$$\delta T = T_1 - T_0 = \frac{L\rho q}{8 a P \alpha'}\left(\frac{m_1 T_1}{2\pi k}\right)^{1/2},$$

where $q = -\dfrac{2}{a\rho}\dfrac{dm}{dt}$, $L$ being the latent heat of evaporation per unit mass. Taking α′ = 1 for a liquid surface,

δT = 2.18 q/(aP) at 25.00° C and δT = 2.19 q/(aP) at 30.00° C.

The largest corrections are of the order of 6 % decrease in observed evaporation rate (for the lowest pressures), and scarcely affect the results.
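The numerical coefficients 2.18 and 2.19 can be checked with a short calculation. The sketch below is not from the paper: it assumes the free-molecule heat balance $\delta T = (L\rho q/8aP\alpha')(m_1T/2\pi k)^{1/2}$ with L per gram, q in cm.² min.⁻¹ and P in cm. of mercury. The liquid density (1.043 g/cm³), the molar mass of air (28.96) and of di-n-butyl phthalate (278.34) are assumed values, not taken from the text; L = 21,900 cal/mole is the value quoted later in the paper.

```python
import math

K_B = 1.380649e-16        # Boltzmann constant, erg/K
N_A = 6.02214076e23       # Avogadro number, 1/mol
ERG_PER_CAL = 4.184e7     # erg per calorie
DYN_PER_CMHG = 13332.2    # dyn cm^-2 per cm of mercury


def self_cooling_coefficient(T, L_molar=21900.0, M2=278.34, rho=1.043,
                             M1=28.96, alpha_t=1.0):
    """Coefficient C in delta_T = C * q / (a * P).

    q in cm^2/min, a in cm, P in cm of mercury, T in kelvin.
    Defaults for rho, M1 and M2 are assumptions (see lead-in).
    """
    L_gram = L_molar * ERG_PER_CAL / M2            # erg per gram
    m1 = M1 / N_A                                  # mass of an 'air molecule', g
    cgs = L_gram * rho / (8.0 * alpha_t) * math.sqrt(m1 * T / (2.0 * math.pi * K_B))
    # convert q from per-second to per-minute and P from dyn/cm^2 to cm Hg
    return cgs / (60.0 * DYN_PER_CMHG)


print(self_cooling_coefficient(298.15))
print(self_cooling_coefficient(303.15))
```

Under these assumptions the coefficient comes out close to the quoted 2.18 and 2.19, which is some reassurance that the balance has the form stated.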

From the values of $E$ and the vapour pressures the diffusion coefficients may be calculated, since $E = 8\pi m_2 Y c_0/\rho$. These are listed in table 6; the value of $D$ at 19.9° C is slightly greater than that reported in part I. It will be observed that $D$ increases as $T$ increases. This implies that the collision radius $S_{12}$ decreases as $T$ increases owing to the increased kinetic energy of impact. The collision radius $S_{12}$, i.e. the sum of the radii of gas molecule and di-n-butyl phthalate, has been calculated from the equation

$$D = \frac{2}{3(1+\alpha_{12})\,\pi n S_{12}^2}\left[\frac{2kT(m_1+m_2)}{\pi m_1 m_2}\right]^{1/2},$$

where $n$ is the number of molecules per cm.³, $m_1$ and $m_2$ the masses of 'air molecules' and evaporating molecules (cf. Jeans 1940), taking $\alpha_{12} = \tfrac{1}{3}$, and is listed in table 6. When $\log D$ is plotted against $\log T$ a straight line is obtained, from the slope of which it appears that $D$ is proportional to $T^3$, likewise $S_{12}$ to $T^{-3/4}$. This temperature dependence for $D$ is rather greater than would be expected theoretically, simple kinetic theory giving $D \propto T^{3/2}$ at constant pressure, while for molecules repelling according to the inverse νth power of the distance, $D \propto T^{3/2 + 2/(\nu-1)}$ (cf. Chapman & Cowling 1939, p. 248). Sutherland's formula gives $D = D_0 T^{3/2}/(1 + C/T)$ at constant pressure, where $C$ is a constant, or $D = D_0 T^{5/2}/(T + C)$. Thus the maximum power of $T$ in the empirical expression $D = D_0 T^x$ is 5/2, obtaining when $C \gg T$, and the minimum is 3/2 (for $C \ll T$). When the force between two molecules contains an attractive and repulsive term, and is of the form $\lambda/r^{\beta} - \mu/r^{\gamma}$, where the repulsive exponent β is > γ, the attractive exponent, and λ and μ are constants, Sutherland's equation becomes

$$D = \frac{D_0\,T^{3/2 + 2/(\beta-1)}}{1 + C\,T^{-(\beta-\gamma)/(\beta-1)}}$$

(cf. Chapman & Cowling, p. 181), and hence, if γ = 7,

$$D = \frac{D_0\,T^{3/2 + (\beta-5)/(\beta-1)}}{C + T^{(\beta-7)/(\beta-1)}}.$$

It is clear that the maximum value of $x$ in the expression $D = D_0 T^x$ will occur when $C \gg T^{(\beta-7)/(\beta-1)}$ and β ≫ 7, and will be 5/2. Hence allowance for attractive forces will not account for the observed value, $x = 3$.
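The link between the collision radius and the diffusion coefficient can be checked numerically. The sketch below (not part of the original paper) evaluates the Jeans-type expression $D = \dfrac{2}{3(1+\alpha_{12})\pi n S_{12}^2}\left[\dfrac{2kT(m_1+m_2)}{\pi m_1 m_2}\right]^{1/2}$ with $\alpha_{12} = 1/3$, and fits the exponent x in $D = D_0T^x$ to the table 6 values by least squares. The molar masses (28.96 for air, 278.34 for di-n-butyl phthalate) and the conversion 0° C = 273.15 K are assumptions.

```python
import math

K_B = 1.380649e-16      # erg/K
N_A = 6.02214076e23     # 1/mol
ATM = 1.01325e6         # 76 cm of mercury in dyn/cm^2


def diffusion_coefficient(S12, T, M1=28.96, M2=278.34, alpha12=1.0 / 3.0, P=ATM):
    """D in cm^2/s from collision radius S12 (cm) at temperature T (K)."""
    n = P / (K_B * T)                       # molecules per cm^3
    m1, m2 = M1 / N_A, M2 / N_A
    root = math.sqrt(2.0 * K_B * T * (m1 + m2) / (math.pi * m1 * m2))
    return 2.0 * root / (3.0 * (1.0 + alpha12) * math.pi * n * S12 ** 2)


def loglog_slope(Ts, Ds):
    """Least-squares slope of log D against log T."""
    xs = [math.log(t) for t in Ts]
    ys = [math.log(d) for d in Ds]
    xm, ym = sum(xs) / len(xs), sum(ys) / len(ys)
    sxy = sum((x - xm) * (y - ym) for x, y in zip(xs, ys))
    sxx = sum((x - xm) ** 2 for x in xs)
    return sxy / sxx


# Table 6: temperature (deg C) and D at 76 cm Hg
temps_c = [15.00, 19.90, 25.00, 30.00, 35.00, 40.00]
D_obs = [0.034, 0.0386, 0.0415, 0.0426, 0.0442, 0.0473]

print(diffusion_coefficient(6.32e-8, 19.90 + 273.15))
print(loglog_slope([t + 273.15 for t in temps_c], D_obs))
```

With these inputs the 19.90° C collision radius returns a diffusion coefficient close to the tabulated 0.0386, and the fitted exponent lies well above the kinetic-theory ceiling of 5/2, in line with the argument of the text.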

While it must be remembered that the calculation of diffusion coefficient involves a knowledge of the vapour pressure, and that only small changes occur in $D$ over the temperature range considered, it must also be borne in mind that most work on $D$ has been done on small molecules, and theoretical work is confined to simple molecules. A polyatomic molecule such as di-n-butyl phthalate might well be more deformable and $D$ might well be more temperature-sensitive than, say, an argon atom.

The values of $S_{12}$ deduced above are reasonable, and on subtraction of 1.87 Å for air (Jeans 1940) give reasonable values for the size of a di-n-butyl phthalate molecule, e.g. at 19.90°, radius = 4.45 Å. The value deduced from the density of the liquid, assuming spherical close-packed molecules, is 4.3 Å, and agrees with the value used in part I, deduced from the dimensions 6.28 − 1.87 = 4.41 Å.

This value of $S_{12}$ deduced from $D$ was used in the calculation of $\Delta = kT/(\pi S_{12}^2 P)$, i.e. Δ was obtained from the constant $E$. It should be noted that even if Δ = 0 the $-ds/dt$ versus $1/P$ curve passes asymptotically to the vacuum rate, when an equation of the same form as equation (2) is obtained. It is found, however, that the use of an equation in which Δ = 0 leads to values of α = 1.29, i.e. greater than unity, which is impossible; to this extent the theory which introduces Δ is justified.

In addition to the diffusion coefficient the evaporation coefficient may be calculated. If $q$ is the observed value of $-ds/dt$ and $q_0$ the value which would be obtained if Langmuir's theory applied (cf. part I), then $\dfrac{q_0}{q} - \dfrac{a}{a+\Delta}$ is plotted against $1/P$, and the slope $= Y/(a\nu\alpha)$, whence α is readily calculated, since $Y = DP$. Results are given in table 7.


Table 7. Evaporation coefficient

temp.     slope =      a          Y = DP     ν        α        α
(°C)      Y/(aνα)      (cm.)                                   smoothed

15.00     0.018        0.0434     2.59       3702     0.90     0.97
19.90     0.017        0.0450     2.93       3732     1.03     1.00
25.00     0.0187       0.0436     3.15       3763     1.03     1.00
30.00     0.0202       0.0423     3.24       3795     1.00     1.00
35.00     0.0198       0.0430     3.36       3826     1.03     1.05


The smoothed values of α were calculated from the smoothed values of $D$ obtained by plotting $\log D$ versus $\log T$. It is clear that α may be taken to be unity over the whole temperature range.
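The entries of table 7 can be reproduced in a few lines. The sketch below is illustrative only: it computes $\nu = (kT/2\pi m_2)^{1/2}$ and then $\alpha = Y/(a\nu \times \text{slope})$. The molar mass 278.34 for di-n-butyl phthalate and the conversion 0° C = 273.15 K are assumptions; the slope, radius and Y values are those tabulated.

```python
import math

K_B = 1.380649e-16      # erg/K
N_A = 6.02214076e23     # 1/mol


def nu(T, M2=278.34):
    """nu = sqrt(kT / (2 pi m2)) in cm/s for a molecule of molar mass M2."""
    m2 = M2 / N_A
    return math.sqrt(K_B * T / (2.0 * math.pi * m2))


def evaporation_coefficient(slope, a, Y, nu_value):
    """alpha from the slope of q0/q - a/(a + Delta) against 1/P."""
    return Y / (a * nu_value * slope)


# Rows of table 7: (temp in deg C, slope, a in cm, Y = DP)
rows = [
    (15.00, 0.018, 0.0434, 2.59),
    (19.90, 0.017, 0.0450, 2.93),
    (25.00, 0.0187, 0.0436, 3.15),
    (30.00, 0.0202, 0.0423, 3.24),
    (35.00, 0.0198, 0.0430, 3.36),
]
for t_c, slope, a, Y in rows:
    v = nu(t_c + 273.15)
    print(t_c, round(v), round(evaporation_coefficient(slope, a, Y, v), 2))
```

The computed ν values agree with the tabulated column to within a unit or two, and the resulting α values scatter about unity as the text concludes.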




In addition, a further check on the latent heat of evaporation may be calculated. The vacuum rates of evaporation may be calculated from the equation for $-ds/dt$ (equation (6)), viz. $-ds/dt = Ea/F$ at hard vacuum. Hence per unit area

$$-\frac{dm}{dt} = \frac{Ea}{F}\,\frac{a\rho}{2}\,\frac{1}{4\pi a^2} = \frac{E\rho}{8\pi F}$$

for a hard vacuum. Theoretically this value $= \dfrac{m_2 p_0 \alpha}{(2\pi m_2 kT)^{1/2}}$, and hence

$$\frac{d\log_e p_0}{d(1/T)} = -\frac{L}{R} = \frac{d\left[\log_e(-dm/dt) + \tfrac{1}{2}\log_e T\right]}{d(1/T)},$$

taking α = 1. Hence $L$, the latent heat per mole, may be calculated from the slope of $\log(-dm/dt) + \tfrac{1}{2}\log T$ plotted against $1/T$. This gives $L$ = 21,900 cal. per mole, in exact agreement with the value deduced from the vapour-pressure measurements. In addition, $L$ may be calculated from the Langmuir portion of the $(-ds/dt)$ versus $1/P$ curve, since in this region $-ds/dt = 8\pi m_2 D c_0/\rho$; the slope of $\log(-ds/dt) - 2\log T + \log\rho$ versus $1/T$ enables $L$ to be calculated, the value 21,900 being again obtained.


The rate of evaporation of droplets of di-n-butyl phthalate in hydrogen and Freon

Preliminary work with hydrogen using charcoal absorbent showed that the expected increase in diffusion coefficient occurred, but that the evaporation coefficient was reduced to 0.6. Since the latter should be independent of the ambient gas, either the theory was incorrect or the absorbent was at fault. The use of silica gel and activated alumina as absorbents gave the expected result, α = 0.99 and 1.01 respectively at 19.9° C, whereas Apiezon oil on glass-wool gave a slightly lower value of α (0.9), but the same evaporation coefficients at the high pressure. Presumably the charcoal desorbs air slowly in spite of the technique used of repeated fillings with hydrogen and pumping down.

With Freon 12 the same technique was employed, but it was found that Apiezon oil was the best absorbent. With silica gel the Langmuir region evaporation characteristics were the same as for Apiezon oil, but the low-pressure region (0.06 to 0.01 cm.) gave lower rates, giving α = 0.7 at 19.90° C. The absorption of the readily condensible Freon on the silica gel was, moreover, too great to allow steady pressure readings to be taken, giving a wide scatter to the points. The results using the best absorbents are summarized in table 8, which gives the constants $E$, $F$ and $G$ of equation (6).

The values of $D$, α and $S_{12}$ were calculated as in the previous section. The collision radius of H₂ may be taken as 1.36 Å (Jeans), and the radius of Freon estimated from the electron diffraction data of Brockway (1937) is 3.2 Å. The value computed from the density for close-packed spheres is 3.0 Å. Subtracting these values (3.2 Å for Freon) from the values of $S_{12}$ given in table 8 we obtain the following values for the radius of the di-n-butyl phthalate molecule at 19.90° C: from the measurements using H₂ 4.68 Å, from the measurements using Freon 5.24 Å; the difference is possibly due to the size of the Freon molecule, which does not penetrate the di-n-butyl phthalate molecule as deeply as 'molecules of air'. The agreement for hydrogen and air supports the use of the factor $\left(\dfrac{1}{m_1} + \dfrac{1}{m_2}\right)^{1/2}$ which appears in all formulae for the diffusion coefficient, although the formula for rough spheres contains a further term dependent on $m_1$ and involving radii of gyration (cf. Chapman & Cowling). The agreement in the value of α for hydrogen, air and Freon is good support for the theory.

Table 8. Evaporation of drops of di-n-butyl phthalate in hydrogen and Freon

temp.    gas      a (mean)    E × 10⁶    F × 10⁴    G × 10⁴    F × 10⁴     D at       α       S₁₂ × 10⁸
(°C)              in cm.                                       smoothed    76 cm.             cm.

19.90    H₂       0.0443      3.65       31.2       2.61       31.2        0.153      0.99    6.04
19.90    Freon    0.0448      0.30       2.51       1.36       2.57        0.0126     1.00    8.44
30.00    Freon    0.0428      1.14       2.98       1.48       2.80        0.0140     0.95    8.21


The shape of the drops

The drops used were slightly smaller than those described in part I, and some doubt was felt as to their shape; up to a certain size the larger the drop the more spherical is the shape, when the drop is hanging at the top of a quartz fibre. Drops of radii 0.66, 0.48 and 0.42 mm. were suspended from the top of the fibre used for evaporation, and shadow photographs were taken using an enlarger (magn. × 16), with a glass scale for calibration. The mean radius was calculated from the mass, and hence the surface, assuming a spherical drop. This was compared with the surface computed from the dimensions, assuming an ellipsoid of revolution. It was found that the errors involved in assuming spherical shape were less than 1 %.


Discussion 

The value of α recorded in this paper for di-n-butyl phthalate is unity, but the work of Alty (1935, 1937) and Baranaev (1946) shows that other polar molecules such as water and the aliphatic alcohols have much lower values of α. It should be noted, however, that a liquid such as di-n-butyl phthalate is ideal for the study of evaporation coefficients, since rates of evaporation are low enough for self-cooling to be negligible under proper conditions, and since the vapour pressure is so low that rates of evaporation at very low air pressures are not influenced by the presence of vapour molecules. The last point is of some importance with more volatile liquids. Our formulae, derived in part I, are readily applicable to a plane or approximately plane surface of area $A_1$, for which the vapour diffuses away from a region of concentration $c_1$ molecules per ml. given by

$$A_1 \nu\alpha\,(c_0 - c_1) = \frac{D A_1 c_1}{l - \Delta},$$

where $c_0$ is the saturation concentration in molecules per ml., $D$ the diffusion coefficient at the air pressure used, $l$ the length of the diffusion gradient, i.e. the distance from the liquid surface to the absorbing surface. Solving for $c_1$ as before, this gives for the rate of evaporation

$$\frac{D A_1}{l - \Delta}\,\frac{c_0 \nu\alpha m_2}{\nu\alpha + \dfrac{D}{l - \Delta}} = \frac{A_1 c_0 \nu\alpha m_2}{1 + \dfrac{\nu\alpha\,(l - \Delta)}{D}}\ \text{g./sec.}$$

A similar formula has been derived by Brookfield, Fitzpatrick, Jackson, Matthews & Moelwyn-Hughes (1947), who used liquids in tubes, and tested the Langmuir region by absorbing the vapour in charcoal, or in sulphuric acid (in the case of water vapour) and determining the rate of evaporation as a function of $l$ in air at atmospheric pressure. However, only the Langmuir region was studied. A similar experimental method was used by Langmuir & Schaeffer (1943). When the term $\nu\alpha(l - \Delta)/D$ is large compared with unity, the rate of evaporation will be given by $D A_1 c_0 m_2/(l - \Delta)$, i.e. the 'Langmuir rate'; $l$ would have to be very small in order that $\nu\alpha(l - \Delta)/D$ should be unity, since ν is approx. 10⁴ for water, $D$ is approx. 0.25 in air at 1 atm., and even if α is 0.04 (Alty 1937), $(l - \Delta)$ would have to be 6 × 10⁻⁴ cm. before the rate of evaporation began to approach the vacuum rate. Thus very small values of $l$ would have to be used in order to determine α by this method; Brookfield et al. used 1.6 cm. as the smallest value of $l$. It might be possible to use the method at lower air pressures, e.g. at 7.6 mm. of air $(l - \Delta)$ = 0.6 mm. for $\nu\alpha(l - \Delta)/D = 1$.
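The order-of-magnitude argument above amounts to solving $\nu\alpha(l-\Delta)/D = 1$ for the gap. A minimal sketch, using only the round numbers quoted in the text (ν ≈ 10⁴ cm/s, D ≈ 0.25 cm²/s at 1 atm., α = 0.04) and the assumption that D scales inversely with air pressure; the function names are illustrative:

```python
def critical_gap(D, nu, alpha):
    """Gap (l - Delta) in cm at which nu * alpha * (l - Delta) / D equals 1."""
    return D / (nu * alpha)


def plane_rate_fraction(gap, D, nu, alpha):
    """Evaporation rate from a plane surface as a fraction of the vacuum rate.

    Follows from rate = A1 c0 nu alpha m2 / (1 + nu alpha gap / D).
    """
    return 1.0 / (1.0 + nu * alpha * gap / D)


nu_water, alpha_water = 1.0e4, 0.04
D_1atm = 0.25                       # cm^2/s in air at 76 cm of mercury
D_low = D_1atm * 760.0 / 7.6        # assumed 1/P scaling to 7.6 mm of air

print(critical_gap(D_1atm, nu_water, alpha_water))   # cm, at 1 atm
print(critical_gap(D_low, nu_water, alpha_water))    # cm, at 7.6 mm
print(plane_rate_fraction(1.6, D_1atm, nu_water, alpha_water))
```

The last line shows why a 1.6 cm gap at atmospheric pressure cannot reveal α: the rate is then a tiny fraction of the vacuum rate and is controlled almost entirely by diffusion.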


The low values of α, if real, imply some impedance to the condensation of vapour, which is probably most simply thought of as (a) an energy barrier and (b) an orientation factor such that favourable orientation is necessary for condensation. The combination of (a) and (b) would give α of the form $\alpha = \theta e^{-E'/RT}$, where θ is the orientation factor, which may alternatively be thought of as an entropy term, and $E'$ the activation energy necessary to pass the barrier. The existence of an $E'$ term is perhaps most clear in the work of Langmuir on the evaporation of water through monolayers spread on the water; Langmuir & Schaeffer (1943), however, did not calculate the value of the θ factor. It should be noted that the vapour-pressure equation will not involve $E'$, since the equations for evaporation and condensation will each involve a term $e^{-E'/RT}$. It is hoped in the future to make a theoretical study of the value of α for complex molecules by the transition state method.

The vacuum rate of loss of mass in g./cm.²/sec. during evaporation may be calculated from the data above to be $2 \times 10^{10}\,e^{-21{,}900/RT}$, and the vapour pressure in mm. of mercury to be $3.5 \times 10^{11}\,e^{-21{,}900/RT}$. The entropy of vaporization $\Delta S_p$, taking unit pressure as 1 mm. of mercury, is given by $T\Delta S_p = \Delta H + RT\log_e p_0$, and from our data at 25.00° C $\Delta S_p$ = 0.0527 (kcal./degree), in reasonable agreement with the value deduced from the Barclay-Butler (1938) rule, $\Delta S_p = 0.0277 + 0.0011L$, which gives $\Delta S_p$ = 0.0518.
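The two entropy figures can be checked directly. This sketch is not from the paper: it assumes the vapour-pressure expression $p_0 = 3.5 \times 10^{11} e^{-21{,}900/RT}$ mm. (the power of ten is a reading of a damaged exponent, consistent with the vapour pressures quoted earlier), R = 1.9872 cal/(mol K), and L in kcal/mole in the Barclay-Butler rule.

```python
import math

R_CAL = 1.9872  # gas constant, cal / (mol K)


def vapour_pressure_mm(T, prefactor=3.5e11, L=21900.0):
    """Vapour pressure in mm of mercury; the prefactor is an assumed reading."""
    return prefactor * math.exp(-L / (R_CAL * T))


def entropy_of_vaporization(T, L=21900.0):
    """Delta S_p in kcal/degree from T * dS = dH + R T ln(p0), p0 in mm Hg."""
    p0 = vapour_pressure_mm(T, L=L)
    return (L + R_CAL * T * math.log(p0)) / T / 1000.0


def barclay_butler(L_kcal):
    """Barclay-Butler rule, Delta S_p = 0.0277 + 0.0011 L (L in kcal/mole)."""
    return 0.0277 + 0.0011 * L_kcal


print(vapour_pressure_mm(293.05))
print(entropy_of_vaporization(298.15))
print(barclay_butler(21.9))
```

Because the same L appears in the exponent and in ΔH, the entropy reduces to R ln(3.5 × 10¹¹) and is independent of temperature in this approximation.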




This paper is based on work which was submitted as a thesis by one of us (J.B.) 
for partial requirement for the Ph.D. degree at the University of Leeds. Our thanks 
are due to Professor M. G. Evans, F.R.S., for his constant interest and encouragement.

References

Alty, T. 1937 Proc. Roy. Soc. A, 161, 68.
Alty, T. & Mackay, C. A. 1935 Proc. Roy. Soc. A, 149, 104.
Baranaev, M. K. 1946 J. Phys. Chem. Soc. 20, 399.
Barclay, I. M. & Butler, J. A. V. 1938 Trans. Faraday Soc. 34, 1445.
Bradley, R. S., Evans, M. G. & Whytlaw-Gray, R. W. 1946 Proc. Roy. Soc. A, 186, 368.
Brockway, L. O. 1937 J. Phys. Chem. 41, 185, 746.
Brookfield, K. J., Fitzpatrick, A. D. N., Jackson, J. F., Matthews, J. B. & Moelwyn-Hughes, E. A. 1947 Proc. Roy. Soc. A, 190, 59.
Burrows, G. 1946 J. Soc. Chem. Ind., Lond., 65, 360.
Chapman, S. & Cowling, T. G. 1939 The mathematical theory of non-uniform gases. Cambridge University Press.
Hickman, K. C. D. 1936 J. Franklin Inst. 221, 383.
Hickman, K. C. D., Hecker, J. C. & Embree, N. D. 1937 Industr. Engng Chem. (Anal. ed.), 9, 264.
Jeans, J. 1940 An introduction to the kinetic theory of gases. Cambridge University Press.
Kapff, S. F. & Jacobs, R. B. 1947 Rev. Sci. Instrum. 18, 581.
Langmuir, I. & Schaeffer, V. J. 1943 J. Franklin Inst. 235, 119.
Neumann, K. & Volker, E. 1932 Z. Phys. Chem. A, 161, 33.
Verhoek, F. H. & Marshall, A. L. 1939 J. Amer. Chem. Soc. 61, 2737.
Woodland, D. J. & Mack, E. 1933 J. Amer. Chem. Soc. 55, 3149.
Zabel, R. M. 1933 Rev. Sci. Instrum. 4, 233.


The rate of evaporation of droplets. III. Vapour pressures
and rates of evaporation of straight-chain
paraffin hydrocarbons

By R. S. Bradley and A. D. Shellard
Department of Inorganic and Physical Chemistry, University of Leeds

(Communicated by M. G. Evans, F.R.S. Received 28 December 1948)


The vapour pressures, latent heats of vaporization and fusion and diffusion coefficients of the vapours in air, have been determined at 15 to 40° C, by a combination of Knudsen's vapour-pressure technique and studies on the rate of evaporation of drops and solid beads, for n-C₁₆H₃₄, n-C₁₇H₃₆ and n-C₁₈H₃₈. The rates of evaporation of drops and solid beads agree with the previously published theory, and lead to a value of unity for the evaporation coefficient. A new experimental method is described for determining the self-cooling on evaporation of a drop. The bearing of the results on the size and shape of hydrocarbon molecules is discussed.


Introduction

In part I (Bradley, Evans & Whytlaw-Gray 1946) and part II (Birks & Bradley 1949) a description has been given of experiments on the vapour pressure and evaporation rate of di-n-butyl phthalate. In this work the application of the theory of Fuchs (1934) to droplet evaporation was established. The present paper gives additional support to the theory, but the main purpose of the work was to apply the technique to a number of related molecules. The higher straight-chain paraffin hydrocarbons were chosen for this comparative study, not only on account of the interest attaching to the molecular configuration in the liquid and gaseous state, but also because of gaps in the published data for these substances. The values of the collision radii we have obtained and the mean configuration we consider probable are also of interest in kinetic studies. It is hoped later to extend the work to branched-chain paraffin hydrocarbons and to fluorinated hydrocarbons and hydrocarbon mixtures.


Preparation and physical properties of the hydrocarbons

n-Hexadecane was prepared from cetyl iodide by reduction, and also by the Wurtz synthesis from n-octyl iodide; n-octadecane was prepared from n-octadecyl alcohol, via the iodide. The Carbide and Carbon Chemicals Corporation of the U.S.A. very kindly gave us a sample of pure n-heptadecane, and we are also grateful to Professor Ubbelohde for a small sample of pure octadecane which was useful for comparison purposes. The final purification of the samples we prepared was made by molecular distillation.

The following physical constants were determined: refractive index, C₁₆H₃₄ 1.43469 (from the Wurtz reaction, which gave the purer product), C₁₇H₃₆ 1.4348 (determined by Dr Audsley of the Department of Organic Chemistry), C₁₈H₃₈ 1.44296; density, C₁₆H₃₄ 0.7743, C₁₇H₃₆ 0.7746, 0.7711, 0.7676 (determined by Dr Audsley), C₁₈H₃₈ 0.7721. These densities were necessary for the calculation of the drop surface, and since some experiments were made on solid hydrocarbon beads the density of the solids was also determined. Some difficulty was experienced in the use of a density bottle containing water, owing to air which was trapped by the greasy surface and which was difficult to remove even on pumping, and we therefore used a method we have not seen described in the literature, viz. the method of air buoyancy. Approximately 0.1 g. of hydrocarbon was suspended from the quartz microbalance of the type described in part I, and was counterbalanced by means of a quartz weight suspended from the other arm of the balance. The change in deflexion on changing the pressure was followed on a travelling microscope, and from the density of the quartz counterpoise that of the solid hydrocarbon could readily be calculated. With 0.15 g. of hydrocarbon a change of 1 atm. gave a deflexion of the order of 2 mm., which could be read to 0.002 mm. on the travelling microscope. Results are as follows: C₁₆H₃₄ ρ 0.866 ± 0.001, C₁₇H₃₆ ρ 0.864 ± 0.004, C₁₈H₃₈ ρ 0.873 ± 0.004.
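The air-buoyancy method can be expressed as a one-line balance. The sketch below is an illustration under stated assumptions, not the authors' working: when the air density changes by Δρ_air, the apparent mass imbalance changes by Δρ_air times the difference in volume between sample and counterpoise, and this can be inverted for the sample density. The air density at 1 atm. (1.2 × 10⁻³ g/cm³), the counterpoise density (2.20 g/cm³ for fused quartz) and all function names are assumptions.

```python
def buoyancy_mass_change(m_sample, rho_sample, m_counter, rho_counter, d_rho_air):
    """Change in apparent mass imbalance (g) when the air density changes.

    The sample arm loses d_rho_air * V_sample of lift, the counterpoise arm
    d_rho_air * V_counter; the balance sees the difference.
    """
    return d_rho_air * (m_sample / rho_sample - m_counter / rho_counter)


def sample_density(m_sample, m_counter, rho_counter, d_rho_air, dm):
    """Invert the balance above for the density of the sample (g/cm^3)."""
    return m_sample / (dm / d_rho_air + m_counter / rho_counter)


# Hypothetical run: 0.15 g of solid hydrocarbon against 0.15 g of fused quartz,
# air pumped out from 1 atm (air density change assumed to be 1.2e-3 g/cm^3).
dm = buoyancy_mass_change(0.15, 0.866, 0.15, 2.20, 1.2e-3)
print(dm * 1e6, "micrograms")
print(sample_density(0.15, 0.15, 2.20, 1.2e-3, dm))
```

A change of roughly a hundred micrograms is large compared with the reading precision of the balance described in the text, which is presumably why the method gave densities good to a few parts per thousand.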

As a further check on purity X-ray powder photographs of the solids were taken 
by Mr S. C. Nyburg, by kind permission of Professor E. G. Cox. The specimens were 
melted and introduced into a cellophane tube of diameter 1.6 mm., mounted in a
camera of 9.6 cm. radius. Unfiltered copper radiation was used for an exposure of
1 hr. As far as could be detected the sample of octadecane supplied by Professor
Ubbelohde was identical with ours; the same remark applies to the evaporation 
characteristics. 



Vapour pressures and heats of vaporization

The vapour pressures were determined by the Knudsen method, as described in parts I and II, using benzophenone as the calibrating substance. It was necessary, in the determination of the vapour pressure of the solid hydrocarbons, to allow them to solidify under vacuum to prevent explosive degassing of the solid. The effusion tubes were mounted in a solid aluminium block, and self-cooling appeared to be absent, since the same results for vapour pressure, within the experimental error, were obtained with different-sized holes. This independence of hole size also suggests that the correct conditions for mean free path were maintained. According to Knudsen (1909) the mean free path of the effusing molecules should be at least ten times the hole diameter, from his observations on the effusion of carbon dioxide from a higher to a lower pressure. It is not clear, however, how Knudsen's experimental result applies to our mode of operation, since the vapour in our experiments was maintained at constant pressure in the tube and effused into a hard vacuum. As a criterion
we have calculated from the evaporation data (vide infra) the mean free paths of the hydrocarbon molecules for collision with like molecules, viz.

$$\lambda_{11} = \frac{kT}{\sqrt{2}\,\pi S_{11}^2\,p_0},$$

where $p_0$ is the vapour pressure and $S_{11}$ the diameter of a hydrocarbon molecule, and in every instance $\lambda_{11}$ is greater than the hole diameters (0.14, 0.10 and 0.06 cm.), although not always ten times. Recently Zilberman-Granovskaja & Schugan (1940) have found, with a technique similar to ours, that ideal effusion will occur provided that the mean free path is greater than one-fifth the diameter of the hole, a much less severe condition than Knudsen's. It should be noticed, however, that departure from the true result with holes which are too large may occur through self-cooling or from a departure from ideal molecular flow, and it is not easy to disentangle the two factors. It should be noted, moreover, that the results are less dependent on the hardness of the vacuum than may be supposed, since we have found that an external pressure of 1000 times the vapour pressure reduces the rate of effusion by only 80 % (part II). Some workers have subtracted the small air pressure in the apparatus from the vapour pressure, in order to correct for the imperfection of the vacuum, but this is clearly incorrect; this again emphasizes the difference between our technique and that due to Knudsen.

Graphs of $\log p_0$ when plotted against $1/T$ gave good straight lines, in further evidence of the correctness of the conditions of flow. The constants $A$ and $B$ in the equation $\log_{10} p_0 = -\dfrac{A}{T} + B$, $p_0$ in cm. of mercury, and the latent heats of fusion, $L_F$, of vaporization of the solid, $L_S$, and of vaporization of the liquid, $L$, in calories per mole, are given in table 1.

For n-C₁₆H₃₄ and n-C₁₇H₃₆ the latent heat for the solid has been calculated from two points only, whereas six points at different temperatures could be plotted for most of the other cases. The published values for the vapour pressures at room temperature are very scanty. Ubbelohde (1938) studied n-hexadecane, melting-point 18.1° C, at 9 and 15° C, and obtained consistent results at 9° C with different hole sizes, but at 15° C the use of holes of radii 0.33, 0.58 and 0.76 mm. gave vapour pressures of 1.38, 1.04 and 0.56 dynes cm.⁻² respectively. Ubbelohde believed that
the trend of the results at 15° C was related to premelting. To test this point we studied n-hexadecane at 16.97° C, i.e. very near to the melting-point, and observed no abnormal behaviour in the vapour pressure and rate of evaporation; so far as our measurements go there is no evidence of premelting. Ubbelohde's value for solid n-hexadecane at 9° C may be extrapolated to 14.97° C, using his value of the latent heat of vaporization of the solid, 28,800 cal./mole, to give a vapour pressure of 0.508 × 10⁻⁴ cm. of mercury, in reasonable agreement with our value, 0.43 × 10⁻⁴ cm. of mercury, considering the nature of the extrapolation.

Table 1. Vapour pressure constants and latent heats

hydrocarbon    state     range of      A       B         L                L_S               L_F
                         temp. (°C)

n-C₁₆H₃₄       liquid    20–35         4189    10.260    19,170 ± 600     –                 –
               solid     15–17         6579    18.466    –                30,100 ± 1000     10,930 ± 1600
n-C₁₇H₃₆       liquid    25–40         4374    10.333    20,020 ± 700     –                 –
               solid     15–20         6866    18.738    –                31,420 ± 1100     11,400 ± 1800
n-C₁₈H₃₈       liquid    30–40         4730    11.036    21,640 ± 800     –                 –
               solid     15–25         7995    21.831    –                36,580 ± 1200     14,940 ± 2000
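The latent heats in table 1 follow from the constants A by the Clausius-Clapeyron relation, L = 2.303 R A, and the heat of fusion is the difference between the solid and liquid values. The short sketch below (illustrative names; R = 1.9872 cal/(mol K) and 0° C = 273.15 K are assumed) reproduces the tabulated figures and the vapour pressure of solid n-hexadecane at 14.97° C.

```python
import math

R_CAL = 1.9872  # gas constant, cal / (mol K)


def latent_heat(A):
    """Latent heat in cal/mole from the slope A of log10(p0) against 1/T."""
    return math.log(10.0) * R_CAL * A


def vapour_pressure_cm_hg(T, A, B):
    """Vapour pressure in cm of mercury from log10(p0) = -A/T + B."""
    return 10.0 ** (-A / T + B)


# n-hexadecane constants from table 1
A_liq, B_liq = 4189.0, 10.260
A_sol, B_sol = 6579.0, 18.466

print(latent_heat(A_liq))                          # liquid, cal/mole
print(latent_heat(A_sol))                          # solid, cal/mole
print(latent_heat(A_sol) - latent_heat(A_liq))     # fusion, cal/mole
print(vapour_pressure_cm_hg(14.97 + 273.15, A_sol, B_sol))
```

The same two functions applied to the other rows give a quick consistency check on any entry of the table.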


Our values of the latent heats of vaporization of the liquid hydrocarbons are in good agreement with those published by Rossini (1945), viz. C₁₆H₃₄ 19,360, C₁₇H₃₆ 20,640, C₁₈H₃₈ 21,720 cal./mole. These and other values of Rossini support the view that the latent heat of vaporization of the liquids is a linear function of the number ($n$) of carbon atoms in the molecule. Huggins (1939), however, claimed that unpublished data of Rossini supported his view that on vaporization the hydrocarbon molecule became coiled into a ball, and that the latent heat of vaporization of the liquid was proportional to $n^{2/3}$; the more recent data of Rossini do not support this relation, but leave open the possibility of coiling.


The bate op evapoeation op deops and beads op hydeooaebons 

The rate of evaporation of drops on to absorbents of charcoal and silica gel was 
studied as a function of the air pressure over a range of temperature in the manner 
described in parts I and II. As explained in these earher papers the rate of change of 
mass with time is given by 

dm inaDcama 
" dt ~ D ^ a ’ 
ava. a-fA 



243 


The rate of evaporation of droplets. Ill 

and the rate of change of surface with time by 

ds SirDcoW,^ 

~Jt~ TD , a 1’ 

' \jbva a + A_ 


where m is the mass, s the surface, a the radius and p the density of the drop, D the 
diffusion coefficient at the air pressure 'used, Cq the saturation concentration of 
hydrocarbon vapour in molecules per ml., the mass of the hydrocarbon molecule. 


a the evaporation coefficient, v = 



, k the Boltzmann constant, and A the 


effective thickness of the vacuous envelope surrounding the drop. 

The study of droplet evaporation thus falls into two sections, the first when $D/(av\alpha)$
is small compared with $a/(a+\Delta)$, which is then unity, when the ‘Langmuir’ rate is
observed, and the diffusion coefficient may be calculated; the second when $D/(av\alpha)$
cannot be neglected in comparison with $a/(a+\Delta)$, and the rate of evaporation
approaches the vacuum rate. The second region is more difficult to study than with
di-n-butyl phthalate (cf. parts I and II) owing to the greater vapour pressure of the
hydrocarbons, and the greater rate of evaporation at the lower pressures; silica gel
was found to be the best absorbent in this region. It is therefore not possible to study
an isolated hydrocarbon drop at the lowest air pressures without making some
allowance for self-cooling, a special study of which was therefore made.

The diffusion coefficient may readily be calculated from the rate of evaporation
in the ‘Langmuir’ region, when $-ds/dt = q = 8\pi D c_0 m_2/\rho = \text{constant}/P$. The values
of $qP$ are given in table 2.


Table 2. qP in 10^{-?} cm.^2 min.^-1 × cm. of mercury

temp. (°C)   n-C16H34   n-C17H36   n-C18H38
14.97        0.278      0.058      0.00889
16.97        0.395      -          -
20.01        0.671      0.153      0.02667
24.98        1.168      0.3658     0.07900
29.98        2.095      0.6593     0.217
34.88        3.535      1.152      0.3970
39.94        -          1.967      0.726


The melting-points of n-C16H34, n-C17H36 and n-C18H38 are respectively 18.1,
22.0 and 28.1° C, so that some of the above data refer to solid hydrocarbons.

From the vapour pressures listed above, the diffusion coefficient in 76 cm. of air
may be calculated. When log D is plotted against log T a fair straight line is obtained,
from which it appears that $D \propto T^2$, i.e. a smaller temperature dependence than
for di-n-butyl phthalate (cf. part II). The lines for liquid and solid hydrocarbons
show a slight separation, which we attribute to surface irregularities on the solid
bead. The separation suggests that the rates of evaporation from the solid should be
reduced by 2 or 3 % to obtain the values for a perfectly smooth bead. The smoothed
values for D listed in table 3 are taken from the plot of log D versus log T, and allow
for surface irregularity.





Table 3. Diffusion coefficient at 76 cm. of air, in c.g.s. units

             n-C16H34            n-C17H36            n-C18H38
temp. (°C)   exp.     smoothed   exp.     smoothed   exp.     smoothed
14.97        0.0384   0.0380     0.0403   0.0396     0.0388   0.0373
16.97        0.0392   0.0386     -        -          -        -
20.01        0.0392   0.0393     0.0422   0.0408     0.0396   0.0386
24.98        0.0402   0.0406     0.0423   0.0423     0.0418   0.0397
29.98        0.0425   0.0420     0.0436   0.0438     0.0408   0.0411
34.88        0.0436   0.0436     0.0463   0.0462     0.0421   0.0426
39.94        -        -          0.0463   0.0466     0.0446   0.0439

It will be seen that the values of D for the three hydrocarbons at the same
temperature are very near to one another, the extreme differences being only 6 %.
n-C16H34 and n-C18H38 lie in the expected relation to one another, but n-C17H36 appears
out of order. In view of the nature of the calculation of D from two experimental
quantities, the rate of evaporation and the vapour pressure, both of which may be
in error to a few per cent, it is not possible to state definitely whether or not the
position of n-C17H36 is real and related to the odd number of carbon atoms. The
significance of the approximate equality of the value of D for the three hydrocarbons is
discussed in the last section.
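
The temperature dependence read off the log D versus log T plot can be checked with a short least-squares fit. The sketch below is only an illustration: it takes the smoothed n-C18H38 column of table 3 as input, converts the temperatures to kelvin with an assumed offset of 273.15, and the function name `loglog_slope` is our own.

```python
import math

# Smoothed diffusion coefficients for n-C18H38 (table 3), c.g.s. units at 76 cm. of air.
temps_c = [14.97, 20.01, 24.98, 29.98, 34.88, 39.94]
d_smoothed = [0.0373, 0.0386, 0.0397, 0.0411, 0.0426, 0.0439]


def loglog_slope(xs, ys):
    """Least-squares slope of ln(y) against ln(x)."""
    lx = [math.log(x) for x in xs]
    ly = [math.log(y) for y in ys]
    mean_x = sum(lx) / len(lx)
    mean_y = sum(ly) / len(ly)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(lx, ly))
    sxx = sum((a - mean_x) ** 2 for a in lx)
    return sxy / sxx


# Assumption: absolute temperature = Celsius temperature + 273.15.
temps_k = [t + 273.15 for t in temps_c]
exponent = loglog_slope(temps_k, d_smoothed)
print(f"D varies roughly as T^{exponent:.2f}")
```

A slope close to 2 is what a fair straight line on the log-log plot would imply for these smoothed values.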

In contrast to the requirements for the diffusion coefficient, evaluation of the
evaporation coefficient requires measurements to be made at comparatively low
pressures of the order of 0.1 mm. of mercury, when the hydrocarbons studied evaporate
sufficiently fast for the self-cooling to be an important factor (contrast di-n-butyl
phthalate, part II). This difficulty, for the liquid, is not removed by studying
hydrocarbons of higher molecular weight, since the melting-point is raised and higher
temperatures must be employed. The self-cooling, however, will be less important
for suitable branched-chain hydrocarbons, which have a lower melting-point than
their straight-chain isomers, and which, it is hoped, will be studied in the future.
In view of the uncertainties attaching to the thermal accommodation coefficient,
and the contributions to thermal flow of conduction through the fibre and of radiation,
an experimental study was made of self-cooling.

A droplet of octadecane of the same size as that used for evaporation studies was
suspended from a short fibre in the balance case, above a watch-glass containing
silica gel, the micro-balance having been removed, and was observed by means of
a microscope. The essence of the method consisted in maintaining the temperature
of the thermostat at a value slightly above the melting-point, and in reducing
the pressure inside the case until, with a suitable adjustment of pressure and
temperature, self-cooling just caused solidification; the self-cooling is then the difference
in temperature between the melting-point and the temperature of the thermostat.
In operation the method was complicated by the appearance of a second crystalline
form of octadecane. The melting-point of the stable form is 28.1° C, but the drop
did not solidify when kept for several hours at 28.0° C at atmospheric pressure, or
for several hours at intermediate temperatures between 28.1 and 26.8 to 26.9° C.
Solidification to a transparent solid occurred, however, on cooling below 26.8 to
26.9° C, and this solid reverted to the normal opaque form on rapid cooling. On
heating the transparent solid it melted at 26.8 to 26.9° C, whereas the normal opaque
form always melted at 28.1° C. This behaviour was strictly reproducible, and it was
concluded that a metastable form of melting-point 26.8 to 26.9° C was formed by
slow cooling of the liquid. In confirmation of this view Carey & Smith (1933) prepared
a sample of octadecane of melting-point 27.9 to 28.0° C, and a second form with
a melting-point at c. 27.1° C, and Ubbelohde (1938) showed that an arrest occurred
in the cooling curve at 27.0° C, and no viscous flow below 26.86° C.

Because of this transition it was necessary to cool the liquid to just above the
metastable melting-point 26.8 to 26.9° C. The pressure was then reduced, kept
constant at a low value, and the drop was observed for signs of solidification. If no
solidification occurred after 30 min. the pressure was reduced, kept constant, and
the drop was again observed. If no solidification occurred at the lowest pressure
used in the evaporation studies, the whole process was repeated at a series of lower
temperatures, until solidification occurred. Values of the radius of the drop were
also recorded at intervals. Results are given in table 4.


Table 4. Self-cooling of drops

temp. (°C)     pressure in   nature     radius of     self-cooling
(thermostat)   cm. Hg        of drop    drop (cm.)    (°C)
27.7           <0.001        liquid     -             -
27.5           <0.001        liquid     -             -
27.4           0.0089        solid      0.050         0.5-0.6
27.3           0.01015       solid      0.057         0.4-0.6


The maximum theoretical self-cooling was calculated in the manner described in
part II, on the assumption that the thermal accommodation coefficient is unity, no 
allowance being made for conduction through the fibre or for radiation transfer of 
heat, and was found to be approximately five times the above experimental values. 
This factor was assumed to be applicable to all conditions, and the rates of evapora- 
tion corrected accordingly. In addition, for comparison, the rates were corrected 
on the assumption of ‘theoretical’ self-cooling. 

As in part II, the results for the evaporation of drops could be represented by the
equation

$-ds/dt = \ldots$ cm.$^2$ min.$^{-1}$ ($a$ in cm., $P$ in cm. of mercury),

where E, F and G, corrected for ‘theoretical’ and ‘experimental’ self-cooling,
are constants given in table 5, together with the corresponding values of the
evaporation coefficient $\alpha$, calculated as in part II. The greatest correction due
to self-cooling occurred with n-octadecane at the lowest pressure, 0.0478 cm. of
mercury, when $-ds/dt$ observed was 34.95 × 10^{-?} cm.$^2$ min.$^{-1}$, corrected for
‘theoretical’ self-cooling 36.98 × 10^{-?} cm.$^2$ min.$^{-1}$, corrected for ‘experimental’
self-cooling 36.17 × 10^{-?} cm.$^2$ min.$^{-1}$. At higher pressures the corrections were
correspondingly smaller. It will be seen that the ‘experimental’ self-cooling brings
the evaporation coefficient to unity or less; the ‘theoretical’ self-cooling, however,
gives values slightly greater than unity, which is an impossible result, and to this
extent our procedure is justified. Although it was not possible, because of the
greater rates of evaporation, to study octadecane and heptadecane over the same
range of pressure as di-n-butyl phthalate, our results show that the evaporation
coefficient may be taken as unity both for the liquid and solid straight-chain
hydrocarbons.


Table 5. Constants in the evaporation equation

                                     F × 10^?                          α
hydrocarbon   temp.   E × 10^?   exp. self-   theoretical   G × 10^?   exp. self-   theoretical
              (°C)               cooling                               cooling
n-C18H38      14.97   0.889      7.35         7.00          2.295      1.00         1.06
              20.01   2.667      7.60         6.80          2.342      0.99         1.10
              29.98   21.7       8.62         7.75          2.460      0.91         1.02
n-C17H36      14.97   5.80       7.85         7.05          2.413      0.96         1.07


Discussion on the size and shape of hydrocarbon molecules

As described in part II, values of the ‘ collision radius ’, i.e. the sum of the radii 
of hydrocarbon and ‘air molecules’, were obtained from the relation 

02 ^_2 1 kT pkT{m,+m^) J 

where $m_1$ and $m_2$ are the respective masses of ‘air molecules’ and hydrocarbon, P is
the pressure, $D_{12}$ the corresponding diffusion coefficient, and the remaining factor, the
inclusion of which is not wholly satisfactory (cf. Furry 1948), allows for the
persistence of velocities. From Jeans’s data this factor is 0.304, 0.306 and 0.308 for the
hydrocarbons n-C16H34, n-C17H36 and n-C18H38 respectively. By subtracting from $S_{12}$ the
radius of an ‘air molecule’, 1.87 Å (Jeans), we obtain the values of the radii given in
table 6. This assumes that the ‘air molecules’ have a constant radius over the small
temperature range adopted, which is very nearly true. Allowance for the temperature
coefficient of the radius of ‘air molecules’, as deduced from the Sutherland
constant for air, would affect the figures for the higher temperature by only 0.6 %,
and we have not included this correction because the Sutherland constant is deduced
from experiments in which collisions between like molecules occur, whereas in our
experiments ‘air molecules’ collide with hydrocarbon molecules. The last column
in table 6 refers to the discussion below.

Table 6. Radii of hydrocarbon molecules deduced from collision formulae

temp. (°C)   n-C16H34   n-C17H36   n-C18H38   [(1+a')/(1-a')]^(1/2)
14.97        4.53       4.40       4.56       1.705
16.97        4.52       -          -          1.697
20.01        4.51       4.37       4.55       1.686
24.98        4.49       4.34       4.53       1.668
29.98        4.46       4.31       4.50       1.652
34.88        4.22       4.29       4.47       1.636
39.94        -          4.27       4.44       1.620




The variation with temperature of the collision radius is another aspect of the
temperature dependence of $D_{12}$ already discussed empirically. According to Sutherland
this variation may be represented by

$$S_{12}^2 = S_\infty^2 (1 + C/T),$$

where C is the Sutherland constant and $S_\infty$ the value of $S_{12}$ at very high temperatures
(the formula clearly does not apply to very low temperatures). Sutherland’s formula
would correspond to a variation in $D_{12}$ of the type $D \propto T^{5/2}/(T+C)$. The Sutherland
constants for the hydrocarbons have been calculated by plotting against T,
and are 560, 590 and 548 for n-C16H34, n-C17H36 and n-C18H38 respectively. These
values are roughly the same, and may be compared with those of Melaven & Mack
(1932), who found from viscosity measurements on n-C7H16, C8H18 and C9H20 the
respective values 445, 337 and 276 for the Sutherland constants. All that can be
said from this point of view is that our values of C are of the right order, since
Sutherland’s constants were developed to cover collisions of like molecules, the case
studied by Melaven & Mack. It should also be noticed that an analysis in terms of
Sutherland’s formulae was not possible for di-n-butyl phthalate (cf. part II), since
the temperature variation of $S_{12}$ was too great.
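
As a rough illustration of what the Sutherland form implies, the local temperature exponent of D can be written down directly: if D varies as T^(5/2)/(T + C), then d(ln D)/d(ln T) = 5/2 - T/(T + C). The sketch below evaluates this for the three quoted constants; the function names and the choice of 293.15 K as the evaluation temperature are ours, and the numerical check is only a central difference.

```python
import math


def sutherland_d(temp_k, c, scale=1.0):
    """Relative diffusion coefficient of the Sutherland type, D ~ T**2.5 / (T + C)."""
    return scale * temp_k ** 2.5 / (temp_k + c)


def local_exponent(temp_k, c):
    """Analytic log-log slope d(ln D)/d(ln T) for the Sutherland form."""
    return 2.5 - temp_k / (temp_k + c)


def numeric_exponent(temp_k, c, step=0.01):
    """Central-difference estimate of the same slope, as a consistency check."""
    upper = math.log(sutherland_d(temp_k + step, c))
    lower = math.log(sutherland_d(temp_k - step, c))
    return (upper - lower) / (math.log(temp_k + step) - math.log(temp_k - step))


# Sutherland constants quoted in the text; 293.15 K is an assumed evaluation point.
for c in (560.0, 590.0, 548.0):
    print(c, round(local_exponent(293.15, c), 3), round(numeric_exponent(293.15, c), 3))
```

For constants of this size the exponent comes out a little above 2, which is of the same order as the slope of the log D versus log T plot.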

The temperature coefficient of $D_{12}$ and $S_{12}$ may also be treated in terms of repulsive
and attractive forces (cf. part II), since the variation is within the maximum allowed
for simple molecules, and is also in accord with the observations of von Obermayer
(1880), who found for experiments involving collisions between molecules of the
more condensible gases and vapours, e.g. water and carbon dioxide, $D_{12} \propto T^2$.
It is, however, somewhat unreal to treat a flexible molecule such as a hydrocarbon
by the same methods as for simple molecules, and a more realistic treatment is
given below.

The values of the radii given above are in reasonable accord with those deduced
from the density of the liquid, assuming close packing of spherical molecules as a
very rough approximation to the true packing; thus for n-C16H34 this assumption
gives a value 4.41 Å for the radius at 20° C, in comparison with 4.51 Å deduced from
the diffusion coefficient. Our results are not in agreement, however, with the
empirical formula of Burk, Laskowski & Lankelma (1941), based on the data of
Titani (1930), and of Melaven & Mack (1932) on the lower hydrocarbons up to nonane.
The equation of Burk et al. for the radii of hydrocarbon molecules, viz.

$$S_n = 0.23(n-2) + 1.84 \quad (\text{in Å}),$$

where n is the number of carbon atoms in the molecule, gives for
n-C16H34 5.06 Å, and for n-C18H38 5.52 Å,
considerably greater than the experimental values. Moreover, the equation indicates
that the radius increases by 0.23 Å per CH2 group, which is not in accordance with
our results and is unlikely on general grounds for the higher hydrocarbons (vide infra).
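
Both comparisons in this paragraph are simple enough to reproduce. The following sketch evaluates the linear formula of Burk et al. and a close-packed-sphere radius from the liquid density. The density of n-hexadecane at 20° C (0.7733 g/ml) and its molar mass (226.44 g/mole) are handbook-style values assumed here, not figures from the paper, and the function names are ours.

```python
import math

AVOGADRO = 6.02214076e23                            # molecules per mole
CLOSE_PACKING_FRACTION = math.pi / (3.0 * math.sqrt(2.0))  # about 0.7405


def burk_radius(n_carbon):
    """Empirical radius (in angstroms) from the linear formula of Burk et al."""
    return 0.23 * (n_carbon - 2) + 1.84


def close_packed_radius(molar_mass, density):
    """Radius (in angstroms) of close-packed spheres filling the liquid volume.

    molar_mass in g/mole, density in g/ml.
    """
    volume_per_molecule = molar_mass / (density * AVOGADRO) * 1e24  # cubic angstroms
    sphere_volume = CLOSE_PACKING_FRACTION * volume_per_molecule
    return (3.0 * sphere_volume / (4.0 * math.pi)) ** (1.0 / 3.0)


print(round(burk_radius(16), 2), round(burk_radius(18), 2))
# Assumed inputs: M = 226.44 g/mole, density = 0.7733 g/ml for n-C16H34 at 20 deg C.
print(round(close_packed_radius(226.44, 0.7733), 2))
```

With these assumed inputs the close-packing estimate falls near the quoted 4.41 Å, while the linear formula gives radii above 5 Å for both hydrocarbons.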

For the solid hydrocarbons the work of Muller (1925-41) shows that the hydrocarbon
molecules are arranged as parallel zigzag chains, thermal oscillation being
insufficient to overcome the intermolecular forces. The main influence of increasing
temperature is to separate the ‘sideways’ distance between the chains. For the
liquid state the evidence is less conclusive, although the X-ray studies of Stewart
(1928) and of Warren (1933) suggest that there is considerable adlineation of the
molecules. It is probable, however, that the increased distance between the chains,
and the consequent weaker intermolecular forces, allows for some degree of statistical
coiling to occur; intramolecular forces will, however, be small.

In the vapour phase, however, this statistical coiling will be the predominant
factor affecting the configuration. As is well known very long flexible hydrocarbon
chains with free rotation about the links will assume the configuration of greatest
entropy, given by

$$\overline{L^2} = z l^2\,\frac{1+\cos\theta}{1-\cos\theta},$$

where $\overline{L^2}$ is the mean square of the distance
between the centres of the end-carbon atoms, $z$ is the number of links (i.e. $z = n - 1$),
$l$ is the length of a link, and $\theta$ is the supplement of the valency angle. For shorter
chains Eyring (1932) developed an exact formula,

$$\overline{L^2} = l^2[z + 2(z-1)\cos\theta + 2(z-2)\cos^2\theta + \ldots + 2\cos^{z-1}\theta].$$

A direct summation for n-octadecane gives $\overline{L^2}$ = 1.54$^2$ × 33.09 Å$^2$, i.e. $(\overline{L^2})^{1/2}$ = 8.86 Å,
instead of $(\overline{L^2})^{1/2}$ = 8.98 Å, the value for octadecane if the long chain formula were
applied. The difference in $(\overline{L^2})^{1/2}$, amounting to 1 %, is very small.

With any given value of L a large number of configurations are possible, and the
value of L gives only a rough guide to the mean molecular size, although it is clear
from the above result for octadecane, when compared with the length of the molecule,
that considerable coiling must occur in the vapour purely for statistical reasons.
A more useful parameter for comparison with our results is the mean square of the
distance between carbon atom centres and the mass centre ($\overline{R^2}$). Taylor (1948) writes
for long chains and for a summation over all molecular configurations,

$$\overline{R^2} = \tfrac{1}{6}\,\overline{L^2}, \quad \text{or} \quad (\overline{R^2})^{1/2} = 0.4083\,(\overline{L^2})^{1/2}.$$

For octadecane this gives $(\overline{R^2})^{1/2}$ = 3.67 Å, if we assume that the formula for long
chains applies. It should be noted that this result refers to distances from the mass
centre to the centres of carbon atoms.
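
The long-chain figures quoted for octadecane follow from two lines of arithmetic. The sketch below assumes a tetrahedral valency angle, so that the cosine of its supplement is 1/3, together with the link length of 1.54 Å used in the text; the function names are illustrative only.

```python
import math


def long_chain_rms_end_to_end(n_carbon, link_length=1.54, cos_theta=1.0 / 3.0):
    """Root mean square end-to-end distance (angstroms) for a long freely rotating chain.

    cos_theta is the cosine of the supplement of the valency angle;
    1/3 corresponds to an assumed tetrahedral angle.
    """
    z = n_carbon - 1  # number of links
    return link_length * math.sqrt(z * (1.0 + cos_theta) / (1.0 - cos_theta))


def rms_distance_from_mass_centre(rms_end_to_end):
    """Root mean square distance of carbon centres from the mass centre (long chains)."""
    return rms_end_to_end / math.sqrt(6.0)


rms_l = long_chain_rms_end_to_end(18)
rms_r = rms_distance_from_mass_centre(rms_l)
print(round(rms_l, 2), round(rms_r, 2))
```

Under these assumptions the two values agree with the 8.98 Å and 3.67 Å given for the long-chain formula.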

This value of $(\overline{R^2})^{1/2}$ is considerably less than the experimental collision radius at
20° C, viz. 4.55 Å; no allowance has been made, however, for the van der Waals
radius of the CH2 group in the value 3.67 Å for $(\overline{R^2})^{1/2}$. Moreover, the temperature
dependence of $(\overline{R^2})^{1/2}$ is small, being confined to the variation in $l$. It is possible that
no direct comparison can be made between $(\overline{R^2})^{1/2}$ and the radius found from collision
formulae. There is a second possibility, however, namely, that the rotation about
a bond is not free. Thermodynamic studies on the lower hydrocarbons have proved
that energy barriers to free rotation exist, and this work has been extended to longer
chains by Taylor (1948). Taylor finds that for long chains

$$\overline{R^2} = \frac{z l^2}{6}\left(\frac{1+\cos\theta}{1-\cos\theta}\right)\left(\frac{1+a'}{1-a'}\right),$$

where

$$a' = \frac{\int_0^{2\pi} e^{-V(\phi)/RT}\cos\phi\,d\phi}{\int_0^{2\pi} e^{-V(\phi)/RT}\,d\phi},$$

$V(\phi)$ being the potential energy for rotation about the $z - 2$ interior bonds, the zero
of the angle of rotation $\phi$ being in the trans position. Taylor writes

$$V(\phi) = \tfrac{1}{2}V_0[x(1-\cos\phi) + (1-x)(1-\cos 3\phi)],$$

where $V_0$ is the maximum value of $V$ and the factors $x$ and $(1 - x)$ allow for the
contributions of one- and threefold components respectively. From the thermodynamic
data for the lower hydrocarbons $V_0$ = 4100 cal./mole, and $x$ = 0.26; the factor
$[(1+a')/(1-a')]^{1/2}$ has, according to Taylor, the following values: −50° C 2.02, 0° C 1.76,
50° C 1.59, 100° C 1.48, 150° C 1.41, 200° C 1.36, 300° C 1.29, 400° C 1.23. There is
thus a considerable temperature dependence of $(\overline{R^2})^{1/2}$ due to the change in statistical
configuration, quite apart from any consideration of decreasing radius due to
increasing velocity of impact on increasing the temperature. We have interpolated
graphically the values of $[(1+a')/(1-a')]^{1/2}$ given in table 6, and find that our results
on the radius deduced from collision formulae are considerably less than

$$\left[\frac{z l^2}{6}\left(\frac{1+\cos\theta}{1-\cos\theta}\right)\left(\frac{1+a'}{1-a'}\right)\right]^{1/2}.$$

Thus, for octadecane at 20° C Taylor’s formula gives for $(\overline{R^2})^{1/2}$ the value 6.19 Å,
and at 35° C the value 6.00 Å. If allowance is made for the van der Waals radius of
CH2 the discrepancy will be greater.
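
The restricted-rotation factor can be estimated numerically. The sketch below assumes that a' is the Boltzmann-weighted mean of cos φ over one full rotation, with a potential made up of one- and threefold components, a maximum of 4100 cal./mole and x = 0.26; the gas constant of 1.987 cal./(mole K), the integration grid and all function names are our own choices, so this is an illustration rather than Taylor's own procedure.

```python
import math

GAS_CONSTANT_CAL = 1.987  # cal/(mole K), assumed value


def rotation_potential(phi, v0=4100.0, x=0.26):
    """Hindering potential (cal/mole) with one- and threefold components; zero at trans."""
    return 0.5 * v0 * (x * (1.0 - math.cos(phi)) + (1.0 - x) * (1.0 - math.cos(3.0 * phi)))


def mean_cos_phi(temp_k, n_steps=20000):
    """Boltzmann-weighted mean of cos(phi) over a full rotation (uniform grid sum)."""
    numerator = 0.0
    denominator = 0.0
    for i in range(n_steps):
        phi = 2.0 * math.pi * i / n_steps
        weight = math.exp(-rotation_potential(phi) / (GAS_CONSTANT_CAL * temp_k))
        numerator += weight * math.cos(phi)
        denominator += weight
    return numerator / denominator


def restriction_factor(temp_k):
    """Square root of (1 + a') / (1 - a') at the given absolute temperature."""
    a_prime = mean_cos_phi(temp_k)
    return math.sqrt((1.0 + a_prime) / (1.0 - a_prime))


for temp_c in (0.0, 20.01, 100.0):
    print(temp_c, round(restriction_factor(temp_c + 273.15), 2))
```

Multiplying the free-rotation value of 3.67 Å by the factor at 20° C gives a figure close to the 6.19 Å quoted above.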

On the assumption of a Gaussian error distribution for a long chain with free
rotation, the most probable and the mean values of L may be deduced from the mean
square values. Thus the mean value of R = 3.36 Å for octadecane. The various results
we have obtained may be summarized (making a rough allowance for the van der
Waals radius of CH2):

$(\overline{R^2})^{1/2}$ + van der Waals radius of CH2 = 5.67 Å; free rotation.

$\overline{R}$ + van der Waals radius of CH2 = 5.36 Å; free rotation.

$(\overline{R^2})^{1/2}$ + van der Waals radius of CH2 = 8.19 Å; restricted rotation.

Our results would not be greatly changed if we had used the first approximation for
rigid elastic spheres of Chapman & Cowling (1939) for the calculation of $S_{12}$; the use
of this approximation would involve increasing our values of $S_{12}$ by 7 %. Higher
approximations would involve changes of only a few per cent. Although no
identification of the experimental radius with the above radii is possible on theoretical
grounds, it would have been expected that the theoretical result would be closer to
the experimental value than is observed to be the case for restricted rotation. In
this connexion it should be remembered that intramolecular forces, which will
favour coiling, are neglected in Taylor’s theory.

Our results seem to be best interpreted on the hypothesis of a helical molecule, 
as a representation of the mean position, with the ends at the statistical distance 
apart, for free rotation. We have constructed such a model from units kindly lent 
by Professor M. G. Evans, the units being moulded to give the correct relative van 
der Waals radii in the crystal. It will be seen from figure 1, that a coiled helix is 
possible for n-octadecane, the molecule assuming an approximately spherical form.



Figure 1. Coiled form of C18H38

Figure 2. Straight chain form of C18H38

Figure 3. Tetra-n-butyl methane

This is in agreement with the close values of the radii of n-C16H34, n-C17H36 and n-C18H38,
since a similar representation is possible for the two lower hydrocarbons. For
comparison figures 2 and 3 give the straight-chain form of octadecane, and the
branched-chain molecule C17H36, tetra-n-butyl methane. The hypothesis of coiling had
previously been advanced by Mack and his co-workers to explain the comparatively
low values of the collision radii of the lower hydrocarbons, although with
hydrocarbons up to C9H20 the ‘helix’, on our model, would occupy scarcely one turn.
When the theory of Taylor is applied to these lower hydrocarbons discrepancies
appear similar to those discussed above.

Our thanks are due to the Shell Refining and Marketing Company, Ltd., for a 
grant and for generously allowing one of us (A.D.S.) to take up the work at Leeds, 
in their employ, and to submit the work in partial fulfilment of the requirements for 
a Ph.D. degree at the University of Leeds. We are also grateful to Imperial Chemical 
Industries, who gave a grant for chemicals, and to Professor M. G. Evans, F.R.S., for
his constant interest and encouragement. 


References

Birks, & Bradley, R. S. 1949 Proc. Roy. Soc. A, 198, 226.
Bradley, R. S., Evans, M. G. & Whytlaw-Gray, R. W. 1946 Proc. Roy. Soc. A, 186, 368.
Burk, R., Laskowski, L. & Lankelma, H. P. 1941 J. Amer. Chem. Soc. 63, 3248.
Carey, P. C. & Smith, J. C. 1933 J. Chem. Soc. pp. 346, 1348.
Chapman, S. & Cowling, T. G. 1939 Mathematical theory of non-uniform gases. Cambridge
University Press.
Eyring, H. 1932 Phys. Rev. 39, 746.
Fuchs, N. 1934 Phys. Z. Sowjet. 6, 225.
Furry, W. H. 1948 Amer. J. Phys. 16, 163.
Huggins, M. 1939 J. Phys. Chem. 43, 1083.
Jeans, J. 1940 An introduction to the kinetic theory of gases. Cambridge University Press.
Knudsen, M. 1909 Ann. Phys., Lpz., 29, 179.
Melaven, R. M. & Mack, E. 1932 J. Amer. Chem. Soc. 54, 888.
Muller, A. & Saville, W. B. 1925 J. Chem. Soc. 127, 699.
Muller, A. 1927 Proc. Roy. Soc. A, 124, 317.
Muller, A. 1930 Proc. Roy. Soc. A, 127, 417.
Muller, A. 1932 Proc. Roy. Soc. A, 138, 514.
Muller, A. 1936 Proc. Roy. Soc. A, 154, 624.
Muller, A. 1941 Proc. Roy. Soc. A, 178, 227.
von Obermayer 1880 S.B. Akad. Wiss. Wien. 81, 1102.
Rossini, F. D. 1945 Bur. Stand. J. Res., Wash., 34, 283.
Stewart, J. 1928 Phys. Rev. 31, 174.
Taylor, W. J. 1948 J. Chem. Phys. 16, 257.
Titani, T. 1929 Bull. Chem. Soc. Japan, 4, 277.
Titani, T. 1930 Bull. Chem. Soc. Japan, 5, 98.
Ubbelohde, A. R. 1938 Trans. Faraday Soc. 34, 282.
Warren, B. W. 1933 Phys. Rev. 44, 969.
Zilberman-Granovskaja, A. A. & Schugan, E. A. 1940 J. Phys. Chem. U.S.S.R. 14, 759.



The autoxidation of tetralin 


By C. H. Bamford and M. J. S. Dewar
Courtaulds Ltd., Maidenhead, Berks

(Communicated by A. H. Wilson, F.R.S. Received 28 December 1948)

The autoxidation of tetralin can be photosensitized by suitable vat dyes. This reaction can
therefore be investigated by standard photochemical techniques. The kinetics have in this
way been shown to conform to the scheme suggested by Bolland & Gee (1946) for autoxidations.
The absolute velocity constants for the elementary processes have been evaluated, and the
activation energies and frequency factors estimated (table 4).

The frequency factors of free radical reactions in solution have been discussed. It is suggested 
that the ‘normal’ frequency factor for such a reaction is about 10^. 

It is pointed out that the techniques described in this paper should be generally applicable to 
the study of free radical reactions in solution. 


Introduction 

The autoxidation of tetralin has previously been studied by George, Rideal & Robertson
(1946), George & Robertson (1946a, b), George (1946a, b) and Robertson & Waters
(1946). These authors now seem to agree that the oxidation is a radical chain process,
following the general scheme of Bolland & Gee (1946), although the evidence that the
reaction does follow these kinetics is somewhat scanty.

We have recently found that many vat dyes are able to photosensitize the autoxidation
of tetralin (Bamford & Dewar 1948a). By initiating the reaction in this way it is
possible to make use of standard photochemical techniques, and we have thus been
able to confirm the general mechanism, and also to determine the absolute values of 
the velocity constants of the individual radical reactions. 

Experimental method 

The reactions were carried out in a Warburg type apparatus, and pressure changes
were followed by means of a dibutyl phthalate manometer read to 0.01 mm. with
a cathetometer. The consumption of oxygen could be calculated from the pressure
change and the dimensions of the apparatus. The reaction vessel was a spherical
bulb of approximately 20 ml. capacity, attached to the manometer by a glass spiral,
and shaken mechanically at constant speed. The rate of shaking was normally about
7 times per sec., and was measured stroboscopically. Five ml. of tetralin were used
in each run. The whole apparatus was immersed in a transparent thermostat
(±0.01° C). The system could be evacuated to 10^{-?} mm. Hg by a three-stage
mercury-diffusion pump backed by an oil pump and could be filled with oxygen
(previously purified by fractional distillation) at pressures measured on mercury or
Apiezon oil manometers.

In the photochemical experiments a 1 kW projection lamp was used as light
source. A 2 l. flask filled with 10 % copper sulphate solution was used as a
condensing lens, and served also to remove infra-red radiation. The solution was
circulated by a pump to avoid overheating. The rest of the system consisted of an iris
diaphragm, an electromagnetically operated Compur shutter, and two collimating
lenses. Filters of Crookes A glass and saturated sodium nitrite solution could be
inserted. Intensities were measured by a calibrated thermopile potentiometer
system. Normally both filters were used, cutting off light with λ < 4200 Å.

Purification of materials 

Tetralin was shaken with concentrated sulphuric acid, and after washing and
drying it was boiled under reflux for several hours with metallic sodium, in a nitrogen
atmosphere. It was then distilled under nitrogen through a 15-plate column, and
preserved in evacuated ampoules.

The dyes, specially pure specimens of which were kindly supplied by I.C.I., were
further purified by sublimation in vacuo. Solutions of the dyes in tetralin (0.05 mg./ml.)
were made up in evacuated tubes.

Benzoyl peroxide was crystallized from chloroform and light petroleum, and dried
in vacuo.

Hydroquinone, and α- and β-naphthols were pure commercial specimens.


Kinetic scheme 

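It is convenient to give the complete kinetic scheme at this stage. This differs from
the scheme of Bolland & Gee only in the mode of initiation of the chains:

$$
\left.
\begin{aligned}
D + h\nu &\rightarrow D^{*} && (I_{\text{abs.}}) \\
D^{*} &\rightarrow D && (k_f) \\
D^{*} + O_2 &\rightarrow D + O_2 && (k_1) \\
D^{*} + TH &\rightarrow DH + T && (k_2) \\
T + O_2 &\rightarrow TO_2 && (k_3) \\
TO_2 + TH &\rightarrow TO_2H + T && (k_4) \\
2T &\rightarrow && (k_5) \\
T + TO_2 &\rightarrow && (k_6) \\
2TO_2 &\rightarrow && (k_7)
\end{aligned}
\right\} \quad (1)
$$

D and TH represent dye and tetralin respectively; T is a radical formed from tetralin
by loss of a hydrogen atom, and TO$_2$ is a peroxide radical derived from T. For long
chains, with the approximation $k_6^2 = k_5 k_7$ (Bolland & Gee 1946), this scheme
leads to

$$-\frac{d[O_2]}{dt} = \left\{\frac{I_{\text{abs.}}\,k_2[TH]}{k_f + k_1[O_2]_l + k_2[TH]}\right\}^{1/2} \frac{k_3 k_4 [O_2]_l [TH]}{k_3 k_7^{1/2}[O_2]_l + k_4 k_5^{1/2}[TH]}, \quad (2)$$

where $[O_2]_l$ is the concentration of oxygen in the liquid. The term in the first bracket
on the right of (2) gives the rate of chain starting $I$. These equations assume that the
light absorption is weak. It was established that this was so in the present
experiments.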




Results at high oxygen pressures

At sufficiently high oxygen pressures the second term on the right of (2) becomes
effectively constant. If the rate of initiation is constant the rate of reaction should
then be independent of the oxygen pressure. Initiation by benzoyl peroxide should
fulfil this condition (cf. Bolland 1948). Table 1 shows that in fact the rate of reaction
remained sensibly constant down to pressures of about 50 mm. At low pressures the
rate fell as equation (2) requires; the dependence of the rate on the pressure will be
discussed later.


Table 1. Autoxidation of tetralin initiated by benzoyl peroxide

T = 25° C, benzoyl peroxide, 1 %            T = 45° C, benzoyl peroxide, 0.1 %

oxygen pressure   -10^? d[O2]/dt            oxygen pressure   -10^? d[O2]/dt
(mm. Hg)          (mol. l.^-1 sec.^-1)      (mm. Hg)          (mol. l.^-1 sec.^-1)
600               3.09                      600               5.98
400               3.04                      400               5.97
300               3.04                      200               5.95
200               3.01                      100               6.98
50                2.98                      5.42              5.64
10                2.91                      2.89              5.30
1.93              2.72                      1.45              4.57
1.56              2.64                      1.30              4.31
1.16              2.63                      0.99              4.02
0.772             2.30                      0.685             3.37
0.618             2.07                      0.640             3.03
0.463             1.85                      0.535             2.77
0.386             1.68                      0.383             2.14
0.309             1.48                      0.283             1.73
0.231             1.21
0.154             0.86
0.116             0.66
0.077             0.47
0.039             0.26
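
The fall in rate at low pressures can be examined with a simple linearization. The sketch below assumes that, at a constant rate of initiation and with Henry's law holding, equation (2) gives a rate of the form rate = rate_max × p/(p + p_half), so that 1/rate is linear in 1/p. It uses the 25° C entries of table 1 below 2 mm.; the names `rate_max` and `p_half` are ours, and the rates are in the scaled units of the table.

```python
# Low-pressure data at 25 deg C from table 1 (pressure in mm. Hg, rate in table units).
pressures = [1.93, 1.56, 1.16, 0.772, 0.618, 0.463, 0.386,
             0.309, 0.231, 0.154, 0.116, 0.077, 0.039]
rates = [2.72, 2.64, 2.63, 2.30, 2.07, 1.85, 1.68,
         1.48, 1.21, 0.86, 0.66, 0.47, 0.26]


def linear_fit(xs, ys):
    """Ordinary least-squares slope and intercept of y against x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


# Assumed hyperbolic form: 1/rate = 1/rate_max + (p_half/rate_max) * (1/p).
inverse_p = [1.0 / p for p in pressures]
inverse_rate = [1.0 / r for r in rates]
slope, intercept = linear_fit(inverse_p, inverse_rate)

rate_max = 1.0 / intercept      # limiting rate at high pressure (table units)
p_half = slope / intercept      # pressure at which the rate is half the limit (mm. Hg)
print(round(rate_max, 2), round(p_half, 2))
```

A reciprocal plot weights the lowest pressures most heavily, so the fitted limiting rate need not coincide exactly with the plateau observed above 50 mm.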




When the reaction is photosensitized by dyes, the rate for a given dye and light
intensity is not constant at high oxygen pressures, but falls off as the pressure
increases. This indicates that the rate of initiation must depend on the pressure. It
is known that excited dyes are deactivated by oxygen. The second and third equations
in scheme (1) allow for spontaneous (or solvent) deactivation and deactivation by
oxygen respectively. From equation (2) it follows that at high oxygen pressures

$$\left(-\frac{d[O_2]}{dt}\right)^{-2} = \frac{k_7\,(k_f + k_2[TH] + k_1[O_2]_l)}{I_{\text{abs.}}\,k_2 k_4^2 [TH]^3}. \quad (3)$$

Assuming that solutions of oxygen in tetralin obey Henry’s law a plot of $(d[O_2]/dt)^{-2}$
against pressure should be linear, and the ratio of slope to intercept gives the ‘relative
quenching coefficient’ $q$ for oxygen, defined by

$$q = \frac{k_1}{k_f + k_2[TH]}. \quad (4)$$




Figure 1 shows that (3) holds for Caledon Red BN and Caledon Golden Yellow GK,
the values of q at 25° C being 2.89 × 10^? and 1.27 × 10^? respectively with the oxygen
concentration in moles/litre. Throughout this paper the absorption coefficient of
oxygen in tetralin is taken to be 0.10 (Fischer & Pfleiderer 1922), and the heat of
solution has been neglected. The value of q for Red BN changes little with
temperature, the value at 0° C estimated from figure 1 being 3.64 × 10^?.

Figure 1. Determination of quenching. Curve (1), □, Caledon Golden Yellow GK, 25° C.
Curve (2), ○, Caledon Red BN, 25° C. Curve (3), ×, Caledon Red BN, 0° C.

Effect of intensity 

According to equation (2), the rate of reaction should be proportional to the square
root of the intensity at constant oxygen pressure. A plot of log10 I against log10 (rate)
is given in figure 2. The intensity exponent is 0.55; the small discrepancy is probably
due to the non-uniformity of the light beam across which the reaction vessel is
shaken, and which would vary with the setting of the iris diaphragm.

Measurement of the rate of chain starting

In order to determine the absolute velocity constants it is necessary to know the 
rate of chain starting. This can convoiiently be measured by an inhibitor method. 
BoUand & ten Have (19470, 6) have shown that in the autoxidation of ethyl linoleate 

17-2 



256 C. H. Bamford and M. J. S. Dewar 

inhibited by phenols, only the peroxide radicals are removed by the inhibitor. 
Bolland & ten Have could not obtain a reliable value for the rate of chain starting, 
since in the peroxide-initiated reaction the nature of the initial radicals is uncertain. 
In our case, however, with photochemical initiation, the initial radicals are tetralyl, 
and each radical must react with oxygen before it can be removed by the inhibitor. 
The rate of oxygen uptake extrapolated to infinite inhibitor concentration should 



Figure 2. Dependence of rate on light intensity. 


then equal the rate of chain starting. The reaction below (velocity constant $k_8$) 
must be included in the kinetic scheme: 

\[ TO_2 + P \rightarrow \text{inert products} \quad (k_8). \tag{5} \]

For high inhibitor and oxygen concentrations this leads to 

\[ -\frac{d[O_2]}{dt} = I\left\{1 + \frac{k_4[TH]}{k_8[P]}\right\}. \tag{6} \]


Under these conditions a plot of rate against $1/[P]$ should be a straight line, and the 
intercept corresponding to $1/[P] = 0$ should be independent of the nature of the 
inhibitor and equal to the rate of chain starting. Figure 3 shows that equation (6) 
holds for reactions initiated by Cibanone Yellow 2GR and inhibited by α- and 
β-naphthols and hydroquinone. The fact that the intercept ($2.50 \times 10^{-7}$ mol l⁻¹ sec⁻¹) is 
not zero provides confirmation that the inhibitors do not react appreciably with the 
tetralyl radicals. The rate of reaction in the absence of inhibitors, under the same 
conditions, was $1.055 \times 10^{-5}$ mol l⁻¹ sec⁻¹. Hence at 25° C 


\[ I = 2.25 \times 10^{3}\,(d[O_2]/dt)^2. \tag{7} \]

Similar measurements at 45° C lead to the relation 

\[ I = 0.915 \times 10^{3}\,(d[O_2]/dt)^2. \tag{8} \]
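The arithmetic behind relation (7) can be sketched as follows. This is only an illustration: the numerical readings (intercept 2.50 × 10⁻⁷ and uninhibited rate 1.055 × 10⁻⁵ mol l⁻¹ sec⁻¹) are assumed readings of the printed values, and the function name is hypothetical.

```python
# Readings of the values printed in the text (Cibanone Yellow 2GR, 25 deg C).
# The exponents are assumptions where the print is unclear.
intercept = 2.50e-7           # rate at 1/[P] = 0, taken as I (mol l^-1 sec^-1)
uninhibited_rate = 1.055e-5   # rate without inhibitor (mol l^-1 sec^-1)


def initiation_coefficient(rate_of_initiation, rate):
    """Return c in I = c * rate**2, i.e. rate proportional to sqrt(I)."""
    return rate_of_initiation / rate ** 2


coefficient = initiation_coefficient(intercept, uninhibited_rate)
chain_length = uninhibited_rate / intercept  # oxygen molecules per chain started

print(f"I = {coefficient:.3g} * rate^2")
print(f"kinetic chain length = {chain_length:.0f}")
```

Once the coefficient is known, the rate of chain starting in any other run under the same illumination conditions follows from the measured rate alone.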




Quenching of the excited dye by oxygen is irrelevant in these measurements, since 
all the experiments were carried out at the same oxygen pressure (600 mm.). It is, 
however, assumed that the phenol does not deactivate the excited dye significantly. 
The results of Weber (1948) and the results with the different inhibitors shown in 
figure 3 suggest that this is so under our conditions. 



Figure 3. Determination of rate of inhibition. 
○ β-naphthol, □ hydroquinone, ◇ α-naphthol. 


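At high oxygen pressures equation (2) reduces to 

\[ -\frac{1}{I^{1/2}}\frac{d[O_2]}{dt} = \frac{k_4[TH]}{k_7^{1/2}}. \tag{9} \]

From (7), (8) and (9) and with $[TH] = 7.36$ mol l⁻¹, 

\[ k_4 k_7^{-1/2} = 2.87 \times 10^{-3}\ \mathrm{l^{1/2}\,mol^{-1/2}\,sec^{-1/2}} \text{ at 25° C}, \qquad k_4 k_7^{-1/2} = 4.50 \times 10^{-3}\ \mathrm{l^{1/2}\,mol^{-1/2}\,sec^{-1/2}} \text{ at 45° C}. \tag{10} \]

Thus $E_4 - \tfrac{1}{2}E_7 = 4.3$ kcal. 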
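The step from relations (7) to (9) to the ratio in (10), and then to the activation energy difference, can be checked numerically. The sketch below assumes the coefficients 2.25 × 10³ and 0.915 × 10³ in (7) and (8) and temperatures of 25 and 45° C; the function names are illustrative, not the authors'.

```python
import math

R_KCAL = 1.987e-3   # gas constant, kcal mol^-1 K^-1
TH = 7.36           # concentration of tetralin, mol l^-1 (from the text)


def k4_over_sqrt_k7(coefficient, th=TH):
    """From I = coefficient * rate**2 and rate = k4[TH] * sqrt(I / k7)."""
    return 1.0 / (th * math.sqrt(coefficient))


def activation_energy(k_low, t_low, k_high, t_high):
    """Two-temperature Arrhenius estimate, kcal/mol."""
    return R_KCAL * math.log(k_high / k_low) / (1.0 / t_low - 1.0 / t_high)


ratio_25 = k4_over_sqrt_k7(2.25e3)    # assumed reading of relation (7)
ratio_45 = k4_over_sqrt_k7(0.915e3)   # assumed reading of relation (8)
energy = activation_energy(ratio_25, 298.15, ratio_45, 318.15)

print(f"k4/sqrt(k7) at 25 C: {ratio_25:.3g}")
print(f"k4/sqrt(k7) at 45 C: {ratio_45:.3g}")
print(f"E4 - E7/2 = {energy:.2f} kcal")
```

With only two temperatures the energy is sensitive to small errors in either ratio, so agreement to within a few tenths of a kcal is all that should be expected.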

Incidentally, this technique cannot be used for reactions initiated by benzoyl 
peroxide. First, the nature of the initial radicals is uncertain; benzoate radicals 
would be removed by the inhibitor. Secondly, if phenyl radicals were formed, the 
concomitant evolution of carbon dioxide would interfere, and this effect would be 
augmented by any primary recombination of phenyl and benzoate radicals. In 
attempting to measure rates of chain starting by this method, we have in fact 
observed pressure increases, which imply that the rate of evolution of carbon dioxide 
is faster than the absorption of oxygen. 


Absolute determination of $k_4$ and $k_7$ by the rotating sector method 
From (1) it follows that the mean lifetime $\tau$ of the kinetic chain is given by 

1 h[o^i+k^[m] 


( 11 ) 




At high oxygen pressures this reduces to 

\[ \tau = 1/\sqrt{k_7 I}. \tag{12} \]

Thus the determination of $\tau$ at high oxygen pressures enables $k_4$ and $k_7$ to be calculated 
absolutely from equations (10) and (12). 

Measurement of the lifetime was carried out by the rotating sector method of Briers, 
Chapman & Walters (1926), using a 1:1 ratio. Since periods of interruption of the 
order of seconds were required, a shutter was used in place of a rotating sector. The 
shutter was operated by a commutator driven by a synchronous motor. The results 



Figure 4. Determination of lifetime by the sector method (abscissa: duration of cycle in sec.). 


were analyzed in the manner recommended by Dickinson (see Noyes & Leighton 
1941) and followed the theoretical curve within the limits of experimental error (see 
figure 4). Further, the lifetimes were proportional to $I^{-1/2}$ (table 2), showing that the 
life of the excited dye was negligible compared to that of the chain (otherwise the 
measured lifetimes would be given by $\tau$ + constant). 


Table 2. Dependence of lifetimes on intensity 

−d[O₂]/dt × 10⁶        I^½ × 10⁴                  τ (sec.)    τI^½ × 10⁴ 
(mol l⁻¹ sec⁻¹)        (mol l⁻¹ sec⁻¹)^½ 

12.95                  6.14                       0.24        1.47 
 4.03                  1.96                       0.86        1.69 
 1.61                  0.76                       2.03        1.54 


From table 2 and equations (10) and (12) we have 

\[ k_4 = 18.3\ \mathrm{l\,mol^{-1}\,sec^{-1}}, \qquad k_7 = 4.1 \times 10^{7}\ \mathrm{l\,mol^{-1}\,sec^{-1}}. \tag{13} \]
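The route from the sector measurements to the constants in (13) can be illustrated with a few lines of arithmetic. The sketch assumes that equation (12) has the form τ = 1/√(k₇I), that the table 2 columns carry multipliers of 10⁻⁴ for I^½, and that k₄k₇^(−½) = 2.87 × 10⁻³ at 25° C; averaging τI^½ over the three runs is an illustrative choice, not necessarily the authors' procedure.

```python
import math

# (sqrt(I), tau) pairs read from table 2; the 1e-4 multiplier is an assumption.
runs = [(6.14e-4, 0.24), (1.96e-4, 0.86), (0.76e-4, 2.03)]

# tau * sqrt(I) should be constant if tau = 1 / sqrt(k7 * I).
products = [sqrt_i * tau for sqrt_i, tau in runs]
mean_product = sum(products) / len(products)

k7 = 1.0 / mean_product ** 2      # l mol^-1 sec^-1
k4 = 2.87e-3 * math.sqrt(k7)      # from the ratio k4 / sqrt(k7) at 25 deg C

print(f"tau*sqrt(I) values: {[f'{p:.3g}' for p in products]}")
print(f"k7 = {k7:.2g} l/mol/sec, k4 = {k4:.1f} l/mol/sec")
```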





Results at low oxygen pressures 


According to equation (2), if $[O_2]_l$ is always proportional to the concentration of 
oxygen in the gas phase, $(O_2)_g$, 

KO^l . b 




dt 


= a+ 


( 0 .)/ 


(14) 


where a, b are constants for a given rate of chain starting. The experimental results, 
however, do not fit equation (14), as will be seen from figure 5. The rate falls off too 
rapidly at low oxygen pressures. Thus either the kinetic scheme (1) must be wrong 
or the solution of oxygen must be rate determining at low oxygen pressures. The 
latter explanation was shown to be correct by experiments with interrupted light. 



Figure 5. Benzoyl peroxide 1%, 25° C. 


At low oxygen pressures the rate was much the same in continuous and intermittent 
light, a fact which can be understood only if solution of oxygen is rate-determining. 
It may be assumed that the rate of solution of oxygen is proportional to the difference 
between the saturation concentration and the actual concentration in the liquid, 
i.e. that 

\[ -d(O_2)/dt = k\{s(O_2)_g - [O_2]_l\}, \tag{15} \]

where $k$ is constant at constant temperature and constant rate of shaking, $s$ is the 
absorption coefficient, and $-d(O_2)/dt$ represents the rate of disappearance of oxygen 
from the gas phase, in moles per litre of solution per second. From (14) and (15) it 
is easy to show that 



dt ■ 


(16) 


It can be seen from (14) that $a$ is the limiting value of the left-hand side of that 
equation at high oxygen pressures. Thus the left-hand side of equation (16) can be 
calculated in any experiment, and, by plotting it against $-d(O_2)/dt$, a straight line 
should be obtained from which $b$ and $k$ can be found. Figures 6 and 7 show this plot 
for thermal initiation by benzoyl peroxide and for photo-initiation by Caledon Red BN 
at 25° C. The experimental points lie satisfactorily on straight lines. From (2), 

(17) 



Figure 6. Benzoyl peroxide, 1%, 25° C. Figure 7. Caledon Red BN, 25° C. 
(Abscissa in both figures: $-d(O_2)/dt$, mol l⁻¹ sec⁻¹.) 

(/= 1-98 X 10-10) . ( j = 2-26 X 10-0). 



Figure 8. Benzoyl peroxide 1%, 25° C (abscissa: $1/[O_2]_l$, l mol⁻¹). 




and $bI^{1/2}$ should be constant. The values obtained for this quantity are $3.94 \times 10^{-5}$ 
(figure 6) and $3.90 \times 10^{-5}$ (figure 7). We may take the value of $bI^{1/2}$ to be $3.94 \times 10^{-5}$ 
at 25° C, since figure 6 represents the more accurate series of experiments. Using 
the value of $k$ from figure 6, values of $[O_2]_l$ may be calculated from (15). According 
to equation (14) a plot of the reciprocal rate, $-(d[O_2]/dt)^{-1}$, against $1/[O_2]_l$ should be 
a straight line. Figure 8 
shows that this is so. Although the straight lines in figures 6 and 8 are consequences 
of the same equations, we consider that $b$ can be found more conveniently from 
figure 8. In this case the value agrees almost exactly with that obtained from 
figure 6. It will be noticed that the magnitudes of $k$ found from figures 6 and 7 are 
not quite the same. The value of $k$ should be sensitive to the experimental conditions, 
e.g. the exact position of the reaction bulb, and much greater variations were 
obtained by altering the rate of shaking. 
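The correction for slow solution of oxygen can be sketched in code. The block below assumes equation (15) in the form rate = k{s(O₂)_g − [O₂]_l} and, as a further assumption, a reciprocal rate law 1/rate = a + b/[O₂]_l for equation (14). All numerical values are hypothetical and serve only to show that the procedure recovers a and b from synthetic data.

```python
def dissolved_oxygen(rate, o2_gas, k, s=0.10):
    """[O2]_l from equation (15): rate = k * (s * (O2)_g - [O2]_l)."""
    return s * o2_gas - rate / k


def fit_line(xs, ys):
    """Ordinary least squares; returns (intercept, slope)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
             / sum((x - mean_x) ** 2 for x in xs))
    return mean_y - slope * mean_x, slope


# Hypothetical constants, used only to generate synthetic "measurements".
a_true, b_true, k_shake, s_abs = 3.0e6, 2.0e3, 0.05, 0.10
liquid = [2e-4, 5e-4, 1e-3, 2e-3]                       # true [O2]_l, mol/l
rates = [1.0 / (a_true + b_true / c) for c in liquid]   # assumed rate law
gas = [(c + r / k_shake) / s_abs for c, r in zip(liquid, rates)]

# Analysis: correct the gas-phase values with (15), then fit 1/rate vs 1/[O2]_l.
corrected = [dissolved_oxygen(r, g, k_shake, s_abs) for r, g in zip(rates, gas)]
a_fit, b_fit = fit_line([1.0 / c for c in corrected], [1.0 / r for r in rates])

print(f"a = {a_fit:.4g}, b = {b_fit:.4g}")
```

Plotting the reciprocal rate directly against 1/(s(O₂)_g) without this correction would bend the line at low pressures, which is the behaviour described for figure 5.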



Figure 9. Benzoyl peroxide 0.1%, 45° C ($I = 3.26 \times 10^{-10}$). 

Figure 9 shows a plot of equation (16) at 45° C, the reaction being initiated by 
benzoyl peroxide. Here again the points lie satisfactorily on a straight line. From 
these results we have 

\[ k_3/k_5^{1/2} = 2.54 \times 10^{4}\ \mathrm{l^{1/2}\,mol^{-1/2}\,sec^{-1/2}} \text{ at 25° C}, \qquad k_3/k_5^{1/2} = 2.20 \times 10^{4}\ \mathrm{l^{1/2}\,mol^{-1/2}\,sec^{-1/2}} \text{ at 45° C}, \tag{18} \]

whence $E_3 - \tfrac{1}{2}E_5 = -1.3$ kcal. 
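As a consistency check on (18), the sketch below recomputes the energy difference from the two ratios and compares the reciprocal of the 25° C ratio with the quoted value of bI^½. It assumes the ratios are 2.54 × 10⁴ and 2.20 × 10⁴, and it assumes, as a hypothesis rather than a statement from the text, that bI^½ = k₅^½/k₃.

```python
import math

R_KCAL = 1.987e-3  # gas constant, kcal mol^-1 K^-1

ratio_25 = 2.54e4  # k3 / sqrt(k5) at 25 deg C (assumed reading)
ratio_45 = 2.20e4  # k3 / sqrt(k5) at 45 deg C (assumed reading)

# Two-temperature Arrhenius estimate of E3 - E5/2 in kcal.
energy = (R_KCAL * math.log(ratio_45 / ratio_25)
          / (1.0 / 298.15 - 1.0 / 318.15))

# Hypothesis: b * sqrt(I) = sqrt(k5) / k3, the reciprocal of the ratio above.
b_sqrt_i = 1.0 / ratio_25

print(f"E3 - E5/2 = {energy:.2f} kcal")
print(f"b*sqrt(I) = {b_sqrt_i:.3g}")
```

The negative sign simply reflects that the ratio falls with rising temperature, which requires the termination step to have the larger activation energy.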

The photochemical after-effect 

As stated above, the rotating sector cannot be used under conditions where the 
solution of oxygen is rate-determining. It is, however, possible in principle to use 
the photochemical after-effect to determine the velocity constants absolutely, 




provided that suitable corrections can be applied. It may readily be shown from 
equation (2) that the photochemical after-effect $\Delta O_2$ is given by 

in which $-d[O_2]/dt$ is the rate in the light, and $g$ is the ratio of the rates in the light 
and in the dark. (If the rate in the dark is not sufficiently large, it is convenient to 
supply continuous weak illumination.) The total after-effect measured when the 
light is cut off is 

\[ \Delta' O_2 = \Delta O_2 + [O_2]_l(\text{dark}) - [O_2]_l(\text{light}) = \Delta O_2 + s(O_2)_g - [O_2]_l(\text{light}) \tag{20} \]


from equation (15). In deriving (19) it has been assumed that the concentration of 
oxygen in solution during the time of the after-effect is constant, and equal to its 
value in the light. This is an approximation which will be justified below. 

It was found possible to measure $\Delta' O_2$ directly over the whole range of oxygen 
pressures. Values of $\Delta O_2$ could be calculated from (20), since $k$ was known from 
previous measurements. From (19) it follows that a plot of 


against $1/[O_2]_l$ should be a straight line. In place of $1/[O_2]_l$ it is more convenient to 
plot the equivalent expression (see equation (2)), 



where $a$ and $b$ have 
their previous significance. Figure 10 shows this plot for several series of experiments 
at 25° C, in which the rate of initiation was varied by a factor of about 10. All the 
points lie satisfactorily on the same straight line. This indicates that the change in 
$[O_2]_l$ during the after-effect is not serious under our conditions, for the error so 
introduced would be very sensitive to the rate of initiation. 

From the slope and intercept of the line in figure 10, and may be found 
absolutely (equation (19)) and hence the other velocity constants. Values are given 
in table 3. The values of $k_4$ and $k_7$ agree quite satisfactorily with those obtained by 
the rotating sector method (equation (13)). There are a number of uncertainties in 
the latter method, and we therefore consider that the values in table 3 are the more 
reliable. 


Table 3. Velocity constants at 25° C (l mol⁻¹ sec⁻¹) 

k₃    6.76 × 10⁷        k₅    7.10 × 10⁶ 
k₄    13.3              k₇    2.16 × 10⁷ 


The after-effects could not be measured satisfactorily at 45° C, but were always 
unreasonably large. The cause is not clear, but there is some evidence that 
photodecomposition of tetralin hydroperoxide at 45° C produces very active catalysts 




which interfere with the measurements. A similar effect probably accounts for the 
results of Burnett & Melville (1947b) on the benzoyl peroxide photosensitized 
polymerization of vinyl acetate. Photocatalysts are also formed in the polymerization 
of methyl methacrylate (Bamford & Dewar 1948b). It is reasonable that this 
effect should only occur in our experiments at the higher temperatures, since under 
these conditions the absorption of light by peroxide extends to larger wave-lengths. 



Figure 10. Caledon Bed BN, 26® C J(mol.l."^sec."’^): ©, first series MOx 10“®; 
□, second series 2-08 x 10“®; x , third series 3-00 x 10“®; A» foxirth series 7*40 x 10“®. 


Accordingly, measurements of the after-effect were carried out at 0° C. Measurements 
were made only at high oxygen pressures, since it was considered that 
measurements at low oxygen pressure would not be sufficiently accurate for the 
calculation of the activation energies $E_3$ and $E_5$ to be significant. It will be seen later 
that values for these can be estimated in another way. 

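Equations (2) and (19) give, for high oxygen pressures, 

\[ \frac{\Delta O_2}{\log \tfrac{1}{2}(1+g)} = \frac{k_4[TH]}{k_7}, \tag{21} \]

which may be used directly to obtain $E_4 - E_7$. The observed figures were 

\[ \frac{\Delta O_2}{\log \tfrac{1}{2}(1+g)} = 2.49 \times 10^{-6}\ \mathrm{mol\,l^{-1}} \text{ at 0° C} \]

and 

\[ \frac{\Delta O_2}{\log \tfrac{1}{2}(1+g)} = 4.64 \times 10^{-6}\ \mathrm{mol\,l^{-1}} \text{ at 25° C}, \]

whence 

\[ E_4 - E_7 = 4.1\ \text{kcal}. \tag{22} \]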





Activation energies and frequency factors 

Equations (10) and (22) give the values for $E_4$ and $E_7$ shown in table 4. Estimates 
of $E_3$ and $E_5$ may be made as follows. From (18), $E_5 \geq 2.6$ kcal, since $E_3$ cannot be 
negative. $E_5$ corresponds to the reaction between two tetralyl radicals. The activation 
energy for the reaction between two polystyryl radicals is 2.8 kcal (Bamford 
& Dewar 1948c). In both cases the radicals are substituted benzyl radicals, and in 
the case of tetralin the activation energy should if anything be less, since any 
contribution due to diffusion would be small. Thus $E_5 < 2.8$ kcal. It is therefore 
reasonable to take $E_5 = 2.6$ and $E_3 = 0.0$ kcal. 


Table 4. Activation energies and frequency factors 

E₃ = 0.0 kcal        A₃ = 6.8 × 10⁷ 
E₄ = 4.5 kcal        A₄ = 2.5 × 10⁴ 
E₅ = 2.6 kcal        A₅ = 5.5 × 10⁸ 
E₇ = 0.4 kcal        A₇ = 4.2 × 10⁷ 

Discussion 

(i) Three papers by Robertson & Waters (1948a, b, c) on the autoxidation of 
tetralin have recently appeared. As their conclusions differ from ours in certain 
respects, some comment seems to be required. 

The main difference lies in the mode of termination postulated. Robertson & Waters 
conclude that in the initial stage of the oxidation termination occurs by the reaction 
of tetralyl radicals and later by the reaction 

\[ TO_2 + OH \rightarrow TOH + O_2. \tag{23} \]

The main argument appears to be the improbability of termination mechanisms of 
other types. 

In the first place, equation (2) implies that reaction between tetralyl radicals can 
only be important at very low oxygen pressures, for under other conditions the 
concentration of tetralyl is extremely low. This might have been anticipated from 
the work of Bolland (1946). Secondly, the analytical results of Robertson & Waters 
not only can be interpreted in terms of our three termination reactions, but also throw 
some light on their mechanisms, which are probably as follows: 





The formation of dihydronaphthalene in the autoxidation at high oxygen pressures 
and the evolution of oxygen in the decomposition of tetralin hydroperoxide (Robertson 
& Waters 1948a, b) provide strong evidence for reaction (24c). Robertson 
& Waters overlooked this possibility and regarded the presence of dihydronaphthalene 
as evidence for the disproportionation of tetralyl radicals. At very high rates, 
when the chains are short, allowance would have to be made for this evolution of 
oxygen. In our experiments the correction would never exceed about 3 %. 

It is also evident that the decomposition of tetralin hydroperoxide (Robertson 
& Waters 1948b) can be interpreted in a similar way. 

(ii) Although absolute velocity constants have now been determined for a number 
of radical reactions in solution (Swain & Bartlett 1946; Burnett & Melville 1947a; 
Bamford & Dewar 1947a, 1948a, b, c, and other papers in the press from this 
laboratory), all these have involved large radicals from polymerization processes. 
The measurements described in this paper seem to be the first on reactions of small 
radicals. It is now possible to make a preliminary survey of the field, and table 5 
summarizes the data at present available for the activation energies and frequency 
factors of radical reactions in solution. The values for the propagation and transfer 
reactions in the polymerizations of styrene have been calculated from the data of 
Gregg & Mayo (1947), and Lewis, Walling, Cummings, Briggs & Mayo (1948) using 
our value (1948a) for the propagation constant in the polymerization. The values for 
vinyl acetate were obtained by Dixon-Lewis in these laboratories. 

In general the frequency factors are much lower than the ‘normal’ value for 
bimolecular reactions in solution. This can hardly be a size effect, since the termination 
reactions which involve two large radicals have in fact the highest frequency 
factors, and since the frequency factors in the tetralin reactions are also low. Attention 
was drawn to these points in a previous communication (Bamford & Dewar 1949), 
where it was suggested that the normal frequency factor for radical reactions in 
solution is probably about $10^{7}$. 

(iii) The chain lengths were in the range 26 to 600 depending on the rate of 
initiation. The lifetime of a TO₂ radical at 25° C is $1.02 \times 10^{-2}$ sec.; this is of the same 
order as the life of a polystyryl radical in the polymerization of styrene at 25° C 
($6.2 \times 10^{-3}$ sec., Bamford & Dewar 1948a). The lifetime of a T radical depends on the 
oxygen pressure. At 100 mm. and 25° C it is $2.75 \times 10^{-5}$ sec. It is interesting that the 
life of a kinetic chain is of the order of seconds under our conditions, since it has 
generally been assumed that the growth of radical chains is always extremely rapid. 
In the thermal polymerization of styrene at 0° C the mean lifetime of the kinetic 
chain is 11 hr. 
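The radical lifetimes quoted above can be reproduced from the velocity constants. The sketch below takes each lifetime as the reciprocal of the pseudo-first-order rate of the propagation step that consumes the radical, and treats the absorption coefficient 0.10 as the ratio of the oxygen concentration in the liquid to that in the gas. Both points are assumptions made for illustration, as are the readings k₃ = 6.76 × 10⁷ and k₄ = 13.3.

```python
R_GAS = 0.082057    # l atm mol^-1 K^-1
T = 298.15          # 25 deg C in K

k3 = 6.76e7         # T + O2 -> TO2, l mol^-1 sec^-1 (assumed reading of table 3)
k4 = 13.3           # TO2 + TH -> products + T, l mol^-1 sec^-1
TH = 7.36           # tetralin concentration, mol l^-1
s = 0.10            # absorption coefficient of oxygen in tetralin


def oxygen_in_liquid(pressure_mm):
    """Dissolved oxygen (mol/l) at a given oxygen pressure in mm Hg."""
    gas_concentration = (pressure_mm / 760.0) / (R_GAS * T)
    return s * gas_concentration


lifetime_to2 = 1.0 / (k4 * TH)                         # sec
lifetime_t = 1.0 / (k3 * oxygen_in_liquid(100.0))      # sec, at 100 mm

print(f"TO2 radical: {lifetime_to2:.3g} sec")
print(f"T radical at 100 mm: {lifetime_t:.3g} sec")
```

The lifetime of the kinetic chain, by contrast, is set by the termination step and the rate of initiation, which is why it is of the order of seconds.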

(iv) It is interesting to compare the rate of initiation by benzoyl peroxide with the 
published rate of decomposition of the peroxide. From the values quoted by Swain 
& Bartlett (1946) the rate of decomposition of benzoyl peroxide in 1 % solution is about 
$10^{-9}$ mol l⁻¹ sec⁻¹. The observed rate of initiation was $2.09 \times 10^{-10}$ mol l⁻¹ sec⁻¹. 
Since each molecule of peroxide could theoretically initiate two chains, there is 
clearly a discrepancy of a factor of 10. This might be explained in several ways 
(e.g. primary recombination, non-radical decomposition of the peroxide, or induced 
decomposition of the peroxide by radicals), but in any case it throws doubt on the 





Table 5 

reaction                                      E (kcal.)    log₁₀ A 

polymerization of styrene: 
    propagation                                  6.5         6.01 
    transfer                                    14.2         7.18 
    termination                                  2.8         8.49 

polymerization of vinyl acetate: 
    propagation                                  3.0         5.84 
    transfer                                     5.9         3.86 
    termination                                  0.0         8.30 

co-polymerization, reactions of polystyryl radicals with: 
    methyl methacrylate                          6.0         6.01 
    methyl acrylate                              6.1         5.90 
    diethyl maleate                              7.2         5.60 
    diethyl fumarate                             5.4         5.83 
    p-chlorostyrene                              6.1         5.91 

chain transfer reactions of styrene with solvents: 
    benzene                                     21.3        16.07 
    toluene                                     16.6        10.04 
    ethylbenzene                                12.0         4.74 
    isopropylbenzene                            12.0         4.92 
    tert-butylbenzene                           20.2        14.68 
    diphenylmethane                             10.2         3.26 
    triphenylmethane                            11.6         6.25 
    fluorene                                     9.6         5.82 
    cyclohexane                                 19.9        13.22 
    n-heptane                                   11.5         3.67 
    ethylene dichloride                         21.2        11.10* 
    ethylene dibromide                          16.2         8.48* 
    carbon tetrachloride                        10.7         6.88* 
    carbon tetrabromide                          6.7         6.69* 

autoxidation of tetralin: 
    k₃                                           0           7.82 
    k₄                                           4.5         4.40 
    k₅                                           2.6         8.74 
    k₇                                           0.4         7.62 

* Bamford & Dewar 


validity of work in which the rate of chain starting has been equated to the rate of 
decomposition of the peroxide (cf. Swain & Bartlett 1946; Bolland 1948. The work 
of the former authors will be discussed in this connexion elsewhere). 

(v) The techniques described in this paper can clearly be extended to other free 
radical reactions, e.g. the addition of halogen compounds to olefines, and work along 

such lines is in progress here. 



References 

Bamford, C. H. & Dewar, M. J. S. 1947a Disc. Faraday Soc. 2, 310. 
Bamford, C. H. & Dewar, M. J. S. 1947b Disc. Faraday Soc. 2, 314. 
Bamford, C. H. & Dewar, M. J. S. 1948a Proc. Roy. Soc. A, 192, 309. 
Bamford, C. H. & Dewar, M. J. S. 1948b Proc. Roy. Soc. A, 192, 329. 
Bamford, C. H. & Dewar, M. J. S. 1948c In course of publication. 
Bamford, C. H. & Dewar, M. J. S. 1949 Nature, 163, 266. 
Bolland, J. L. 1946 Proc. Roy. Soc. A, 186, 218. 
Bolland, J. L. 1948 Trans. Faraday Soc. 44, 669. 
Bolland, J. L. & Gee, G. 1946 Trans. Faraday Soc. 42, 236, 244. 
Bolland, J. L. & ten Have, P. 1947a Trans. Faraday Soc. 43, 201. 
Bolland, J. L. & ten Have, P. 1947b Disc. Faraday Soc. 2, 252. 
Briers, F., Chapman, D. L. & Walters, E. 1926 J. Chem. Soc. p. 562. 
Burnett, G. M. & Melville, H. W. 1947a Proc. Roy. Soc. A, 189, 456. 
Burnett, G. M. & Melville, H. W. 1947b Proc. Roy. Soc. A, 189, 474. 
Fischer, F. & Pfleiderer, G. 1922 Z. anorg. Chem. 124, 61. 
George, P. 1946a Proc. Roy. Soc. A, 185, 337. 
George, P. 1946b Trans. Faraday Soc. 42, 210. 
George, P., Rideal, E. K. & Robertson, A. 1946 Proc. Roy. Soc. A, 185, 288. 
George, P. & Robertson, A. 1946a Proc. Roy. Soc. A, 185, 309. 
George, P. & Robertson, A. 1946b Trans. Faraday Soc. 42, 217. 
Gregg, R. A. & Mayo, F. R. 1947 Disc. Faraday Soc. 2, 328. 
Lewis, F. M., Walling, C., Cummings, W., Briggs, E. R. & Mayo, F. R. 1948 J. Amer. Chem. 
Soc. 70, 1519. 
Noyes, W. A. & Leighton, P. A. 1941 Photochemistry of gases. New York: Reinhold. 
Robertson, A. & Waters, W. A. 1946 Trans. Faraday Soc. 42, 201. 
Robertson, A. & Waters, W. A. 1948a J. Chem. Soc. p. 1574. 
Robertson, A. & Waters, W. A. 1948b J. Chem. Soc. p. 1578. 
Robertson, A. & Waters, W. A. 1948c J. Chem. Soc. p. 1585. 
Swain, C. G. & Bartlett, P. D. 1946 J. Amer. Chem. Soc. 68, 2381. 
Weber, G. 1948 Trans. Faraday Soc. 44, 185. 


The dissociation energy of the N-N bond in hydrazine 
By M. Szwarc, Chemistry Department, University of Manchester 
(Communicated by M. G. Evans, F.R.S. Received 19 January 1949) 

The thermal decomposition of hydrazine was investigated over a temperature range of 630 to 
780° C and at pressures of a few mm. Hg. The experiments were carried out in a flow system, 
toluene being used as a carrier gas. This technique makes it possible to discriminate between 
the heterogeneous decomposition of hydrazine 

\[ 3N_2H_4 \rightarrow N_2 + 4NH_3 \tag{1} \]

and 

\[ 2N_2H_4 \rightarrow H_2 + N_2 + 2NH_3 \tag{2} \]

and the homogeneous decomposition to 

\[ N_2H_4 \rightarrow 2NH_2\cdot. \tag{3} \]

The NH₂ radicals produced by the latter process were removed, in the presence of excess 
toluene, by the rapid reaction (4) 

\[ C_6H_5.CH_3 + NH_2\cdot \rightarrow C_6H_5.CH_2\cdot + NH_3. \tag{4} \]

Thus the rate of formation of dibenzyl measures the rate of reaction (3). The study of the 
stoichiometry of the overall process, of the kinetics of various steps and of the effect of packing 
the reaction vessel led to the conclusion that (3) is a homogeneous, unimolecular gas reaction, 
the rate constant being $4 \times 10^{12} \exp(-60{,}000/RT)$. Assuming that the recombination of NH₂ 
radicals does not involve any energy of activation, it is found that D(NH₂-NH₂) = 60 ± 3 
kcal./mole. This value in conjunction with the relevant thermochemical data leads to the 
heat of formation of the NH₂ radical as 41 kcal./mole and to D(NH₂-H) = 104 ± 2 kcal./mole. The 
latter is good evidence in support of Gaydon’s value of 225 kcal./mole for the heat of 
dissociation of N₂. 
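To give a feeling for how steeply a rate constant of this form varies over the temperature range studied, the following sketch evaluates k = A exp(−E/RT). The pre-exponential factor of 4 × 10¹² is an assumed reading of the printed value, units of sec⁻¹ are assumed for a unimolecular reaction, and the function name is illustrative.

```python
import math

R_CAL = 1.987          # gas constant, cal mol^-1 K^-1
A = 4.0e12             # pre-exponential factor, sec^-1 (assumed reading)
E = 60_000.0           # activation energy, cal/mole


def rate_constant(temperature_k):
    """Unimolecular rate constant k = A * exp(-E / RT)."""
    return A * math.exp(-E / (R_CAL * temperature_k))


k_low = rate_constant(900.0)     # near the lowest temperatures used
k_high = rate_constant(1050.0)   # near the highest temperatures used
ratio = k_high / k_low

print(f"k(900 K)  = {k_low:.3g} per sec")
print(f"k(1050 K) = {k_high:.3g} per sec")
print(f"ratio     = {ratio:.0f}")
```

A change of roughly two orders of magnitude over 150° is what makes the yield of dibenzyl so sensitive to temperature.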





Introduction 

Very few data have been published on the subject of the thermal decomposition of 
hydrazine. Elgin & Taylor (1929) first observed the decomposition of this substance 
in a silica vessel at 250° C, and they found N₂ and NH₃ to be the main products. 
From the relative quantities of N₂ and NH₃ they deduced that the overall process is 
represented by the equation 

\[ 3N_2H_4 \rightarrow N_2 + 4NH_3. \tag{1} \]

However, they also reported the formation of small quantities of H₂ (about 5 %), 
which indicated that the main process, represented by equation (1), was accompanied 
by a much slower decomposition corresponding to equation (2) 

\[ 2N_2H_4 \rightarrow H_2 + N_2 + 2NH_3. \tag{2} \]

Subsequent work carried out in Taylor’s laboratory by Askey (1930) confirmed 
the overall stoichiometry of equation (1) and revealed that the decomposition of 
hydrazine in a silica bulb at 300° C is a heterogeneous process, which takes place on 
the walls of the reaction vessel and obeys first-order kinetics. The same worker found 
also that the thermal decomposition of hydrazine on a platinum or tungsten wire 
(at 200 and 380° C respectively) produced N₂, H₂ and NH₃ according to the overall 
equation 

\[ 2N_2H_4 \rightarrow H_2 + N_2 + 2NH_3. \tag{2} \]

Gedye & Allibone (1931) used the thermal decomposition of hydrazine (in a glass 
vessel at 400° C) as an analytical method for determining the quantities of hydrazine 
in a gas mixture. They also adopted the equation 3N₂H₄ = N₂ + 4NH₃. Further 
support for this scheme was provided by Gedye & Rideal (1932), although in one 
single experiment, in which the reaction vessel had been evacuated and sparked for 
some time with a Tesla coil, they observed that the decomposition proceeded 
according to equation (2). 

More recently, in a paper published by Birse & Melville (1940), data are presented 
which enable one to calculate the rate constant of the thermal decomposition of 
hydrazine in a silica vessel at 75, 100 and 218° C. From these data we estimate 
the energy of activation of this process as about 18 kcal./mole. 

The present investigation was concerned with the thermal decomposition of 
hydrazine in a temperature range of 620 to 770° C, i.e. much higher than those used 
in the previous studies. The essential new feature of the method chosen in this 
investigation lies in the use of toluene as a carrier gas. If hydrazine decomposes 
into two NH₂ radicals 

\[ N_2H_4 \rightarrow 2NH_2\cdot, \tag{3} \]

then, with hydrazine alone or in the presence of a neutral carrier gas, the NH₂ 
radicals could react with undecomposed hydrazine yielding, most probably, NH₃ 
and N₂H₃ radicals 

\[ NH_2\cdot + N_2H_4 \rightarrow NH_3 + N_2H_3\cdot. \]

The subsequent behaviour of the N₂H₃ radicals may lead to a chain process, and at 
any rate the interpretation of the results obtained under such conditions would be 



complicated and probably ambiguous. In the presence of toluene as a carrier gas, 
however, it might be expected that NH₂ radicals would be removed by the following 
reaction: 

\[ C_6H_5.CH_3 + NH_2\cdot \rightarrow C_6H_5.CH_2\cdot + NH_3. \tag{4} \]

The stable benzyl radicals formed in this reaction would eventually dimerize, as 
was shown in previous studies (Horrex & Szwarc unpublished; Szwarc 1947, 1948, 
1949); and the mechanism of the decomposition would thus be greatly simplified 
(see, for example, Szwarc 1949). This expectation is borne out by the results reported 
in this paper. 


Experimental 

The experimental technique was essentially the same as that already described 
by the present writer in a paper dealing with the thermal decomposition of ethyl- 
benzene (Szwarc 1949), and accordingly only a brief account of the method need 
be given here. 



The apparatus is shown diagrammatically in figure 1. The vapour of anhydrous 
hydrazine was introduced through a fine capillary Ks into a large excess of toluene 
vapour. The mixture of both compounds, maintained under a pressure of a few 
mm. Hg, was made to flow continuously through a silica tube B, heated by means of 
a cylindrical electric furnace. The gases leaving the reaction vessel were led through 
a glass tube and glass capillary (both heated by a nichrome winding to about 
80° C) into a small trap U cooled by ice. The non-volatile products (e.g. dibenzyl) 
were deposited in this trap, while the undecomposed toluene and hydrazine passed 
through it and were condensed in a large trap H cooled to about −80° C by a mixture 
of acetone and solid CO₂. 




The gases produced in the decomposition (H₂, N₂ and NH₃) were continuously 
removed by a system of two mercury diffusion pumps and pumped through 
a trap E cooled with liquid air into a storage vessel S. Ammonia was frozen out in 
the trap E, and thus the amount of H₂ + N₂ formed in the pyrolysis was estimated 
by measuring the pressure at the end of the run.* Thereafter part of the H₂ + N₂ 
mixture was admitted into an analytical device in which the percentage of H₂ 
could be estimated, the remaining gas being pumped off. The liquid air surrounding 
trap E was removed and replaced by an ice-water bath. The ammonia condensed 
in E evaporated and the quantity obtained was estimated by measuring the pressure 
in a known volume. The gas was shown to consist entirely of ammonia by absorbing 
it in dilute sulphuric acid. 

The trap H contained all the undecomposed hydrazine, toluene, and small quantities 
of dibenzyl, which passed through trap U. In order to estimate the amount of 
hydrazine the contents of the trap H were extracted with dilute sulphuric acid and 
the aqueous layer was analysed for hydrazine by titration with HIO₃ in acid solution 
(Bray & Cuy 1924). To determine the quantity of dibenzyl the toluene was removed 
by distillation in vacuo at ice temperature and the residue was weighed. The weight 
of this residue was added to the weight of the bulk of the dibenzyl given by the 
increase in the weight of trap U. The accuracy of this determination was about 
1 to 2 mg. The solid in trap U was identified as dibenzyl by its melting point, the 
unpurified material melting at about 50° C (m.p. of dibenzyl 51° C). 

The material used 

Anhydrous hydrazine was prepared from a commercial 60 % solution of hydrazine 
hydrate according to the method described by Wenner & Beckman (1932). Analysis 
of the final product showed it to contain 98 to 99 % of anhydrous hydrazine. 

Toluene was prepared as in previous studies by the repeated pyrolysis of ‘sulphur 
free’ toluene at 810 to 820° C followed by careful distillation of the pyrolyzed 
material. Blank experiments showed that the thermal decomposition of toluene 
could be neglected when compared with the decomposition of hydrazine at the 
temperatures used in this investigation. 

The results and their interpretation 

The decomposition of hydrazine in these experiments produced the following 
products: H₂, N₂, NH₃ and dibenzyl. In the experiments carried out at higher 
temperatures the formation of small quantities of CH₄ was also observed. Table 1 
illustrates the results, giving the percentage of H₂ in the H₂ + N₂ mixture, the molar 
ratio of NH₃ to H₂ + N₂, and the molar ratio of dibenzyl to NH₃. The following 
points should be noted: 

1. The formation of H₂ and N₂ suggests that even at the high temperatures used 
in the present research there was a considerable amount of heterogeneous decomposition 
of hydrazine on the walls of the reaction vessel. 

* Pumping out of the trap B was continued for half an hour after the flow of toluene 
and hydrazine had been interrupted. The tap was then closed and the pressure of the collected 
gases measured. 




2. It seems that both modes of heterogeneous decomposition mentioned in the 
introduction participate in the actual process: 

\[ 3N_2H_4 \rightarrow N_2 + 4NH_3 \tag{1} \]

and 

\[ 2N_2H_4 \rightarrow H_2 + N_2 + 2NH_3. \tag{2} \]

The occurrence of reaction (2) is shown by the appearance of H₂ amongst the products 
of decomposition. The fact that the amount of N₂ always exceeded that of H₂ is 
regarded as evidence for the participation of reaction (1). 

3. The observed increase in the percentage of H₂ as the temperature of decomposition 
was increased indicates that the reaction (2) has a higher energy of activation 
than reaction (1). This explains why previous investigators, who worked at 
much lower temperatures, observed mainly reaction (1). 

4. The marked increase, with increasing temperature, of the molar fraction of 
dibenzyl in the decomposition products indicates that the process by which it is 
formed has a high energy of activation. We assume that dibenzyl is produced as 
a result of the following three reactions: 

\[ N_2H_4 \rightarrow 2NH_2\cdot, \tag{3} \]

\[ C_6H_5.CH_3 + NH_2\cdot \rightarrow C_6H_5.CH_2\cdot + NH_3, \tag{4} \]

and 

\[ 2C_6H_5.CH_2\cdot \rightarrow C_6H_5.CH_2.CH_2.C_6H_5. \tag{5} \]

This assumption is crucial for our further conclusions, and its justification will be 
fully discussed later. 

If we assume that the decomposition of hydrazine takes place according to 
equations (1), (2) and (3), the latter process being followed, in the presence of an 

Table 1 

run       T (°K)     % H₂     NH₃/(H₂+N₂)     dibenzyl/NH₃ 

56          894       14         3.06           0.0072 
55          896       16         3.34           0.0185 
15          900       25         2.56           0.031 
17-18       903       22         2.41           0.023 
20          951       32         2.14           0.096 
10          955       32         2.28           0.131 
22          955       32         2.30           0.097 
27          995       40         2.03           0.127 
53         1004       32         2.57           0.127 
38         1007       31         3.06           0.20 
51         1008       35         2.98           0.15 
26         1008       38         2.40           0.17 
31         1009       35         2.62           0.155 
30         1010       35         2.84           0.16 
5-6        1044       40         2.24           0.25 
7-8        1047       44         2.10           0.26 
59         1050       35         2.98           0.24 
57         1053       40         2.84           0.25 
3-4        1057       42         2.56           0.25 

This table contains only a few of the thirty-two results obtained in the region of 1000 to 
1010° K. 





excess of toluene, by reactions (4) and (5), we are able to calculate the amounts of
NH3 produced and of hydrazine decomposed from a knowledge of the quantities of
H2, N2 and dibenzyl formed in the pyrolysis. In this scheme all the hydrogen is
assumed to be produced by reaction (2). The difference between the observed amounts
of N2 and H2 is taken as a measure of the quantity of N2 produced by reaction (1).
The amount of dibenzyl formed should be the same as the quantity of hydrazine
decomposed according to reaction (3) and it should also be equal to half the number
of moles of NH3 formed by reaction (4). This method of interpretation is illustrated
by calculations from the results of two experiments, one chosen from the lowest
and the second from the highest temperature region.


Experiment 55. T = 896° K

observed amount           equation of decomposition          NH3 calculated     N2H4 decomposed
(millimoles)                                                 (millimoles)       (millimoles)
H2        0.13            2N2H4 = H2 + N2 + 2NH3             2 x 0.13 = 0.26    2 x 0.13 = 0.26
N2        0.68            3N2H4 = N2 + 4NH3                  4 x 0.55 = 2.20    3 x 0.55 = 1.65
          = 0.13 + 0.55
dibenzyl  0.05            N2H4 = 2NH2;                       2 x 0.05 = 0.10    1 x 0.05 = 0.05
                          2C6H5.CH3 + 2NH2 = 2NH3 + dibenzyl
                                                             total    = 2.56    total    = 1.96
                                                             observed = 2.70

Experiment 59. T = 1050° K

observed amount           equation of decomposition          NH3 calculated     N2H4 decomposed
(millimoles)                                                 (millimoles)       (millimoles)
H2        0.25            2N2H4 = H2 + N2 + 2NH3             2 x 0.25 = 0.50    2 x 0.25 = 0.50
N2        0.43            3N2H4 = N2 + 4NH3                  4 x 0.18 = 0.72    3 x 0.18 = 0.54
          = 0.25 + 0.18
CH4       0.04            -                                  -                  -
dibenzyl  0.48            N2H4 = 2NH2;                       2 x 0.48 = 0.96    1 x 0.48 = 0.48
                          2C6H5.CH3 + 2NH2 = 2NH3 + dibenzyl
                                                             total    = 2.18    total    = 1.52
                                                             observed = 2.02
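The bookkeeping used in the two worked experiments can be sketched in a few lines of Python. This is only an illustration of the stoichiometric scheme described above; the function name `balance` and its argument names are hypothetical, and the input figures are the observed amounts quoted for experiments 55 and 59.

```python
def balance(h2, n2, dibenzyl):
    """Return (NH3 calculated, N2H4 decomposed), both in millimoles.

    h2, n2 and dibenzyl are the observed amounts in millimoles.
    All H2 is attributed to reaction (2); the excess of N2 over H2
    is attributed to reaction (1); dibenzyl measures reaction (3).
    """
    n2_from_1 = n2 - h2  # N2 produced by reaction (1)
    nh3 = 2 * h2 + 4 * n2_from_1 + 2 * dibenzyl
    n2h4 = 2 * h2 + 3 * n2_from_1 + 1 * dibenzyl
    return nh3, n2h4


if __name__ == "__main__":
    # Observed H2, N2 and dibenzyl for experiments 55 and 59.
    for run, h2, n2, dib in [("55", 0.13, 0.68, 0.05), ("59", 0.25, 0.43, 0.48)]:
        nh3, n2h4 = balance(h2, n2, dib)
        print(f"run {run}: NH3 calc = {nh3:.2f}, N2H4 decomposed = {n2h4:.2f}")
```

For experiment 55 this gives 2.56 millimoles of NH3 and 1.96 millimoles of hydrazine decomposed, and for experiment 59 it gives 2.18 and 1.52 millimoles, in line with the totals of the worked examples.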


Table 2 contains the observed quantities of H2, N2, CH4, dibenzyl and NH3 and
the calculated quantities of NH3. The agreement between the observed and calculated
quantities of NH3, given in the last two columns of table 2, provides a strong
argument in favour of our assumptions. Further evidence favouring our interpretation
is provided in tables 3 and 4. The third column of table 3 contains the quantities
of hydrazine decomposed calculated by the method explained above. Column 4
of this table gives the quantities of undecomposed hydrazine, recovered from the
trap H, and determined by direct titration. The last column contains the quantities
of hydrazine introduced (referred to later as Qc), equal to the sum of decomposed
and recovered hydrazine. Owing to technical difficulties it was not feasible to estimate
the quantity of hydrazine introduced in each individual run. It was possible,
however, to estimate the total amount for a set of about 4 to 15 runs by direct weighing
of the hydrazine reservoir. Table 4 contains data showing the total amount of
hydrazine introduced in each of several sets of runs. These quantities are compared
with the sums ΣQc obtained by summing up the corresponding values from the last
column of table 3. The agreement seems to be satisfactory.





Table 2

The quantities are given in millimoles

run      T (° K)   H2       N2       CH4      dibenzyl   NH3 obs.   NH3 calc.
56        894      0.10     0.63     -        0.016      2.23       2.35
55        896      0.13     0.68     -        0.05       2.70       2.56
15        900      0.22     0.66     -        0.07       2.25       2.34
17-18     903      0.22     0.78     -        0.055      2.41       2.79
20        951      0.29     0.62     -        0.187      1.95       2.27
19        955      0.29     0.61     -        0.27       2.06       2.40
22        955      0.265    0.56     -        0.185      1.90       2.08
27        995      0.26     0.36     0.02     0.16       1.26       1.14
53       1004      0.16     0.33     0.01     0.16       1.26       1.32
52       1006      0.15     0.33     0.01     0.20       1.30       1.42
38       1007      0.16     0.33     0.02     0.30       1.50       1.60
50       1007      0.15     0.33     0.01     0.17       1.25       1.36
54       1006      0.14     0.42     0.01     0.23       1.77       1.86
51       1008      0.18     0.32     0.01     0.22       1.49       1.36
45       1000      0.18     0.30     0.01     0.21       1.20       1.26
37       1004      0.15     0.29     0.01     0.20       1.25       1.26
47        999      0.14     0.35     0.01     0.13       1.40       1.34
26       1008      0.68     1.06     0.05     0.71       4.19       4.30
31       1009      0.15     0.27     0.01     0.17       1.10       1.12
32       1002      0.37     0.58     0.02     0.40       2.28       2.38
30       1010      0.16     0.28     0.01     0.20       1.25       1.20
49       1006      0.20     0.35     0.01     0.20       1.21       1.40
25       1010      0.85     1.25     0.04     0.63       4.50       4.56
24       1008      0.52     0.79     0.04     0.51       2.84       3.14
33       1010      0.24     0.38     0.02     0.16       1.35       1.36
36       1002      0.26     0.42     0.025    0.17       1.63       1.50
28       1007      0.32     0.47     0.02     0.29       1.50       1.82
35       1006      0.30     0.50     0.02     0.14       1.68       1.68
29       1011      0.515    0.745    0.025    0.42       2.76       2.79
48       1008      0.24     0.40     0.01     0.39       1.78       1.90
46       1003      0.19     0.33     0.01     0.34       1.58       1.62
40       1007      0.21     0.34     0.01     0.325      1.49       1.59
39       1006      0.21     0.36     -        0.38       1.62       1.78
60       1007      0.24     0.43     -        0.09       1.40       1.42
61       1009      0.21     0.41     -        0.09       1.45       1.40
62       1009      0.14     0.49     -        0.18       1.60       2.04
63       1015      0.19     0.41     -        0.12       1.30       1.50
64       1007      0.16     0.45     -        0.13       1.45       1.74
65       1010      0.29     0.69     -        0.18       2.70       2.54
66       1007      0.20     0.57     -        0.12       2.05       2.12
5-6      1044      0.41     0.58     0.03     0.55       2.22       2.60
7-8      1047      2.26     2.70     0.20     2.73      10.4       11.8
59       1050      0.25     0.43     0.04     0.48       2.02       2.18
57       1053      0.30     0.42     0.04     0.52       2.04       2.12
3-4      1057      0.46     0.56     0.09     0.64       2.60       2.22




Table 3

The quantities are given in millimoles

run      T (° K)   N2H4 decomp.   N2H4 recovered (undecomp.)   Qc, N2H4 introduced
56        894        1.81            5.80                         7.61
55        896        1.96            8.10                        10.06
15        900        1.87            9.31                        11.18
17-18     903        2.17           10.44                        12.61
20        951        1.76            4.75                         6.51
19        955        1.82            6.50                         8.32
22        955        1.59            6.60                         8.19
27        995        0.98            1.91                         2.89
53       1004        0.99            1.04                         2.03
52       1006        1.04            1.34                         2.38
38       1007        1.13            1.52                         2.65
50       1007        1.01            0.87                         1.88
54       1006        1.35            1.20                         2.55
51       1008        1.00            1.15                         2.15
45       1000        0.93            1.15                         2.08
37       1004        0.92            1.96                         2.88
47        999        1.01            1.37                         2.38
26       1008        3.21            5.50                         8.71
31       1009        0.83            0.86                         1.69
32       1002        1.77            3.62                         5.39
30       1010        0.88            1.15                         2.03
49       1006        1.05            1.52                         2.57
25       1010        3.53            4.80                         8.33
24       1008        2.36            4.00                         6.36
33       1010        1.06            1.50                         2.56
36       1002        1.17            1.74                         2.91
28       1007        1.38            2.18                         3.56
35       1006        1.34            1.10                         2.44
29       1011        2.14            2.78                         4.92
48       1008        1.35            1.80                         3.15
46       1003        1.14            1.80                         2.94
40       1007        1.14            0.18                         1.32
39       1006        1.25            0.24                         1.49
60       1007        1.14            0.68                         1.82
61       1009        1.11            0.15                         1.26
62       1009        1.51            0.59                         2.10
63       1015        1.16            0.22                         1.38
64       1007        1.32            0.40                         1.72
65       1010        1.96            0.42                         2.38
66       1007        1.59            0.29                         1.88
5-6      1044        1.88            1.11                         2.99
7-8      1047        8.57            4.90                        13.47
59       1050        1.52            0.57                         2.09
57       1053        1.46            0.61                         2.07
3-4      1057        1.82            0.69                         2.51




Table 4

           N2H4 introduced      N2H4 introduced
runs       exper.               theor. ΣQc
           (millimoles)         (millimoles)
17-20        29.2                 27.4
27-40        40.0                 38.9
45-57        46.5                 43.9
60-66        12.0                 12.5
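The comparison made in table 4 is a simple summation, which can be sketched as follows for the two shortest sets of runs. The dictionary `Q_C` holds the values of Qc from the last column of table 3 for those runs only, `WEIGHED` holds the experimental totals from table 4, and the names themselves are hypothetical.

```python
# Qc (millimoles of N2H4 introduced) from the last column of table 3.
Q_C = {
    "17-18": 12.61, "19": 8.32, "20": 6.51,
    "60": 1.82, "61": 1.26, "62": 2.10, "63": 1.38,
    "64": 1.72, "65": 2.38, "66": 1.88,
}

# Totals found by weighing the hydrazine reservoir (table 4).
WEIGHED = {"17-20": 29.2, "60-66": 12.0}

SETS = {
    "17-20": ["17-18", "19", "20"],
    "60-66": ["60", "61", "62", "63", "64", "65", "66"],
}


def summed_qc(runs):
    """Sum Qc over the listed runs (millimoles)."""
    return sum(Q_C[r] for r in runs)


if __name__ == "__main__":
    for name, runs in SETS.items():
        total = summed_qc(runs)
        diff = 100.0 * (total - WEIGHED[name]) / WEIGHED[name]
        print(f"runs {name}: sum Qc = {total:.1f}, weighed = {WEIGHED[name]:.1f}, "
              f"difference = {diff:+.1f}%")
```

The sums come to about 27.4 and 12.5 millimoles, within a few per cent of the weighed totals.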


The appearance of CH4 merits some further discussion. Previous work, as well
as the blank tests performed during the present work, proved definitely that the
observed quantities of CH4 could not be attributed to the thermal decomposition
of toluene. It could be suggested that CH4 is produced by the following reactions:

$$\mathrm{NH_2}\cdot + \mathrm{C_6H_5.CH_3} \rightarrow \mathrm{C_6H_5.NH_2} + \mathrm{CH_3}\cdot, \tag{6}$$

$$\mathrm{CH_3}\cdot + \mathrm{C_6H_5.CH_3} \rightarrow \mathrm{CH_4} + \mathrm{C_6H_5.CH_2}\cdot. \tag{7}$$

To test this suggestion the toluene, recovered from high temperature experiments,
was extracted first with water, in order to remove hydrazine, and then with dilute
hydrochloric acid. The latter solution was then diazotized and coupled with
β-naphthol in the usual way. The appearance of the red colour indicated the presence
of aniline in the original condensate. It was also shown that this colour was not
developed if a mixture of toluene and hydrazine that had not previously been
pyrolyzed was used.

However, judging from the intensity of the red colour the amount of aniline
present was only about 1/10 of that which could be expected on the basis of the
quantity of CH4 produced. Taking the amount of aniline determined experimentally
as a measure of occurrence of reaction (6) the relative probabilities of reactions (4)
and (6) were estimated, from the quantities of dibenzyl and aniline produced, to be
at least 200 : 1:

$$\mathrm{C_6H_5.CH_3} + \mathrm{NH_2}\cdot \;\begin{cases} \rightarrow \mathrm{C_6H_5.CH_2}\cdot + \mathrm{NH_3}, & (4) \\ \rightarrow \mathrm{C_6H_5.NH_2} + \mathrm{CH_3}\cdot. & (6) \end{cases}$$

Assuming that reactions (4) and (6) have equal steric factors one is able to calculate
on the basis of the above relative probabilities the difference in the activation
energies. The obtained value of about 10 kcal./mole is obviously too high
for reactions of this type. It is likely, therefore, that these probabilities are determined
by steric factors and not by the differences in the activation energy of the
two processes.
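The figure of about 10 kcal./mole follows from the ratio 200 : 1 if the two reactions are given equal pre-exponential (steric) factors, since the ratio of rates is then exp(ΔE/RT). A minimal sketch, assuming a temperature of 1000° K as representative of the high temperature experiments (the text does not state the temperature used) and with a hypothetical function name:

```python
import math

R_KCAL = 1.987e-3  # gas constant, kcal/(mole K)


def activation_energy_gap(rate_ratio, temperature_k):
    """Difference in activation energy (kcal/mole) implied by a rate ratio,
    assuming equal pre-exponential factors for the two reactions."""
    return R_KCAL * temperature_k * math.log(rate_ratio)


if __name__ == "__main__":
    # 200 : 1 is the lower limit quoted for reactions (4) and (6);
    # 1000 K is an assumed, representative temperature.
    gap = activation_energy_gap(200.0, 1000.0)
    print(f"E(6) - E(4) is about {gap:.1f} kcal/mole")
```

At 1000° K the ratio corresponds to a difference of roughly 10.5 kcal./mole.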


The kinetics of the decomposition of hydrazine

It was stated by Askey (1930) that the heterogeneous decomposition of hydrazine
obeys first-order kinetics and we assume that this is true also in our experiments.
Furthermore, we consider that the rate of formation of dibenzyl, in the presence of
a large excess of toluene, is governed by the rate of the reaction

$$\mathrm{N_2H_4} \rightarrow 2\,\mathrm{NH_2}\cdot, \tag{3}$$




which is also assumed to be a first-order process. The successful interpretation of
the experimental data on the basis of these assumptions will be considered as
evidence for their validity.

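Using this kinetic scheme we are able to calculate the composite rate constant of
the overall decomposition, $k_{\mathrm{total}}$, and the rate constant of the hydrazine
decomposition proceeding according to equation (3), denoted by $k_{\mathrm{N_2H_4}}$, using the
following formulae:

$$k_{\mathrm{total}} = \frac{1}{t}\,\ln\frac{\text{amount of N}_2\text{H}_4\text{ introduced}}{\text{amount of N}_2\text{H}_4\text{ undecomposed}},$$

$$k_{\mathrm{total}} = \frac{1}{(\mathrm{N_2H_4})}\,\frac{\Delta(\text{total N}_2\text{H}_4\text{ decomposed})}{\Delta t} = \frac{1}{(\mathrm{N_2H_4})}\left[\frac{\Delta(\text{N}_2\text{H}_4\text{ decomposed homogen.})}{\Delta t} + \frac{\Delta(\text{N}_2\text{H}_4\text{ decomposed heterogen.})}{\Delta t}\right],$$

therefore

$$k_{\mathrm{N_2H_4}} = k_{\mathrm{total}}\,\frac{\text{amount of N}_2\text{H}_4\text{ decomposed according to equation (3)}}{\text{total amount of N}_2\text{H}_4\text{ decomposed}} = k_{\mathrm{total}}\,\frac{\text{amount of dibenzyl formed}}{\text{total amount of N}_2\text{H}_4\text{ decomposed}}.$$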
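As an illustration of these formulae, the following sketch evaluates both rate constants for run 55, using the amounts from tables 2 and 3 and the time of contact of 0.35 sec. listed for that run in table 6. The function names are hypothetical.

```python
import math


def k_total(introduced, undecomposed, contact_time):
    """First-order rate constant of the overall decomposition (sec^-1)."""
    return math.log(introduced / undecomposed) / contact_time


def k_homogeneous(k_tot, dibenzyl, decomposed):
    """Rate constant of reaction (3): k_total times the
    'fraction of homogeneous decomposition'."""
    return k_tot * dibenzyl / decomposed


if __name__ == "__main__":
    # Run 55: 10.06 mmol introduced, 8.10 mmol recovered,
    # 1.96 mmol decomposed, 0.05 mmol dibenzyl, t = 0.35 sec.
    kt = k_total(10.06, 8.10, 0.35)
    kh = k_homogeneous(kt, 0.05, 1.96)
    print(f"k_total = {kt:.2f} sec^-1, k_N2H4 = {kh:.3f} sec^-1")
```

The results, about 0.62 and 0.016 sec.⁻¹, agree with the tabulated 0.61 and 0.015 sec.⁻¹ to within the rounding of the input data.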


t denotes the time of contact and is calculated from the measured rates of evaporation
of toluene and of hydrazine, and the pressure and temperature in the reaction
vessel. The quantity

$$\frac{\text{amount of dibenzyl formed}}{\text{total amount of N}_2\text{H}_4\text{ decomposed}}$$

will henceforth be described as the 'fraction of homogeneous decomposition'. The
necessary data: amounts of N2H4 introduced, of undecomposed (recovered) N2H4 and
total amounts of N2H4 decomposed, are tabulated in table 3; the amounts of dibenzyl
formed are given in table 2. Table 5 shows the values of $k_{\mathrm{total}}$ calculated in this
way, the 'fraction of homogeneous decomposition' and $k_{\mathrm{N_2H_4}}$ for experiments carried
out at temperatures near 1008° K. The last column of this table contains the values of
$k_{\mathrm{N_2H_4}}$ re-calculated for an arbitrarily chosen temperature of 1008° K. This re-calculation
involved a short extrapolation which was carried out on the basis of 60 kcal./mole
as an activation energy for the process of formation of dibenzyl; a value obtained
by plotting $\log k_{\mathrm{N_2H_4}}$ against 1/T (see table 6 and figure 2). The following
conclusions may be drawn from table 5:


1. Although the partial pressure of hydrazine varied from 0.05 to 0.78 mm. Hg,
the pressure of toluene varied from 5 up to 15 mm. Hg, and the time of contact varied
from 0.26 up to 1.1 sec., there does not appear to be any trend in the values of
$k_{\mathrm{total}}$. These results justify our assumption concerning the first-order kinetics of
the overall process.

2. The packing of the reaction vessel (see runs 60 to 66), which increased the
surface about 2½ times, caused a roughly proportional increase in $k_{\mathrm{total}}$. The
average $k_{\mathrm{total}}$ for twenty-three experiments carried out in the unpacked reaction
vessel is 2.1 sec.⁻¹, while the average for seven experiments in the packed reaction
vessel is 6.1 sec.⁻¹. This is a direct proof that processes (1) and (2) are indeed
heterogeneous.

3. The values for $k_{\mathrm{N_2H_4}}$ re-calculated for 1008° K are even more reproducible
than $k_{\mathrm{total}}$. They do not show any trend either with the change of partial pressure
of hydrazine (varied by a factor of 18), or with the change in time of contact. This
justifies our second assumption, namely the first-order kinetics of the process of
formation of dibenzyl. There is a slight increase in $k_{\mathrm{N_2H_4}}$ with increase in toluene
pressure (compare runs 63 and 62 with 48 and 46), but these variations are not of
sufficient magnitude to be considered important.

4. The packing of the reaction vessel seems to be without any influence on
$k_{\mathrm{N_2H_4}}$. This observation strongly suggests that the rate determining process in the
formation of dibenzyl is a homogeneous gas reaction. It is interesting to note that
the random variations in $k_{\mathrm{total}}$ are much greater than the variations in $k_{\mathrm{N_2H_4}}$.
This seems to be an indirect indication of the heterogeneous nature of processes (1)
and (2) and of the homogeneous nature of reaction (3).

[Table 5. Pressures of toluene and of hydrazine, time of contact, $k_{\mathrm{total}}$, 'fraction of
homogeneous decomposition', $k_{\mathrm{N_2H_4}}$ and $k_{\mathrm{N_2H_4}}$ re-calculated for 1008° K, for runs
near 1008° K. The tabulated values are not legible.]

Table 6

                    p(toluene)   p(N2H4)    time of    k(total)    fraction of    k(N2H4)     E
run      T (° K)    (mm. Hg)     (mm. Hg)   contact    (sec.⁻¹)    homogeneous    (sec.⁻¹)    (kcal./mole)
                                            (sec.)                 decomp.
56        894         7.0         0.10       0.33       0.82        0.009          0.007       61.1
55        896         7.0         0.16       0.35       0.61        0.026          0.015       59.9
15        900         7.0         0.15       0.30       0.61        0.038          0.023       59.4
17-18     903         7.0         0.11       0.30       0.61        0.024          0.015       60.3
20        951         7.0         0.44       0.29       1.10        0.106          0.12        59.6
19        955         6.8         0.16       0.28       0.89        0.148          0.13        59.7
22        955         7.9         0.42       0.29       0.74        0.116          0.09        60.4
27        995         7.1         0.46       0.27       1.6         0.163          0.25        60.9
average  1008        ~7.0         -         ~0.27       2.1         0.20           0.41        60.7
5-6      1044         7.1         0.19       0.27       3.5         0.29           1.0         61.0
7-8      1047         7.1         0.16       0.27       3.8         0.32           1.3         60.6
59       1050         7.0         0.10       0.27       4.8         0.32           1.6         60.4
57       1053         7.1         0.07       0.26       4.7         0.34           1.6         60.5
3-4      1057         7.0         0.19       0.27       4.8         0.33           1.6         60.8


Having presented evidence in support of our assumptions concerning the kinetics
of the reaction, we can now discuss the problem of activation energy. Table 6
contains the values of $k_{\mathrm{total}}$, 'fraction of homogeneous decomposition' and
$k_{\mathrm{N_2H_4}}$ for four temperature regions from 900 to 1060° K. There is a small increase in
$k_{\mathrm{total}}$, but a very marked increase in the 'fraction of homogeneous decomposition' and
$k_{\mathrm{N_2H_4}}$. The last column of table 6 contains the values of activation energy
corresponding to $k_{\mathrm{N_2H_4}}$ calculated on the assumption that the frequency factor is
$6 \times 10^{12}$ sec.⁻¹. This particular value of the frequency factor was chosen in
consideration of the results obtained in a previous investigation (Szwarc 1949). The
plot of $\log k_{\mathrm{N_2H_4}}$ against 1/T is given in figure 2. The best straight line corresponds
to an activation energy of 60 ± 3 kcal./mole and a frequency factor of $4 \times 10^{12}$ sec.⁻¹.
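The two uses of the Arrhenius equation in this paragraph can be sketched as follows. The frequency factor of 6e12 sec.⁻¹ used for the E column is an assumption here, chosen because it reproduces the tabulated activation energies; the exponent of 12 in the fitted value of 4e12 sec.⁻¹ is likewise assumed on the ground that it reproduces the average rate constant at 1008° K. The function names are hypothetical.

```python
import math

R_KCAL = 1.987e-3  # gas constant, kcal/(mole K)


def activation_energy(k, temperature_k, frequency_factor):
    """E (kcal/mole) from a single rate constant and an assumed frequency factor."""
    return R_KCAL * temperature_k * math.log(frequency_factor / k)


def rate_constant(temperature_k, frequency_factor, energy_kcal):
    """Arrhenius rate constant k = A exp(-E/RT), in sec^-1."""
    return frequency_factor * math.exp(-energy_kcal / (R_KCAL * temperature_k))


if __name__ == "__main__":
    # E column of table 6, with the assumed frequency factor 6e12 sec^-1.
    for label, temp, k in [("55", 896.0, 0.015), ("average", 1008.0, 0.41)]:
        e = activation_energy(k, temp, 6e12)
        print(f"{label}: E = {e:.1f} kcal/mole")
    # Best straight line of figure 2: E = 60 kcal/mole, A = 4e12 sec^-1 (assumed exponent).
    print(f"k(1008 K) = {rate_constant(1008.0, 4e12, 60.0):.2f} sec^-1")
```

With these assumptions the sketch returns about 59.9 and 60.7 kcal./mole for the two rows, and about 0.39 sec.⁻¹ at 1008° K, close to the averaged experimental value of 0.41 sec.⁻¹.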




The agreement between the experimental frequency factor obtained from figure 2
and that which is expected for a unimolecular process on theoretical grounds
($10^{?}$ to $10^{?}$ sec.⁻¹) confirms that $k_{\mathrm{N_2H_4}}$ represents the rate constant of the
homogeneous, unimolecular gas reaction.



Figure 2. Plot of $\log k_{\mathrm{N_2H_4}}$ against $10^{3} \times 1/T$ (abscissa from 0.950 to 1.150).
E = 60 kcal./mole, ν = $4 \times 10^{12}$ sec.⁻¹. ○ single experiment,
⊙ average of nineteen experiments.

Discussion

It is now desirable to sum up and review critically the evidence which leads us to
the final conclusion that $k_{\mathrm{N_2H_4}}$ represents the rate constant of the homogeneous,
unimolecular reaction

$$\mathrm{N_2H_4} \rightarrow 2\,\mathrm{NH_2}\cdot. \tag{3}$$

The agreement between the observed and calculated quantities of NH3 (table 2),
and between the observed and calculated quantities of hydrazine introduced
(table 4) provides an adequate justification for our method of computation. The
latter leads to the conclusion that 1 mole of dibenzyl and 2 moles of NH3 are formed
as a result of the decomposition of 1 mole of hydrazine.

There are three possible schemes which might account for this stoichiometry:

A. The scheme suggested previously consisting of a sequence of reactions (3),
(4) and (5).

B. The scheme based on a reaction

$$\mathrm{C_6H_5.CH_3} + \mathrm{NH_2.NH_2} \rightarrow \mathrm{C_6H_5.CH_2}\cdot + \mathrm{NH_3} + \mathrm{NH_2}\cdot.$$

C. The scheme based on a reaction

$$\mathrm{C_6H_5.CH_3} + \mathrm{NH_2.NH_2} + \mathrm{CH_3.C_6H_5} \rightarrow \mathrm{C_6H_5CH_2}\cdot + \mathrm{NH_3} + \mathrm{NH_3} + \cdot\mathrm{CH_2C_6H_5}.$$

The scheme C can be rejected, since the termolecular process is extremely improbable
and it would require an increase of $k_{\mathrm{N_2H_4}}$ proportional to the square of
the toluene pressure, in contradiction to our observations (see table 5). The scheme B
requires a proportionality between $k_{\mathrm{N_2H_4}}$ and the pressure of toluene. Table 5 does
reveal a slight increase of $k_{\mathrm{N_2H_4}}$ with increase of the toluene pressure, but this is
smaller than would be required by the scheme B. The main opposition to this scheme
is provided by the experimentally observed frequency factor and activation energy.
To account for the observed values it would be necessary to assume the radius of
the activated complex to be at least 700 Å (taking the steric factor as 1). Our present
knowledge indicates that the steric factors for these types of reactions are much
less than unity (e.g. Glasstone, Laidler & Eyring 1941), which would require an even
greater radius for the activated complex. Thus we conclude that the scheme A, i.e.

$$\mathrm{N_2H_4} \rightarrow 2\,\mathrm{NH_2}\cdot, \tag{3}$$

$$\mathrm{C_6H_5.CH_3} + \mathrm{NH_2}\cdot \rightarrow \mathrm{C_6H_5.CH_2}\cdot + \mathrm{NH_3}, \tag{4}$$

$$2\,\mathrm{C_6H_5.CH_2}\cdot \rightarrow \mathrm{C_6H_5.CH_2.CH_2.C_6H_5}, \tag{5}$$

is the only one which accounts for the observed stoichiometry. Kinetically the
process was found to be a homogeneous, first-order gas reaction, with a frequency
factor of the magnitude required for a unimolecular decomposition. These characteristics
must be those of reaction (3), since reaction (4) should be fast (by analogy with
similar processes), and the kinetics of reaction (5) are irrelevant for these
considerations.

It could be argued that the experimental activation energy is the sum of the
activation energies involved in reactions (3) and (4). This could be true only if a very
small proportion of the NH2 radicals react according to equation (4) whereas most
of them are destroyed by a wall reaction or by recombination either with themselves
or with benzyl radicals. To investigate the last possibility, toluene, recovered from
an experiment in which the partial pressure of hydrazine was kept very high (0.8 mm.
of Hg) and the decomposition was about 85 %, was extracted with water to remove
undecomposed hydrazine and was then shaken with a water-acetone solution of sodium
nitroprusside (Rimini test for primary aliphatic amines). The test gave a positive
result* (red-violet coloration), and a rough colorimetric estimate determined the
quantity of benzylamine as 5 to 10 mole per cent of the available NH2 radicals.
The same test repeated with toluene recovered from an experiment in which the
partial pressure of hydrazine was 0.5 mm. of Hg gave only 2 to 5 mole per cent of
benzylamine. It is concluded, therefore, that the recombination of NH2 and benzyl
radicals in these experiments is only a minor reaction which can be safely neglected.

The mutual recombination of NH2 radicals and their decomposition were studied
by investigating the pyrolysis of benzylamine reported in the following paper. It
was found that in the presence of toluene there is no mutual recombination of NH2
radicals, and that their decomposition is only a minor side reaction.

* The Rimini test is given neither by aniline nor by NH3 or N2H4.




Thus the final conclusion is reached: that under the conditions of the experiments
reaction (4) is the only reaction consuming NH2 radicals which needs to be taken
into account for the kinetic calculations.

The dissociation energies of the N-N bond in hydrazine and of the
first N-H bond in ammonia

Having obtained the activation energy of reaction

$$\mathrm{N_2H_4} \rightarrow 2\,\mathrm{NH_2}\cdot, \tag{3}$$

one should be able to calculate the dissociation energy of the N-N bond in hydrazine
knowing the activation energy of the reverse process, i.e. the recombination of NH2
radicals. It is assumed generally that the recombination of radicals or atoms does
not involve any activation energy. The case of NH2 radicals, however, may be an
exception. Heitler & Rumer (1931) concluded on the grounds of a theoretical
computation that the interaction between two NH2 radicals leads to a repulsion
for large N-N separation. Such a repulsion results in the appearance of a hump in
the potential energy curve, corresponding to an activation energy in the process of
recombination. However, in the same paper these authors explicitly stated their
doubts regarding the physical reality of this hump. They pointed out that their
treatment neglected the dipole-dipole attraction, which could well overcome the
repulsion effect calculated. It seems, therefore, that it is justifiable to neglect the
activation energy of NH2 recombination, and thus the dissociation energy of the
N-N bond in hydrazine can be evaluated as 60 ± 3 kcal./mole. This appears to be
a high value, particularly if we recall the 20 kcal./mole advocated by Pauling (1945)
as the average bond energy of the N-N bond. It should be noted, however, that
Pauling's system of bond energies was formulated on the basis of 170 kcal./mole
for the heat of dissociation of N2, while the choice of 225 kcal./mole for this heat of
dissociation leads to a value of about 43 kcal./mole for the average N-N bond energy
(Skinner 1945).

Further support for our value of the N-N dissociation energy in hydrazine can be
adduced by considering the dissociation energy of the first N-H bond in ammonia,
which can be calculated as follows. The heat of formation of gaseous, anhydrous
hydrazine was estimated by Hughes, Corruccini & Gilbert (1939) at +22 kcal./mole,
and the heat of formation of NH3 has been reported as -11 kcal./mole (Bichowsky
& Rossini 1936). Using these values, together with D(NH2-NH2) = 60 kcal./mole,
a value of +41 kcal./mole was obtained for the heat of formation of the NH2 radical,
from which the dissociation energy of the first N-H bond in ammonia was estimated
at 104 ± 2 kcal./mole. This value seems to be reasonable, as it would be expected to lie
between the dissociation energies of the first O-H bond in H2O and the first C-H bond
in CH4. Dwyer & Oldenberg (1944) estimated D(HO-H) at 118 kcal./mole, whilst
Kistiakowsky & van Artsdalen (1944) found the D(CH3-H) to be 101 kcal./mole.
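The thermochemical cycle behind these two figures can be written out explicitly. In the sketch below the heats of formation of N2H4 and NH3 and the value of D(NH2-NH2) are those quoted in the text; the heat of formation of the hydrogen atom, 52.1 kcal./mole (half the dissociation energy of H2), is not given in the text and is introduced here as an assumption. The constant and function names are hypothetical.

```python
# Values quoted in the text (kcal/mole).
HF_N2H4 = 22.0      # heat of formation of gaseous hydrazine
HF_NH3 = -11.0      # heat of formation of ammonia
D_N_N = 60.0        # D(NH2-NH2) from the present kinetic work

# Assumption, not from the text: heat of formation of atomic hydrogen,
# taken as half the dissociation energy of H2.
HF_H = 52.1


def heat_of_formation_nh2(hf_n2h4, d_n_n):
    """N2H4 -> 2 NH2: 2 * Hf(NH2) = Hf(N2H4) + D(NH2-NH2)."""
    return 0.5 * (hf_n2h4 + d_n_n)


def first_n_h_dissociation(hf_nh2, hf_h, hf_nh3):
    """NH3 -> NH2 + H: D(NH2-H) = Hf(NH2) + Hf(H) - Hf(NH3)."""
    return hf_nh2 + hf_h - hf_nh3


if __name__ == "__main__":
    hf_nh2 = heat_of_formation_nh2(HF_N2H4, D_N_N)
    d_nh2_h = first_n_h_dissociation(hf_nh2, HF_H, HF_NH3)
    print(f"Hf(NH2) = {hf_nh2:.0f} kcal/mole, D(NH2-H) = {d_nh2_h:.0f} kcal/mole")
```

With these inputs the heat of formation of NH2 comes to 41 kcal./mole and D(NH2-H) to about 104 kcal./mole.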


Table 7

D(O-H)             D(N-H)             D(C-H)
100 kcal./mole     86 kcal./mole      80 kcal./mole

D(HO-H)            D(NH2-H)           D(CH3-H)
118 kcal./mole     104 kcal./mole     101 kcal./mole







It seems also to be reasonable to expect that the values of the dissociation energies
of the diatomic radicals OH, NH and CH would follow the same sequence. On the basis
of this assumption Glockler (1948) interpolated for D(N-H) a value of about
85 kcal./mole, taking D(O-H) as 100 kcal./mole (Dwyer & Oldenberg 1944) and
D(C-H) as 80 kcal./mole (Herzberg 1939). These data are summarized in table 7.
Furthermore it is to be expected that the average N-H bond energy in NH3 would
be of the order

$$\tfrac{1}{2}\,[D(\mathrm{N{-}H}) + D(\mathrm{NH_2{-}H})] = \tfrac{1}{2}\,(86 + 104)\ \text{kcal./mole},$$

i.e. about 95 kcal./mole. Assuming the heat of dissociation of N2 to be 225 kcal./mole
(Gaydon 1944) one obtains for the average N-H bond energy in NH3 a value of
93 kcal./mole (Skinner 1945), which is similar to that deduced above. On the other
hand, the calculation based on Herzberg's value of the heat of dissociation of N2
(170 kcal./mole) leads to the average N-H bond energy of ammonia of 84 kcal./mole
only. It is concluded, therefore, that the value of D(NH2-H), as obtained in this
investigation, is in harmony with Gaydon's estimate of the heat of dissociation
of N2. Very similar arguments, applied by Glockler (1948), also favoured Gaydon's
value of the heat of dissociation of N2.

Discussion of the heterogeneous decomposition of hydrazine

It has already been mentioned that two modes of heterogeneous decomposition
of hydrazine have been observed in this investigation:

$$3\,\mathrm{N_2H_4} \rightarrow \mathrm{N_2} + 4\,\mathrm{NH_3}, \tag{1}$$

and

$$2\,\mathrm{N_2H_4} \rightarrow \mathrm{H_2} + \mathrm{N_2} + 2\,\mathrm{NH_3}. \tag{2}$$

It seems that these reactions occur by the direct decomposition of activated complexes
A and B formed on the quartz surface of the reaction vessel.

[Structural diagrams of the activated complexes A and B.]

Evidence has been presented in this paper which shows that participation of atoms or
radicals in these processes is very unlikely. The formation of radicals or atoms would be
expected to produce dibenzyl by a secondary reaction with toluene:

$$\mathrm{R}\cdot + \mathrm{C_6H_5.CH_3} \rightarrow \mathrm{RH} + \mathrm{C_6H_5.CH_2}\cdot,$$

$$2\,\mathrm{C_6H_5.CH_2}\cdot \rightarrow \text{dibenzyl}.$$

If this were the case the quantity of dibenzyl produced should be increased by
packing the reaction vessel, which is contrary to our observations.

Birse & Melville (1940) investigated the heterogeneous decomposition of hydrazine
on the walls of the reaction vessel in the presence of para-hydrogen. They found that
the decomposition did not cause any para-ortho hydrogen conversion. As the conversion
is extremely sensitive to the presence of radicals, these results, in agreement
with our observations, point also to the absence of radicals or atoms in the
heterogeneous decomposition of hydrazine.*

The formation of the proposed activated complexes, particularly of type A, must
correspond to a very low probability factor and, therefore, the decompositions via
such transition states would only be observed if their activation energies were very
small. There was described earlier in this paper the calculation of the activation
energy of the overall process at 8 kcal./mole, using the data provided by Birse &
Melville. The data presented here have made it possible to estimate the rate constants
of the individual reactions (1) and (2). In order to do this one calculates the
quantities of N2H4 decomposed in each experiment according to equations (1)
and (2). The former is given by $3(N_{\mathrm{N_2}} - N_{\mathrm{H_2}})$ and the latter by $2N_{\mathrm{H_2}}$, where
$N_{\mathrm{N_2}}$ and $N_{\mathrm{H_2}}$ denote the number of millimoles of nitrogen and hydrogen collected in
each case. The method of calculation is illustrated by means of the examples given
on p. 272. The rate constants of the reactions (1) and (2) are given by:

$$k_1 = k_{\mathrm{total}}\,\frac{\text{quantity of N}_2\text{H}_4\text{ decomposed according to (1)}}{\text{total quantity of N}_2\text{H}_4\text{ decomposed}},$$

$$k_2 = k_{\mathrm{total}}\,\frac{\text{quantity of N}_2\text{H}_4\text{ decomposed according to (2)}}{\text{total quantity of N}_2\text{H}_4\text{ decomposed}}.$$

Table 8 shows the values of $k_1$ and $k_2$ calculated in this way for various temperatures.
The last column of this table contains the ratio $k_1/k_2$. The reproducibility of the
results is not very satisfactory but this is not surprising since the walls of the reaction
vessel could be expected to change their total activities, and still more their relative
activities towards the competing reactions, with time and use. We may recall in this
connexion the observations of Gedye & Rideal who found a change in the character
of the decomposition of hydrazine on a glass surface which had been sparked previously
with a Tesla coil.
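The partition of the overall rate constant between the two wall reactions can be illustrated for run 55. The sketch follows the bookkeeping of the worked examples on p. 272, taking the hydrazine consumed by reaction (1) as three times the excess of N2 over H2 and that consumed by reaction (2) as twice the H2 collected. The function name is hypothetical and the inputs are the tabulated values for run 55.

```python
def heterogeneous_rate_constants(k_total, h2, n2, decomposed):
    """Split k_total into k1 and k2 (sec^-1) for reactions (1) and (2).

    h2 and n2 are the millimoles of hydrogen and nitrogen collected;
    decomposed is the total millimoles of N2H4 decomposed.
    """
    by_reaction_1 = 3 * (n2 - h2)  # N2H4 consumed by 3N2H4 -> N2 + 4NH3
    by_reaction_2 = 2 * h2         # N2H4 consumed by 2N2H4 -> H2 + N2 + 2NH3
    k1 = k_total * by_reaction_1 / decomposed
    k2 = k_total * by_reaction_2 / decomposed
    return k1, k2


if __name__ == "__main__":
    # Run 55: k_total = 0.61 sec^-1, H2 = 0.13, N2 = 0.68, N2H4 decomposed = 1.96 mmol.
    k1, k2 = heterogeneous_rate_constants(0.61, 0.13, 0.68, 1.96)
    print(f"k1 = {k1:.2f}, k2 = {k2:.2f}, k1/k2 = {k1 / k2:.1f}")
```

For run 55 this gives k1 of about 0.51 sec.⁻¹ and k2 of about 0.08 sec.⁻¹, a ratio close to the 6.4 tabulated for that run.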

Table 8

                      k(total)    k1          k2
run        T (° K)    (sec.⁻¹)    (sec.⁻¹)    (sec.⁻¹)    k1/k2
56          894        0.82        0.72        0.09        8.0
55          896        0.61        0.51        0.08        6.4
15          900        0.61        0.43        0.16        2.7
17-18       903        0.61        0.47        0.12        4.0
20          951        1.10        0.62        0.36        1.7
19          955        0.89        0.47        0.28        1.7
22          955        0.74        0.41        0.25        1.6
27          995        1.6         0.47        0.81        0.6
average    1008        2.1        ~0.8        ~0.8        ~1.0
5-6        1044        3.5         1.0         1.5         0.7
7-8        1047        3.8         0.6         2.0         0.3
59         1050        4.8         1.7         1.6         1.0
57         1053        4.7         1.1         1.9         0.6
3-4        1057        4.8         0.8         2.4         0.3


* In a private communication Professor H. W. Melville was kind enough to explain to the
present writer that the conversion of p-H2 observed at 200° C seemed to be due entirely to the
production of the normal H2 from the hydrazine itself.





The lack of reproducibility makes the estimates of the various activation energies
very crude. However, the inspection of table 8 shows quite clearly that reaction (1)
corresponds to an activation energy smaller than that of reaction (2), and from the
change of the ratio $k_1/k_2$ with temperature a rough estimate gives a difference of
about 10 kcal. The change of $k_1$ and $k_2$ with temperature indicates activation energies
of the order of ~10 and ~20 kcal. respectively for processes (1) and (2).


References

Askey, P. J. 1930 J. Amer. Chem. Soc. 52, 970.
Bichowsky, F. R. & Rossini, F. D. 1936 The thermochemistry of chemical substances. New York:
Reinhold.
Birse, E. A. B. & Melville, H. W. 1940 Proc. Roy. Soc. A, 175, 164.
Bray, W. C. & Cuy, E. J. 1924 J. Amer. Chem. Soc. 46, 858.
Dwyer, R. J. & Oldenberg, O. 1944 J. Chem. Phys. 12, 351.
Elgin, J. C. & Taylor, H. S. 1929 J. Amer. Chem. Soc. 51, 2059.
Gaydon, A. G. 1944 Nature, 153, 407.
Gedye, G. R. & Allibone, T. E. 1931 Proc. Roy. Soc. A, 130, 346.
Gedye, G. R. & Rideal, E. K. 1932 J. Chem. Soc. p. 1160.
Glasstone, S., Laidler, K. & Eyring, H. 1941 The theory of rate processes. New York:
McGraw-Hill.
Glockler, G. 1948 J. Chem. Phys. 16, 602.
Heitler, W. & Rumer, G. 1931 Z. Phys. 68, 12.
Herzberg, G. 1937 Chem. Rev. 20, 145.
Herzberg, G. 1939 Molecular spectra and molecular structure. New York: Prentice Hall Inc.
Hughes, A. M., Corruccini, R. J. & Gilbert, E. C. 1939 J. Amer. Chem. Soc. 61, 2639.
Kistiakowsky, G. B. & van Artsdalen, E. R. 1944 J. Chem. Phys. 12, 469.
Pauling, L. 1945 The nature of the chemical bond, p. 53. Ithaca, New York: Cornell University
Press.
Skinner, H. A. 1945 Trans. Faraday Soc. 41, 645.
Szwarc, M. 1947 Nature, 160, 403.
Szwarc, M. 1948 J. Chem. Phys. 16, 128.
Szwarc, M. 1949 J. Chem. Phys. 17, 431.
Wenner, R. R. & Beckman, A. O. 1932 J. Amer. Chem. Soc. 54, 2787.



The dissociation energy of the C-N bond in benzylamine 
By M. Szwarc, Chemistry Department, University of Manchester
(Communicated by M. G. Evans, F.R.S. – Received 19 January 1949)


The kinetics of the thermal decomposition of benzylamine were studied by a flow method
using toluene as a carrier gas. The decomposition produced NH3 and dibenzyl in a molar ratio
of 1 : 1, and small quantities of permanent gases consisting mainly of H2. Over a temperature
range of 150° (650 to 800° C) the process was found to be a homogeneous gas reaction,
following first-order kinetics, the rate constant being expressed by

$$k = 6 \times 10^{12} \exp(-59{,}000/RT)\ \mathrm{sec.}^{-1}$$

It was concluded, therefore, that the mechanism of the decomposition could be represented
by the following equations:

$$\mathrm{C_6H_5{\cdot}CH_2{\cdot}NH_2 \rightarrow C_6H_5{\cdot}CH_2^{\bullet} + NH_2^{\bullet}},$$

$$\mathrm{C_6H_5{\cdot}CH_3 + NH_2^{\bullet} \rightarrow C_6H_5{\cdot}CH_2^{\bullet} + NH_3},$$

$$\mathrm{2\,C_6H_5{\cdot}CH_2^{\bullet} \rightarrow dibenzyl},$$

and the experimentally determined activation energy of 59 ± 4 kcal./mole is equal to the
dissociation energy of the C-N bond in benzylamine.

Using the available thermochemical data we calculated on this basis the heat of formation
of the NH2 radical as 35.5 kcal./mole, in fair agreement with the result obtained by the
study of the pyrolysis of hydrazine.

A review of the reactions of the NH2 radicals is given.


Introduction

In view of the complex character of the thermal decomposition of hydrazine
described in the preceding paper, it was desirable to carry out some analogous
investigations which might be expected to elucidate still further the previously
discussed topics. The study of the pyrolysis of benzylamine provided such an
opportunity. It made it possible to investigate further the fate of the NH2 radicals
and their reactions with toluene. Moreover, this study furnishes the data for a
calculation on a new basis of the heat of formation of the NH2 radical and the
dissociation energy of the first N-H bond in ammonia. The good agreement with the
values reported in the previous paper leads to an increased measure of confidence
in these results.

Experimental

The apparatus used in the present work was identical with that described in the
previous communication. The only difference in the technique, as compared with
that employed in the case of hydrazine, was in the method of estimation of the
quantities of benzylamine introduced into the system. This was done by direct
weighing of the benzylamine reservoir before and after individual experiments. It
was noted that benzylamine exposed to air behaved abnormally during the subsequent
pyrolysis. To avoid this difficulty the commercial benzylamine was distilled
in vacuo before use, the middle portion being collected directly into the reservoir.
The latter consisted of a small glass bulb provided with a solid key tap and a ground
joint. After collecting the required fraction the tap on the receiver was closed
before the pumping was interrupted, thus keeping the distilled benzylamine free
from air. During an experiment the reservoir was attached to the apparatus by
means of the ground joint and the solid key tap was opened only after the evacuation
of the connexions. At the end of each experiment the bulb was immersed in an
acetone-solid CO2 bath, and the solid key tap was closed after the residual benzylamine
vapour had been condensed back into the reservoir. Thus it was possible to
weigh the reservoir without contaminating its contents with air.

Ammonia and dibenzyl, in molar ratio of 1:1, were the main products of the 
thermal decomposition of benzylamine in the presence of excess of toluene vapour. 
Other products of the decomposition consisted of small quantities of permanent 
gases, their amounts being about 20 % of the amount of ammonia produced. It was 
shown that the permanent gas contained mainly H2 (about 70 %).

The toluene recovered from the pyrolysis was tested occasionally for the presence
of hydrazine. This was done by shaking it with water, and running the aqueous
layer into a solution of HIO3 which contained some starch. In the presence of
hydrazine, HIO3 is reduced to I2, and the latter would manifest itself by the appearance
of the blue colour with starch. All the tests proved to be negative, showing
the absence of hydrazine in the decomposition products.

Table 1

                  P toluene   P benzylamine   time of contact            k               % permanent   %
run     T (°K)    (mm. Hg)    (mm. Hg)        (sec.)            % NH3    (sec.$^{-1}$)   gas           dibenzyl

62       922       7.1         0.25            0.29              1.7      0.07            28            -
64       930       6.9         0.15            0.28              2.6      0.09            19            -
53       937       7.1         0.18            0.29              3.0      0.11            22            -
49       966       7.2         0.20            0.27              7.0      0.27            22            116
47       967      15.5         0.12            1.10             31.6      0.34            30            97
46       968      15.4         0.16            1.10             30.6      0.33            22            110
48       969       7.3         0.20            0.28              7.6      0.28            22            98
packed
31      1008       7.3         0.18            0.28             30.0      1.26            20            80
44      1005      15.4         0.08            1.05             72.6      1.2             23            85
38      1009       6.7         0.15            0.28             27.0      1.1             17            87
39      1009       6.7         0.16            0.28             25.0      1.0             17            81
40      1010       6.7         0.16            0.28             27.0      1.1             18            -
packed
29      1010       7.6         0.14            0.28             25.0      1.1             22            93
packed
30      1010       7.6         0.17            0.286            30.0      1.2             20            97
41      1012       7.3         0.05            0.28             22.5      0.9             31            -
42      1012       7.5         0.05            0.28             23.0      0.9             26            -
43      1012      15.4         0.10            1.07             71.5      1.2             29            -
61      1035       7.4         0.15            0.27             47.5      2.4             20            86
packed
50      1052       7.5         0.16            0.27             62.5      3.6             20            87
33      1064       7.6         0.17            0.26             67.0      4.3             31            91
35      1066       7.8         0.17            0.27             87.6      7.7             22            88
packed
36      1066       7.9         0.15            0.28             86.0      7.0             22            89
32      1070       7.3         0.16            0.256            77.5      5.86            29            78

The results are summarized in table 1, the last column of which gives the quantities
of dibenzyl actually isolated, expressed as a percentage of the yields of dibenzyl
calculated from the measured amounts of ammonia, adopting the ammonia : dibenzyl
ratio as 1 : 1. Assuming that each mole of ammonia formed corresponds to
1 mole of benzylamine decomposed, one finds that the kinetics of the decomposition
of this compound obey a first-order law. This was established by a fourfold
variation of both the partial pressure of benzylamine and the time of contact (see
table 1 and figure 1). Moreover, packing of the reaction vessel, which increased the
surface by a factor of about 2½, proved that the investigated process was a homogeneous
gas reaction (see figure 1). The plot of log k against 1/T over a temperature
range of about 140° (from 926 up to 1070° K) is given in figure 1, and from this
we estimated the activation energy of the decomposition as 59 ± 4 kcal./mole and
the frequency factor as 6 × 10$^{12}$ sec.$^{-1}$.
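The reduction of the data described above can be sketched as follows. This is only an illustration, not the author's procedure: the function names are hypothetical, the five runs are an arbitrary subset of table 1, and a plain least-squares line through (1/T, ln k) stands in for the graphical estimate made from figure 1.

```python
import math

R_KCAL = 1.987e-3  # gas constant in kcal/(mole K)


def first_order_k(fraction_decomposed, contact_time_s):
    """First-order rate constant (sec^-1): k = -ln(1 - f) / t."""
    return -math.log(1.0 - fraction_decomposed) / contact_time_s


def arrhenius_fit(temps_k, rate_constants):
    """Least-squares line through (1/T, ln k).

    Returns (activation energy in kcal/mole, frequency factor in sec^-1).
    """
    xs = [1.0 / t for t in temps_k]
    ys = [math.log(k) for k in rate_constants]
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    sxx = sum((x - x_mean) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = y_mean - slope * x_mean
    return -slope * R_KCAL, math.exp(intercept)


# Run 31 of table 1: 30.0 % decomposition in a contact time of 0.28 sec.
k_run31 = first_order_k(0.300, 0.28)

# Illustrative subset of table 1 (runs 62, 49, 38, 50 and 32), chosen by hand.
temps = [922.0, 966.0, 1009.0, 1052.0, 1070.0]
ks = [0.07, 0.27, 1.1, 3.6, 5.86]
activation_energy, frequency_factor = arrhenius_fit(temps, ks)

print(f"k (run 31) = {k_run31:.2f} sec^-1")
print(f"E = {activation_energy:.1f} kcal/mole, A = {frequency_factor:.2e} sec^-1")
```

With this subset the fitted values fall close to the 59 kcal./mole and 6 × 10$^{12}$ sec.$^{-1}$ quoted in the text; a different choice of runs shifts them within the stated uncertainty.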



Figure 1. O unpacked reaction vessel; × packed reaction vessel.
E = 59 ± 4 kcal./mole; frequency factor 6 × 10$^{12}$ sec.$^{-1}$


Discussion

It is beyond doubt that the weakest bond in benzylamine is the C-N bond and,
therefore, the initial decomposition step should be dissociation into benzyl and NH2
radicals,

$$\mathrm{C_6H_5{\cdot}CH_2{\cdot}NH_2 \rightarrow C_6H_5{\cdot}CH_2^{\bullet} + NH_2^{\bullet}}. \qquad (1)$$

In an excess of toluene the NH2 radicals would be rapidly removed from the system
by the reaction

$$\mathrm{C_6H_5{\cdot}CH_3 + NH_2^{\bullet} \rightarrow C_6H_5{\cdot}CH_2^{\bullet} + NH_3}. \qquad (2)$$

The benzyl radicals thus formed would dimerize giving dibenzyl. This proposed
mechanism demands the formation of 1 mole of dibenzyl for each mole of ammonia
produced, and the results tabulated in table 1 are in agreement with this conclusion.




Reaction (1) is the rate-determining step of the overall process; hence the kinetics
of the formation of ammonia should reveal the characteristics of a unimolecular,
homogeneous gas reaction. This conclusion is borne out by the experimental results.
The formation of ammonia was found to be a homogeneous, first-order process;
moreover, the experimental frequency factor is of the order of the theoretically
predicted frequency factor for unimolecular reactions (Polanyi & Wigner 1928).
We conclude, therefore, that the observed activation energy of 59 ± 4 kcal./mole
is that of reaction (1).
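As a rough consistency check, the rate expression quoted in the summary can be evaluated at a few of the temperatures of table 1. The sketch below assumes only that expression (frequency factor 6 × 10$^{12}$ sec.$^{-1}$, activation energy 59 kcal./mole); the function name and the choice of temperatures are illustrative.

```python
import math

R_KCAL = 1.987e-3   # gas constant in kcal/(mole K)
A_FACTOR = 6.0e12   # frequency factor quoted in the text, sec^-1
E_ACT = 59.0        # activation energy quoted in the text, kcal/mole


def rate_constant(temp_k):
    """Arrhenius rate constant k = A exp(-E / RT), in sec^-1."""
    return A_FACTOR * math.exp(-E_ACT / (R_KCAL * temp_k))


# Temperatures taken from table 1; the tabulated k values there are
# about 0.07 sec^-1 near 922 K and about 1.0 to 1.2 sec^-1 near 1010 K.
for temp in (922.0, 1010.0, 1066.0):
    print(f"T = {temp:6.0f} K   k = {rate_constant(temp):.3g} sec^-1")
```

The calculated values reproduce the tabulated rate constants to within the scatter of the individual runs, which is all that an uncertainty of ± 4 kcal./mole would lead one to expect.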


The dissociation energies

Making the usual assumption of zero activation energy for the recombination of
radicals one can conclude that the observed activation energy of 59 ± 4 kcal./mole
represents the dissociation energy of the C-N bond in benzylamine. Two observations,
one reported by Hurd & Carnahan (1930) and the other communicated privately
by Dr E. Warhurst and Mr B. Gowenlock, indicate that the dissociation energy of the
bond in question must be quite considerable.

According to Hurd & Carnahan the heating of benzylamine for several hours in
a sealed tube at 300° C caused no decomposition whatever, and moreover this compound
remained almost unchanged when heated for 64 sec. at 636° C. It may be
deduced from these results that the dissociation energy of the C-N bond in benzylamine
should be greater than 53 kcal./mole.

Some preliminary unpublished results of Warhurst & Gowenlock suggest a still
higher value for the dissociation energy of this bond. These workers pyrolyzed
benzylamine using a technique similar to that described by Butler & Polanyi (1943).
The extent of the decomposition was measured by the amount of ammonia formed,
the latter being estimated by Nessler’s reagent. The experiments carried out with
N2 as a carrier gas at 490 and 630° C, using a time of contact of about 0.6 sec.,
resulted in only slight decomposition of the order 0.1 %. Thus it was concluded by
the above authors that the dissociation energy of the C-N bond in benzylamine must
be greater than 57 kcal./mole.

Knowing the dissociation energy of the bond in question we are able to calculate
the heat of formation of the NH2 radical using in addition the heat of formation of
benzylamine in the gaseous state and the heat of formation of the benzyl radical. The
latter value is known to be 37.5 kcal./mole from the estimated dissociation energy
of the C-H bond in toluene (Szwarc 1948) in conjunction with the relevant thermochemical
data, namely the heat of formation of toluene and the dissociation energy
of H2. Unfortunately, there is no recently measured value for the heat of combustion
of benzylamine. The best value seems to be that of Petit (1889), which was chosen by
Kharasch in his compilation (1929). The heat of vaporization of benzylamine is not
available from the literature but by analogy with other amines a value of 11 kcal./mole
has been taken for this quantity. Thus one derives the heat of formation of
gaseous benzylamine as 14 kcal./mole.

We are now in a position to calculate the heat of formation of the NH2 radical,
using the above data, and the value of 35.5 kcal./mole thus obtained is in good
agreement with that derived in the foregoing paper (41 kcal./mole). It should be
noted that the first value involves an error of about 5 kcal./mole, neglecting the
uncertainty involved in the heat of formation of benzylamine, whilst the second is
uncertain within 2 kcal./mole.
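Written out, the calculation uses only the three quantities given above, on the assumption that the activation energy of reaction (1) equals the heat of the dissociation:

```latex
\begin{aligned}
D(\mathrm{C_6H_5CH_2{-}NH_2})
  &= \Delta H_f(\mathrm{C_6H_5CH_2}) + \Delta H_f(\mathrm{NH_2})
     - \Delta H_f(\mathrm{C_6H_5CH_2NH_2},\,\text{gas}), \\
\Delta H_f(\mathrm{NH_2})
  &= 59 + 14 - 37.5 = 35.5\ \text{kcal./mole}.
\end{aligned}
```

The result agrees with the figure given in the summary of this paper.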

Similar calculations enable us to find the dissociation energy of the C-N bond in
methylamine. The heat of formation of the methyl radical is 31 kcal./mole, calculated
on the basis of 101 kcal./mole as the dissociation energy of the first C-H
bond in methane. Taking the heat of formation of the NH2 radical as 41 kcal./mole
we calculate the dissociation energy of the C-N bond in methylamine as 77 kcal./mole
(using the heat of combustion of gaseous methylamine as estimated by Thomsen
1905), or as 81 kcal./mole (using the heat of combustion determined by Muller 1910).
It is most unfortunate that there are no recent data for the heats of formation of
various amines.
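The same bookkeeping applies to both amines and can be sketched in a few lines. The function and variable names are hypothetical. The heats of formation of gaseous methylamine printed at the end are not quoted from Thomsen or Muller; they are merely the values implied by the figures of 77 and 81 kcal./mole given above.

```python
def bond_dissociation_energy(hf_fragment_a, hf_fragment_b, hf_parent):
    """D(A-B) = Hf(A) + Hf(B) - Hf(AB), all in kcal/mole (gas phase)."""
    return hf_fragment_a + hf_fragment_b - hf_parent


def implied_parent_heat(hf_fragment_a, hf_fragment_b, dissociation_energy):
    """Heat of formation of AB implied by a given D(A-B)."""
    return hf_fragment_a + hf_fragment_b - dissociation_energy


# Values taken from the text (kcal/mole).
HF_BENZYL = 37.5
HF_BENZYLAMINE_GAS = 14.0
HF_NH2_FROM_BENZYLAMINE = 35.5
HF_METHYL = 31.0
HF_NH2_FROM_HYDRAZINE = 41.0

# Benzylamine: recovers the observed activation energy.
d_benzylamine = bond_dissociation_energy(
    HF_BENZYL, HF_NH2_FROM_BENZYLAMINE, HF_BENZYLAMINE_GAS
)

# Methylamine: back-calculated heats of formation (an inference, not source data).
hf_methylamine_low = implied_parent_heat(HF_METHYL, HF_NH2_FROM_HYDRAZINE, 77.0)
hf_methylamine_high = implied_parent_heat(HF_METHYL, HF_NH2_FROM_HYDRAZINE, 81.0)

print(d_benzylamine, hf_methylamine_low, hf_methylamine_high)
```

The spread of 4 kcal./mole between the two methylamine results is carried entirely by the two heats of combustion, which illustrates why the absence of recent data is regretted.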

The fair agreement between the two values of $\Delta H_f$(NH2) as reported in the preceding
and the present paper provides additional confirmation for the suggested
value of D(NH2-H) = 104 ± 2 kcal./mole. It is desirable, therefore, to review the
evidence contributed by other workers to the problem of the dissociation energy of
the first N-H bond in ammonia.

The pre-dissociation spectrum of NH3 observed in the region of 2300 Å by Bonhoeffer
& Farkas (1928) indicates that D(NH2-H) must be less than 124 kcal./mole
(Bonhoeffer & Harteck 1933). The results of the Hg photo-sensitized decomposition
of NH3 and ND3 obtained by Melville (1935) led this author to the conclusion that
mercury atoms are quenched by NH3 to the $^3P_0$ state, and the collision between the
latter atoms and NH3 causes it to dissociate into an H atom and an NH2 radical. In
this case the upper limit for D(NH2-H) is brought down to 112 kcal./mole, ignoring
the possible formation of HgH. Terenin & Neujmin (1935) investigated the
photo-decomposition of NH3 caused by the Schumann ultra-violet radiation. The
decomposition was accompanied by the appearance of the emission band ascribed
by these authors to the excited NH2 radicals. The overall process was assumed to be

$$\mathrm{NH_3} + h\nu \rightarrow \mathrm{H + NH_2^{*}}.$$

The observed energy threshold was 172 kcal. (1660 Å), and the observed excitation
energy was 55 kcal./mole. Thus it was concluded that D(NH2-H) is less than
117 kcal./mole.
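The spectroscopic limits quoted above follow from converting a wavelength into an energy per mole. A minimal sketch of the conversion is given here; the constants are modern SI values rather than those available in 1949, the function name is hypothetical, and the excitation energy is taken as 55 kcal./mole, the value consistent with the quoted limit of 117 kcal./mole.

```python
# Modern SI constants (exact by definition since 2019).
PLANCK_J_S = 6.62607015e-34
LIGHT_SPEED_M_S = 2.99792458e8
AVOGADRO_PER_MOLE = 6.02214076e23
JOULES_PER_KCAL = 4184.0


def photon_energy_kcal_per_mole(wavelength_angstrom):
    """Energy of one mole of photons of the given wavelength, in kcal/mole."""
    wavelength_m = wavelength_angstrom * 1e-10
    joules_per_mole = AVOGADRO_PER_MOLE * PLANCK_J_S * LIGHT_SPEED_M_S / wavelength_m
    return joules_per_mole / JOULES_PER_KCAL


# Pre-dissociation limit of Bonhoeffer & Farkas (region of 2300 A).
limit_predissociation = photon_energy_kcal_per_mole(2300.0)

# Threshold of Terenin & Neujmin (1660 A) less the NH2 excitation energy.
threshold = photon_energy_kcal_per_mole(1660.0)
limit_terenin = threshold - 55.0

print(f"2300 A -> {limit_predissociation:.1f} kcal/mole")
print(f"1660 A -> {threshold:.1f} kcal/mole, upper limit {limit_terenin:.1f} kcal/mole")
```

Both conversions land within a fraction of a kcal./mole of the rounded figures in the text (124, 172 and 117 kcal./mole).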

Further information about the value of D(NH2-H) could be furnished by the
estimation of the activation energies $E_a$ and $E_b$ of the respective processes

$$\mathrm{H + NH_3 \rightarrow NH_2 + H_2}, \qquad (a)$$

$$\mathrm{NH_2 + H_2 \rightarrow NH_3 + H}. \qquad (b)$$

The difference $E_a - E_b$ = D(NH2-H) - D(H-H). $E_a$ was estimated by Dixon (1932)
as about 8.6 kcal./mole. This result is based on the collision yield of the reaction
between H atoms and ammonia. It seems that roughly the same value should be
ascribed to $E_b$, as the reaction between NH2 radicals and hydrogen does not take
place at room temperature (e.g. Birse & Melville 1940). It is possible, therefore, that
D(NH2-H) ≈ D(H-H) = 104 kcal./mole.
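The argument rests on the fact that processes (a) and (b) are the forward and reverse directions of one reaction, so that the difference of their activation energies is the heat of reaction. As a short worked statement, using only the symbols of the text:

```latex
\begin{aligned}
\Delta H_{(a)} &= E_a - E_b = D(\mathrm{NH_2{-}H}) - D(\mathrm{H{-}H}), \\
E_a \approx E_b \;&\Longrightarrow\; D(\mathrm{NH_2{-}H}) \approx D(\mathrm{H{-}H}) = 104\ \text{kcal./mole}.
\end{aligned}
```

Since $E_b$ is only inferred to be of the same order as $E_a$, the relation gives an indication rather than a measurement.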





The fate of the NH2 radicals

The absence of hydrazine in the products of the pyrolysis of benzylamine indicates
clearly that under the conditions imposed in the above experiments the
mutual recombination of the NH2 radicals does not take place. However, the formation
of hydrazine by the dimerization of the NH2 radicals was observed by several
workers, and we shall briefly review their results.

Bredig, Koenig & Wagner (1929) reported that hydrazine was formed as a result
of an electric discharge through ammonia. The yield of hydrazine increased with the
velocity of the NH3 stream flowing through the discharge tube. Later experiments
of Koenig & Brings (1931) revealed that the yield could be increased up to 94 % by
cooling the flowing system to -80° C. This dependence of the yield of hydrazine
formation on the rate of flow and the temperature seems to indicate that the NH2
radicals were removed from the discharge tube and subsequently dimerized, most
probably on the cooled walls.

The formation of hydrazine in the photo-decomposition of ammonia in a flow 
system was observed by Gedye & Rideal (1932), who noticed also a very marked 
increase in the yield of hydrazine when lowering the temperature. 

The formation of hydrazine in the photo-decomposition of ammonia in a static
system was found beyond doubt only by Weldge & Beckman (1936). These workers
noticed that the permanent gases produced by the photo-decomposition of NH3
contained more than 75 % of H2 provided the percentage of decomposition was kept
very low. They interpreted this as a result of the formation of N2H4, and proved the
correctness of their assumption by detecting minute quantities of N2H4 adsorbed
on the walls of the reaction vessel. However, the accumulation of N2H4 was prevented
by its decomposition in the course of the reaction, and, therefore, the percentage of
H2 in the products decreased to 75 % as the percentage of the total decomposition
increased. This decomposition of hydrazine was the generally accepted
explanation for its absence in the products of the photo-decomposition of NH3 in
a static system.

There is a second way of interaction between the NH2 radicals, namely

$$\mathrm{NH_2 + NH_2 \rightarrow N_2 + 2H_2}. \qquad (3)$$

It was stated by Gedye & Rideal (1932) that the role this reaction plays in the process
of consumption of the NH2 radicals increases rapidly with rise in temperature.

Wiig & Kistiakowsky (1932) deduced that reaction (3) cannot occur as a homogeneous
gas reaction. Nevertheless, it seems very plausible that this reaction can
occur as a heterogeneous reaction taking place on the walls of the reaction vessel. 
The adsorption of the NH2 radicals on the surface of silica was demonstrated by 
Wiig (1935) who found that the quantum yield of the photo-decomposition of 
ammonia decreased rapidly for very small pressures of NH3. This effect is due to the 
diffusion of the NH2 radicals to the walls, their subsequent adsorption on the silica 
surface, and finally their recombination with the H atoms. These observations were 
later confirmed by the investigation of the influence of the size of the reaction vessel 
on the quantum yield (Wiig 1937). 





Birse & Melville (1940) developed an elegant technique (rotating sector) which
allowed them to study the behaviour of the NH2 radicals. They proved that the NH2
radicals have a very long lifetime, being able to survive on the walls of the reaction
vessel for a period as long as 70 sec., and found that the decay proceeds according to
equation (3). It should be emphasized that this reaction is quite different from the
heterogeneous decomposition of hydrazine which leads to the formation of NH3,
N2 and smaller quantities of H2. Thus the ‘normal’ N2H4 molecule is not an intermediate
in reaction (3). It may be, however, that a ‘hot’ molecule of hydrazine is
formed prior to its decomposition according to equation (3). At sufficiently low
temperatures the rate of the transfer of energy from this ‘hot’ molecule to the wall
is great enough to stabilize it before the decomposition takes place. Thus one can
explain the formation of increasing quantities of hydrazine when the temperature
is lowered.



[Structural diagrams of the transition states I and II for reaction (3).]


There are two mechanisms by which reaction (3) can proceed. One is based on the
transition state I, whilst the other assumes the transition state II. The latter is
analogous to the transition state B, postulated for the heterogeneous decomposition
of hydrazine (see the preceding paper). There is no direct information that allows
discrimination between these two possibilities. I am, however, inclined to favour the
transition state II by comparison with the results obtained by Melville, Bolland &
Roxbrough (1937) in their investigation of the PH2 radicals. These workers had
shown that the decomposition of the equimolecular mixture of PH3 and PD3 produced
H2, HD and D2. Further, they proved that the mechanism of formation of
HD did not involve H or D atoms. Thus the reaction must include the process

$$\mathrm{PH_2 + PD_2 \rightarrow P_2 + 2HD}.$$

Although these results proved the occurrence of the transition state II, the above
authors did not exclude the possibility of the reaction

$$\mathrm{PH_2 + PD_2 \rightarrow P_2 + H_2 + D_2}$$

proceeding via transition state I. However, if the latter step is to be included in the
mechanism of the reaction, then the quantity of HD produced should be smaller
than that expected for the equilibrium mixture of H2-HD-D2, whereas the above
authors explicitly stated that the H2-HD-D2 equilibrium was completely
established.

In the light of all these observations it is not surprising that hydrazine was not
detected under the experimental conditions of the present investigation. On the
other hand, it could be argued that the small quantities of the permanent gases,
formed during the pyrolysis of benzylamine, resulted from the decomposition of the
NH2 radicals as previously discussed. However, this assumption would require
that an increase in the quantity of the permanent gases formed should result from
the packing of the reaction vessel, which contradicts the actual observations. It
was ascertained by special experiments that ammonia is not decomposed in the
temperature range used in this study. These results excluded the possibility of
producing the permanent gases by the decomposition of NH3. There is no satisfactory
explanation that can be offered for the process of formation of the permanent
gases. In any case it represents only a minor side-reaction, and it does not change
the calculated activation energy, as the percentage of these gases in the products
remained practically constant for all temperatures. In consequence I have not
attempted to investigate this phenomenon further.


References

Birse, E. A. B. & Melville, H. W. 1940 Proc. Roy. Soc. A, 175, 164.

Bonhoeffer, F. K. & Farkas, L. 1928 Z. phys. Chem. A, 134, 337.

Bonhoeffer, F. K. & Harteck, P. 1933 Grundlagen der Photochemie, p. 128. Dresden: Theodor
Steinkopff.

Bredig, G., Koenig, A. & Wagner, O. H. 1929 Z. phys. Chem. A, 139, 211.

Butler, E. T. & Polanyi, M. 1943 Trans. Faraday Soc. 39, 27.

Dixon, J. K. 1932 J. Amer. Chem. Soc. 54, 4262.

Gedye, G. R. & Rideal, E. K. 1932 J. Chem. Soc. p. 1160.

Hurd, C. D. & Carnahan, F. L. 1930 J. Amer. Chem. Soc. 52, 4151.

Kharasch, M. S. 1929 Bur. Stand. J. Res., Wash., 2, 359.

Koenig, A. & Brings, T. 1931 Z. phys. Chem. Bodenstein Festband, p. 541.

Melville, H. W. 1935 Proc. Roy. Soc. A, 152, 325.

Melville, H. W., Bolland, J. L. & Roxbrough, H. L. 1937 Proc. Roy. Soc. A, 160, 406.

Muller, T. A. 1910 Ann. Chim. (Phys.), (8), 20, 116.

Petit, P. 1889 Ann. Chim. (Phys.), (6), 18, 149.

Polanyi, M. & Wigner, E. 1928 Z. phys. Chem. 139, 439.

Szwarc, M. 1948 J. Chem. Phys. 16, 128.

Terenin, A. & Neujmin, H. 1935 J. Chem. Phys. 3, 436.

Thomsen, W. 1905 Z. phys. Chem. 52, 343.

Weldge, H. J. & Beckman, A. O. 1936 J. Amer. Chem. Soc. 58, 2462.

Wiig, E. O. 1935 J. Amer. Chem. Soc. 57, 1559.

Wiig, E. O. 1937 J. Amer. Chem. Soc. 59, 927.

Wiig, E. O. & Kistiakowsky, G. B. 1932 J. Amer. Chem. Soc. 54, 1806.



The work of the National Institute for Medical Research 

By Sir Charles Harington, F.R.S.

Director of the Institute

(Lecture delivered 10 March 1949 – Received 21 March 1949)

[Plates 9 and 10] 

In the National Insurance Act of 1911 there was contained a provision which has 
proved to be of great importance for scientific work in this country. This provision 
laid down that the sum of one penny per insured person should be provided from 
public funds for the purposes of research. The total income resulting amounted to 
£55,000 in the first year of the operation of the Act, and it was for the administration 
of this sum and for the decision as to the purposes to which it should be put that the
Medical Research Committee was appointed in 1913. 

The first report of the Committee, which appeared late in 1915 over the signature 
of the first secretary, Walter Fletcher, and was submitted to the Chairman of the 
National Health Insurance Joint Committee, contained a declaration of policy 
which is of such fundamental importance to the conduct of medical research that it 
seems to me to be worth quoting in full at this time. In defining their objects the
Committee made the following statement: 

‘The object of the research is the extension of medical knowledge with the view 
of increasing our powers of preserving health and preventing or combating disease. 
But otherwise than that this is to be the guiding aim, the actual field of research is 
not limited and is to be wide enough to include, so far as may from time to time be 
found desirable, all researches bearing on health or disease, whether or not such
researches have any direct or immediate bearing on any particular disease or class 
of diseases; provided that they are judged to be useful in promoting the attainment 
of the above object.’ 

I shall have occasion later to refer to researches which have been carried out at 
the National Institute for Medical Research, and which, although they have proved 
in the outcome to be of importance to medicine, were in their inception purely 
scientific inquiries. It is clear that only within the framework of the almost 
academic freedom claimed in the statement of policy which I have read could such
researches have been undertaken in an Institute for Medical Research. 

Having laid down their general policy in these terms the Medical Research 
Committee proceeded to consider the steps to be taken for its implementation; the 
methods decided on were four in number, namely, (1) the employment of full-time 
investigators in centralized laboratories, (2) the appointment of full- and part-time 
investigators in hospitals and elsewhere for special researches, (3) the provision of 
assistance grants to workers in academic institutions engaged in independent 
researches in the general field of interest, and (4) the maintenance of a statistical 
department. The individual importance attached to the last of these items must be 


Vol. 198. A. (22 August 1949) 



ascribed to the obvious bearing of statistical work on the problems raised by national 
insurance; actually this item was merged with the first, which itself constituted 
a decision to establish a research institute. 

Very soon after the decision had been taken to set up a central research institute, 
which, incidentally, was to be paid for out of the first year’s income of the Medical 
Research Committee, negotiations were started for the purchase of the Mount 
Vernon Hospital, Hampstead, with its freehold site and grounds, at the price of 
£35,000, it being considered, in the words of the report, that these premises offered
‘exceptional advantages in amenity and convenience’ besides being readily con- 
vertible into research laboratories ; in addition, it was at this stage intended that 
part of the building should be used as a research hospital of fifteen to twenty beds. 

Whilst these negotiations were in progress a proposal was made to the Medical 
Research Committee by the Governing Body of the Lister Institute, that the latter 
should, subject to the agreement of the members and to satisfactory provision for 
the existing staff, be handed to the nation for use as the central research institute 
of the Committee. The importance which was attached by the Medical Research 
Committee to the research hospital part of their project is shown by the fact that 
the first check in the negotiations with the Governing Body of the Lister Institute 
arose because the latter possessed no hospital; this difficulty was overcome by the 
generous offer of Lord Iveagh to build a fifty-bed hospital on a site adjoining the 
Lister Institute. Moreover, the Medical Research Committee were prepared to take 
into their employment the whole of the existing staff of the Lister Institute.
However, when the proposals were finally submitted to the members of the Lister 
Institute for approval they were rejected and the scheme came to nothing. 

So long as the result of the negotiations between the Medical Research Committee
and the Governing Body of the Lister Institute remained in doubt it was naturally
impossible to proceed with the work of converting the buildings at Hampstead; the
result of this delay was that the outbreak of war in August 1914 found these
buildings untouched, and in October of that year it was decided to hand them over
to the War Office for use as a military hospital. Thus the control of the Hampstead
premises passed out of the hands of the Committee for five years, although much
work was done on these premises during that period in which the Medical Research
Committee had a direct interest and in which members of their staff took a leading
part. When the Mount Vernon Hospital was handed to the War Office it was
selected for the study of cardiac disorders (the so-called ‘soldier’s heart’) and the
direction of this work was undertaken by Thomas Lewis, who was seconded to
Hampstead from University College Hospital, where he was already working as
a full-time member of the Committee’s scientific staff. The work on cardiac disorders
was continued at Hampstead until 1917 when it was transferred to Colchester, and
the Mount Vernon Hospital then became a Central Hospital for Flying Officers;
a considerable amount of work was done on the special physiological and medical
problems raised by the stresses of flying, and in this again the Medical Research
Committee took their part.

In preparation for the staffing of the central institute the Medical Research
Committee had already, during 1914, made certain key appointments for the three
main divisions of the work which were at that time envisaged. These appointments
were those of Sir Almroth Wright and S. R. Douglas in bacteriology, of H. H. Dale,
G. Barger and A. J. Ewins in pharmacology and biochemistry, and of Leonard Hill,
Benjamin Moore and Martin Flack in applied physiology; the department of
statistics, which was also to be housed in the central institute, was put in the charge
of John Brownlee. Since the Hampstead building was not available these workers
had to be accommodated elsewhere, and during the war the bacteriologists were
based on St Mary’s Hospital, although much of their time was spent in France, the
pharmacologists and biochemists were at the Lister Institute, and the applied
physiologists at the London Hospital; the department of statistics occupied a house
in Guilford Street, Russell Square.

The work of the scientific staff of the potential institute during the years 1914-19
was inevitably almost entirely concerned with problems raised by the war; this
work I can only mention in passing. Sir Almroth Wright and S. R. Douglas, later
joined by L. Colebrook and Alexander Fleming, were engaged on a continuous study
of wound infections and dysentery. The pharmacologists and biochemists led by
H. H. Dale worked on shock, on trinitrotoluene poisoning and, in conjunction with
Clifford Dobell, on amoebiasis; Dale, in particular, also gave much of his time to the
control of drugs, especially the arsenical chemotherapeutic agents, and this work
was of great importance in relation to later developments of biological standardization
to which I shall have to refer. The applied physiologists were occupied with
studies of environmental hygiene, and later in the war took part in the research on
aviation problems in which Martin Flack was closely concerned.

Possession of the Hampstead building was regained by the Medical Research 
Committee in June 1919, and work was begun on its conversion into laboratories in 
the anticipation of occupation by November of that year. Actually, although it was 
possible to instal the statistics department at an early stage, occupation by the 
laboratory workers was delayed until April 1920. Even then the equipment was 
incomplete, and it is recorded that only ‘minimum decorative completion’ was 
attempted. During the process of conversion of the hospital into laboratories the 
nurses’ home adjoining the main building was acquired by the Committee and let 
to the head of the department of biochemistry and pharmacology, H. H. Dale. 

In 1920 the Medical Research Committee was reconstituted under grant of a Royal 
Charter as the Medical Research Council, a body responsible to the Committee of 
the Privy Council for Medical Research, of which the Lord President of the Council 
is Chairman. In the Report of the Council for that year the establishment at 
Hampstead appears for the first time under its present name of the National 
Institute for Medical Research; it is noteworthy that by this time the plan for the 
incorporation of a research hospital within the Institute on which so much stress 
had previously been laid had completely disappeared, although no reference to its
formal abandonment is to be found in the Reports of the Council. Reference is
made in 1920 to the Institute in the following terms: 

‘ The research work which has been in progress there in all the departments during 
the past six months has already proved, as the Council think, that the building,
though constructed for hospital purposes, is admirably suited for the purposes now
in view. This working experience has fully justified the original choice of this 
building made by the Medical Research Committee in 1914. With very little 
structural alteration the rooms have provided convenient and effective laboratories. 
The workers enjoy almost complete immunity from fog during the winter months, 
while the building and grounds, though within two minutes walk of the Hampstead 
Tube Station, are free from the noise of traffic and from electric or other disturbances 
outside.’ 

Although my present colleagues who have had to work under conditions of some 
overcrowding and discomfort during the last few years in the Institute may regard 
this as a somewhat rosy picture, there is no doubt that in fact the Hampstead 
building did serve its general purpose admirably for a long time and that the 
advantages of its situation were not overrated. Nevertheless, unforeseen develop- 
ments of work with special requirements soon made further provision necessary, 
and in 1921 the Council purchased a site at Mill Hill where farm laboratories were
erected for investigations of viruses which could not be accommodated at Hamp- 
stead. Moreover, the value of the Hampstead building itself was greatly increased 
by the addition in 1928 of an animal house, the cost of which was provided from 
funds bequeathed to the Council by the late Lord Justice Ronan. 

The additional accommodation made available by the construction of the Ronan 
Building, and by the removal of the Department of Statistics to the London School
of Hygiene which occurred in 1927, made the total laboratory facilities at Hampstead
adequate until the decision of the Government in 1936 to allocate an additional sum
of £30,000 a year to the Council for development of research in chemotherapy
raised a new demand for expansion. Since it appeared that such expansion could 
not be satisfactorily carried out on the Hampstead site the Council made the 
important decision to erect a new and larger building on their property at Mill Hill 
to the acquisition of which reference has already been made. This new building was 
to be designed to house all the existing activities at Hampstead and to provide for
the necessary expansion of work in chemotherapy and in other fields of growing 
importance. 

Work on the new building was begun in 1937, and by the summer of 1940 the 
main structure was complete. At this time, owing to the situation created by the 
war, the work had to be interrupted and the building was lent by the Council to the 
Admiralty, by whom it was used as a training establishment for the W.R.N.S. until 
April 1945. Since then, with many delays, the work of completion of the building 
for the purposes for which it was originally designed has proceeded, and is now 
nearly finished, so that from a physical point of view the Institute at present stands 
on the threshold of the greatest development in its history. 

The organization of the Institute in its early days was somewhat peculiar. We
have seen that in the appointment of scientific staff the Council had selected leaders
in three main branches of laboratory work, in addition to statistics, which stands
outside the present discussion. These senior members of the staff each had a depart- 
ment, but there was no Director. Each of the heads of departments, and, indeed, 
all other members of the scientific staff, were regarded as being directly responsible 
to the Council through the Secretary. 




As might be expected, such an arrangement ultimately proved inconvenient in
practice, and in 1928 it was revised by the appointment of H. H. Dale as the 
Director of the whole Institute. Nevertheless, the idea which lay at the back of the 
initial organization was an important one; this was that rigid division of the 
Institute into departments was at all costs to be avoided. The importance of 
maintaining free intercourse between workers in the different laboratories at the 
Institute was repeatedly emphasized in the earlier Reports of the Council. It has 
remained a guiding principle in the administration of the Institute to the present 
day, and I sincerely hope that it will continue to do so in the future. 

Turning now to the actual scientific work of the Institute we have first to notice 
that the staff which finally assembled at Hampstead in 1920 was different in several 
important respects from that which had originally been appointed. At the end of 
the 1914-18 war Sir Almroth Wright and Alexander Fleming returned to St Mary’s
Hospital, and George Barger was appointed to the Chair of Medical Chemistry at
Edinburgh, whilst A. J. Ewins had already taken up industrial research work.
When the Institute opened therefore the three laboratory departments were headed 
by S. R. Douglas, H. H. Dale and Leonard Hill respectively; important accessions 
to the staff at this time or within the next year or two were W. E. Gye, Clifford 
Dobell and P. P. Laidlaw in pathology, H. W. Dudley and Harold King in
biochemistry and pharmacology, and Percival Hartley, who later took charge of the
work on biological standards. 

For the first year or two at Hampstead senior members of the staff were chiefly 
occupied in winding up their researches of the war years, but soon several main 
lines of new work began to emerge, and it is on the discussion of some of these main 
lines that I must base my account of the scientific work of the Institute. One of the
earliest researches to be begun was that on virus diseases; since this has remained 
a major theme at the Institute until the present time, and since the development 
of the work has some points of general interest I propose first to give this some 
attention. 

The importance of virus diseases in human medicine coupled with the ignorance 
prevailing at that time concerning the viruses themselves were the reasons for the 
decision to begin systematic investigations of the general problem. The first attempts 
were directed towards the cultivation of viruses in vitro; these having failed, 
attention was turned to a naturally occurring animal virus disease, namely, dog
distemper, which promised to afford a direct experimental approach. With the 
support of funds raised by the Field newspaper, a study of dog distemper was 
therefore undertaken, the leader of the investigation being the late P. P. Laidlaw 
with collaboration on the veterinary side from G. W. Dunkin who was appointed to 
the staff for the purposes of this work. Early on in the study a discovery which 
later proved to be of major importance was made in the scientific proof that the
ferret was susceptible to distemper, a fact which had long been a matter of popular 
belief. This meant that a convenient experimental animal was available in which 
the disease could be produced at will, and the conditions were thus provided for
a close study of the biological behaviour of the infective virus. In the course of
a few years’ work sufficient knowledge was accumulated to make possible the
production of an effective vaccine against distemper, with the aid of which, as is 
now well known, the disease in dogs can be prevented. 

The problem of dog distemper having been solved, some time was spent in the 
study of various other viruses which could be attacked experimentally, including 
the bacteriophages which were then suspected and are now generally recognized to 
be viruses parasitizing bacteria. Then in 1933 came the outstanding discovery by
Laidlaw, Andrewes and Wilson Smith of the influenza virus. This discovery was
made possible by a return to the technique of the dog distemper work, the ferret again
proving to be susceptible to the disease and providing the means for experimental 
study of the virus. Since that time of course various other techniques for the 
experimental investigation of viruses have been developed, most notably the 
method of cultivation in developing eggs first discovered by Goodpasture in the 
United States and later improved by F. M. Burnet working in the Institute, and 
there is no doubt that these techniques have greatly extended the possible range of 
investigations. Nevertheless, it can reasonably be claimed that the work on dog
distemper followed by that on influenza really opened up the whole field of precise 
study of virus diseases in animals and man. 

One piece of work which at first formed part of the general virus investigation 
and which deserves special mention is that of W. E. Gye on the virus factor involved 
in the production of transmissible tumours. Although all the perplexities of the
problem have not been resolved, this work, since greatly extended by Gye himself 
under other auspices and by many workers elsewhere, together with the inde- 
pendent contribution made to it by C. H. Andrewes, has undoubtedly had a pro- 
found influence on cancer research, and it is a matter of pride that it was initiated 
at the Institute. 

The development of knowledge of the biological and epidemiological properties 
of viruses which resulted from the investigations which I have mentioned naturally 
led to the desire for more information about their physical properties. Of these the 
one which clearly first required attention was the actual size of the virus particles, 
and this problem was attacked in two ways, namely, by the development of quantitative
methods of ultrafiltration and by the refinement of optical technique. The
work on ultrafiltration resolved itself into a study of the mode of preparation and
of the behaviour of standardized collodion membranes having average pore sizes 
within closely defined limits; by the use of such membranes which were developed
by W. J. Elford in the course of a number of years of work and which are now in 
common use, it was possible to form an estimate of the size of the particles of a virus 
by ascertaining the minimum average pore size of a membrane which would allow
the virus to pass. In this way the sizes of the particles of a number of viruses were
assessed, and the values found have in many cases been confirmed by the use of
other physical methods since developed. 

The particles of most viruses are of course so small that they cannot be observed
microscopically with the ordinary microscope; in order to obtain the degree of 
resolution required it is necessary to use light of wave-lengths shorter than those of 
the visible part of the spectrum. The use of ultra-violet light offered obvious
possibilities in the direction required, and much effort was expended by J. E.
Barnard at the Institute in the development of a satisfactory system of ultra-violet 
microscopy. The effort put into this work was richly rewarded, not only in the
results obtained in the observation of viruses for which it was originally carried out,
but in the foundation which it laid for the modern cytological work which, in many
laboratories throughout the world, is doing so much to unravel the chemical nature 
and the biochemical behaviour of the cell nucleus. 

The advent in recent years of the electron microscope, with a power of resolution 
of a different order from that of any optical instrument, has provided another
powerful tool for the study of minute objects. Full use has been and is now being
made at the Institute of the electron microscopical technique for the examination
of viruses. With the aid of new methods of isolation and purification of viruses, and 
improvements in the method of preparation of specimens, the use of the electron 
microscope seems indeed to be placing the study of the morphology of viruses upon 
a new plane. 

Whilst great advances have been made in many laboratories in the acquisition
of general knowledge about the behaviour and properties of viruses since the
beginning of the work at the Institute, it cannot be claimed that progress with the 
practical treatment of the diseases which they cause has been equally rapid. Indeed, 
a new advance in this part of the field is the outstanding need of the present. 
Recent partial successes in the United States, for instance, the discovery of antibiotics
such as Chloromycetin which cures certain rickettsial diseases and aureomycin
which may be active against some true virus infections as well, seem to promise that
the advance will come from chemotherapy, and this aspect of the problem is one 
which is naturally attracting a good deal of attention at the Institute as elsewhere. 

Apart from the direct scientific and medical significance of the work on viruses 
about which I have been speaking, the progress of development of the work and
the way in which the problem as a whole has been tackled, have, I think, some
points of general interest to which I would like to refer. When the investigation was 
started it was naturally regarded as essentially a medical problem, and the work 
was therefore initiated by men whose experience lay in the fields of bacteriology 
and human pathology. As soon as it appeared that the direct attack was unpro- 
mising the direction of effort was diverted to an animal disease, the necessary 
expert help being obtained by the inclusion of a veterinary surgeon in the group 
responsible for the research. Again, as the general programme progressed, further 
additions were made for extensions of the work on the physical and optical sides. 
In each case the new men who joined the group did so as equal partners, and with 
freedom simultaneously to develop their own ideas and lines of investigation, which
they did with good effect. 

There is nowadays a good deal of discussion as to the part which can usefully 
be played by teams in research. It seems to me that the history of the virus work 
at the Institute which I have just described forms a good illustration of effective 
team work in medical research, although at no time has there been any pretence at 
the formal constitution of a team. It is perfectly clear that the field of scientific 
effort covered by the research as a whole has been so wide that no one man unaided 
could have made much progress. On the other hand, at the outset of the investigation
it was quite impossible to predict the devious paths which would have to
be followed and the different branches of science into which excursions would have 
to be made. In these circumstances the course which was actually followed seems 
to have been the best, the essential feature being readiness at all times to extend 
the investigation in any desired direction by the addition of the help of an appro- 
priate worker. In this way there is in fact ultimately formed a team of investigators 
whose efforts are all generally directed to the same end; but the team is never 
rigidly defined; it grows only in response to the demands of the work, and the 
individual members retain a considerable degree of scientific independence and 
scope for initiative. I am myself convinced that, for the organization of work on 
problems requiring co-operative effort for their solution (and this includes many
problems of medical research as it has developed to-day), these are the correct
principles. 

This, however, is a digression, and I must return to the main theme of the 
scientific work of the Institute. The next line of investigation to which I wish to 
refer, particularly because it has been mainly associated with the name of the first 
Director, to whom it eventually brought the honour of the Nobel Prize for Medicine, 
is one which in its various ramifications has occupied the attention of a large 
proportion of the scientific staff of the Institute for many years. In general, it may 
be defined as the study of the chemical control of bodily functions, and, in fact, it 
has been mostly concerned with the biochemical and physiological investigation of 
two compounds, histamine and acetylcholine. 

The first of these had long been known as a substance with intense physiological
activity, which occurred in nature as a constituent of ergot and as a metabolic 
product of the action of certain bacteria on the commonly occurring amino-acid 
histidine. Several of the effects produced by the administration of histamine were 
closely similar to those observed in anaphylactic shock, and largely as the result of 
work by Dale it had come to be assumed that the phenomena of anaphylactic shock 
and of allergy in general were actually due, at least in part, to the liberation of 
histamine. It also appeared, both from observations by the late Sir Thomas Lewis
on the nature of the response of the skin to injury, and from further physiological 
studies of histamine carried out in the Institute, that liberation of this compound, 
with its known power of dilating capillaries and causing changes in their permeability,
might be responsible for many of the local reactions of tissue to
trauma. 

These suppositions naturally implied the existence in normal tissues of a source 
of histamine, a substance which had so far only been found in the mammalian 
organism in a situation in which it was presumably a product of bacterial action. 
In parallel with the physiological studies therefore a systematic search for histamine 
in various normal animal tissues was begun by H. W. Dudley in the Institute, and 
led to the discovery that histamine in a bound form was in fact distributed quite 
widely in animal tissues, in some of which, particularly lung, it occurred in sur- 
prisingly large amounts. Thus the objective evidence was supplied which was 
necessary to establish the supposed role of histamine in producing the phenomena 
of anaphylactic shock, allergy, and local tissue reaction to injury.






[Plate facing p. 300: New National Institute for Medical Research, Mill Hill (in course of completion). View from the south-east.]

In the course of the biochemical work on histamine a discovery was made which 
turned out to be of great physiological importance; this was the observation that
extracts of normal spleen contained small amounts of the highly active substance
acetylcholine. This compound was hitherto known in nature only as a constituent
of ergot. Sufficient information had, however, already been acquired about its 
physiological actions to suggest that either acetylcholine itself or a substance 
closely resembling it might be concerned in the production of the effects of stimu- 
lation of the parasympathetic nerves. The demonstration of the actual occurrence 
of acetylcholine as a normal constituent of the body naturally gave a great stimulus 
to this idea which was actively developed by Dale with a succession of colleagues 
including J. H. Burn, J. H. Gaddum, W. Feldberg, G. L. Brown and F. C. MacIntosh,
in work extending over many years at the Institute and by workers elsewhere. The 
details of the investigations are not for discussion here; the great generalization to 
which the work led may, however, be recalled: this is the conclusion that acetyl- 
choline is the substance responsible for the transmission of nervous stimuli from 
the peripheral ends of parasympathetic nerves to plain muscle, through the ganglionic
synapses of the sympathetic nervous system, and at the nerve endings of
motor fibres to voluntary muscle; that is to say, it is the chemical transmitter for 
the effects conveyed by the whole of the efferent fibres of the peripheral nervous 
system except the postganglionic fibres of the sympathetic system for which a 
similar part is played by adrenaline. Not only did this generalization form a new
chapter in neurophysiology, but the implications of the work as a whole for neuro- 
logical medicine have been highly significant. 

Turning to another series of physiological and biochemical researches, the subject 
of carbohydrate metabolism in its various aspects is one which has attracted a good 
deal of attention at the Institute. The interest aroused by the first announcement
in 1922 of the discovery of insulin at Toronto caused the Council to send H. H. Dale 
and H. W. Dudley to Canada in order to report on steps which should be taken to 
ensure the rapid and proper development of the discovery so far as this country was
concerned. The immediately important result of this mission was the acceptance by 
the Council of the British rights in the patents covering insulin manufacture, and 
the employment of the control given by these rights to establish British production 
of insulin on a scientifically sound basis. 

A contribution to the method of purifying insulin was made at the Institute 
which was of some considerable use in the early stages of its manufacture in this
country, and the methods for its biological assay were studied in detail in connexion
with the control of the quality of the product sold to the public. Apart from this,
however, the availability of insulin stimulated interest in the details of the hormonal
control of intermediate carbohydrate metabolism. In the course of some years
experiments were carried out which revealed several of the underlying facts determining
the utilization of carbohydrate in the animal body. A later development
arose from the study of the role of hormones produced by the anterior pituitary 
gland in the control of carbohydrate metabolism, and this eventually led to what 
is probably the most important discovery in this particular field of work to be 
made in the Institute, namely, the observation by F. G. Young that it was possible
by purely hormonal means to produce experimentally in dogs and cats a permanent 
condition of diabetes indistinguishable from the natural disease as it occurs 
in man. 

Before leaving the subject of biochemistry there is one other investigation which 
I should like to mention, since the results of the work will always be associated 
with the name of the Institute; this is the research work which led to the identification
of one form of vitamin D, that which is now known as vitamin D2. The whole
study was again pre-eminently an example of co-operative research; it started with
an inquiry into the nature of the process by which vitamin D activity was conferred 
on sterols by irradiation with ultra-violet light; this inquiry revealed the identity 
of the provitamin with ergosterol and opened the way for experiments on the direct
irradiation of this compound which ultimately led to the isolation of pure vitamin
D2, or, as it was called, calciferol. This work is a further example of a research
carried out by a team which grew during its progress; having been initiated by
O. Rosenheim and T. Webster it was continued with collaboration, among others,
from R. B. Bourdillon on the physical side, and in its organic chemical and
biochemical aspects from R. K. Callow and Miss H. M. Bruce.

Apart from this detailed study of one vitamin, and a contribution on the nutri- 
tional significance of phytic acid in cereals, nutritional research has not been under- 
taken to any considerable extent within the Institute itself; on the other hand, in 
the associated Nutrition Laboratory a continuous programme of such work has 
been maintained, amongst the results of which we may recall the investigation of 
abnormalities of bone growth associated with vitamin A deficiency which formed 
the subject of Sir Edward Mellanby’s Croonian lecture to this Society in 1943, 
and the recently much discussed discovery of the toxic effects of ‘agenized’ 
flour. 

The Medical Research Council have always shown themselves to be fully conscious
of the contribution which chemistry can make to medical research. We have
already seen the part which chemists and biochemists have played as collaborators 
in several of the major themes of research which I have mentioned. They have, 
however, also made their own independent contributions, and particularly in the 
development of chemistry in relation to pharmacology and chemotherapy. We may 
recall, for instance, two advances which have resulted from a large amount of work 
on alkaloids. The first of these arose out of clinical observations by Chassar Moir
at University College Hospital; it led to the isolation by H. W. Dudley from ergot
extract, at a time when it was thought that everything was known about the 
alkaloidal content of this material, of a new alkaloid which actually turned out to 
be the really important constituent of ergot from the point of view of its practical use 
in medicine. The second is the work of Harold King on curare alkaloids which has led
to the isolation and chemical identification of the pure alkaloid D-tubocurarine
which is individually responsible for the typical physiological action long associated
with curare. This work also has had its therapeutic application in the extensive use
which has been and is still being made of D-tubocurarine as an adjuvant in anaesthesia
to facilitate the task of the surgeon by providing more complete muscular
relaxation. As an extension of the investigation of curare an interesting advance
has been made quite recently which again may have practical application in
medicine; this consists in the development, by chemists and pharmacologists
working in collaboration, of a series of simple bis-quaternary salts, one of which
promises to be therapeutically equal to or better than D-tubocurarine, whilst others
may find therapeutic applications of quite a different kind.

So far as chemotherapy is concerned, this has always been a primary interest in 
the organic chemical laboratories of the Institute. It cannot be denied that the 
workers in these laboratories have had their full share of the disappointments
which seem to be inevitably associated with this most exacting and tantalizing 
form of scientific effort. Chemotherapy is a subject in which, at least until recent 
years, there has been little or no basis of theory upon which a logical attack of 
a problem can be planned. This being so, the probability of success has necessarily 
been determined by the quantitative magnitude rather than the qualitative nature 
of the experimental investigation. In comparison with the effort which has been 
put into chemotherapy by large industrial organizations, that at the Institute has, 
of course, been extremely small; nevertheless, it has not been without its successes, 
and in one case at least, namely, in the part played in the development of the 
amidine drugs for the treatment of kala azar, the practical outcome has been 
significant. At any rate there is every intention that chemotherapy in all its aspects 
shall continue to be a principal feature of research work at the Institute, particularly 
in relation to the outstanding problems of the chemotherapeutic treatment of 
tuberculosis and of true virus diseases. 

The staff of the chemical laboratory at the Institute have also been responsible 
for a major theoretical contribution to chemistry. This was the suggestion which 
was advanced in 1932 by Rosenheim and King for the structure of cholesterol and 
which brought about a reorientation of the chemistry of the whole group of steroids 
— a group that includes so many compounds of biological importance. 

One of the more remarkable developments in medical and biological research in 
recent years is the increasingly important part which is played by physical methods. 
So far as the Institute is concerned the illustration of this, which is very much in my 
mind, is that the plans which had been made immediately before and during the 
early part of the recent war for the occupation of the new building have had to be 
fundamentally revised to provide the new facilities which are demanded by this 
particular branch of the work. We have already seen how even in the early stages of 
the virus work special physical and optical techniques had to be employed in order 
that the progress of the work should be maintained, but at this time the techniques 
in question were relatively simple, and modest in their demands for space and 
personal effort. Now it is entirely necessary to provide facilities for much more 
complicated procedures, such as phase contrast microscopy, electron microscopy,
electrophoresis, high-speed centrifugation and ultrasonics. In addition to all this
the introduction of the use of isotopes as valuable tools in biological research and 
particularly the almost unlimited availability of radioactive isotopes for this purpose 
has necessitated the undertaking of an amount of purely physical work which 
would have seemed fantastic for an institute of medical research even as little as 
10 or 12 years ago. 




All the physical techniques which I have mentioned are now available in the
Institute, where they are used not only by workers within the Institute itself, but 
for those engaged in medical research problems in other institutions, who require 
to use special physical methods for their work but have not themselves access to 
the necessary apparatus and skilled help. This state of affairs is particularly notice- 
able in respect of the isotope work; on the one hand, the Institute mass spectro- 
meter is made available for analyses for medical research workers in Medical 
Research Council Units or University laboratories who are using stable isotopes in 
their experiments; on the other, the staff of the Biophysics division have the task, 
at least so long as the present position of dependence on American supplies persists,
of receiving, assaying and distributing the radioactive isotopes required for the 
many research projects which are being carried on throughout the country with the 
aid of these materials, in the medical and biological field. Naturally the ready 
availability of this variety of physical techniques within the Institute is itself 
a stimulus both to the further development of the methods themselves and to their 
application wherever this seems likely to aid the solution of a problem.

To return to the biological work of the Institute I must first refer to that of the 
Department of Applied Physiology. It will be remembered that this department, 
under the leadership of Leonard Hill, was one of the main divisions at the outset of 
the activities of the Institute, and it remained so until Hill’s retirement in 1930. 
The first main theme of work of this department was the study of environmental 
factors in their relation to health and activity, and among the more important 
results of this work was the development of the kata-thermometer, an instrument 
which has been widely used for many years and is still employed with good effect in 
the assessment of conditions of ventilation. Much time was also spent in the study of 
the physiological effects of ultra-violet light, and this work linked up with the 
inquiry already described which ultimately led to the isolation of calciferol. In still 
another aspect of its work the department was concerned with the physiological 
effects of high and low atmospheric pressures, the former particularly in relation 
to the problems of deep-sea diving. It is of some interest to note that after a lapse 
of 20 years there is once more to be a division of Applied Physiology within the 
Institute, the staff of which will again be engaged in the study of the effect of 
environmental conditions on various human activities. 

Shortly after the department of applied physiology ceased to exist as a separate 
division the biological work of the Institute was extended in another direction by 
the establishment of a division of endocrinology under A. S. Parkes. This became 
a widely recognized centre for the study of the sex hormones in particular, and was 
responsible not only for advances in the knowledge of the fundamental physiology 
of these compounds, but for practical discoveries which facilitated their therapeutic 
use. The interest of the workers in this department in the anterior pituitary gland 
had also much to do with the development of the experiments of F. G. Young on 
diabetes which have already been mentioned. As time has gone on the activities of 
this division have extended beyond the limits of endocrinology, and its field of work 
is now better covered by the more general title of experimental biology, the current 
interest being chiefly in problems of fertility. 





The work of the National Institute for Medical Research 

I come now to a part of the work of the Institute which is not confined to pure 
research, namely, the maintenance of biological standards. I have already referred 
to the work which had to be done during the 1914-18 war on the control of drugs, 
particularly arsenicals, and for which H. H. Dale was chiefly responsible. This work 
was the foundation of an enduring interest in problems of standardization of 
physiologically active compounds whose potency had to be assayed by biological 
means. Many substances which have important uses in therapeutics are not, 
particularly when they are first introduced, sufficiently well defined to be properly 
assayed by chemical and physical tests. Such a substance therefore has to be tested 
for potency by a biological test for its specific activity, and all biological tests are 
subject to an inherent variability which makes them unsuited in themselves as 
a basis of quantitative measurement. The only way out of the difficulty is to set up 
a permanent standard preparation of the substance in question and to define 
biological activity in terms of a known weight of this standard preparation. If new 
preparations are then tested biologically in direct comparison with the original 
standard or with a substandard prepared therefrom their activity can be expressed 
in terms of a known weight of a stable substance, instead of in terms of the behaviour 
of an animal on a particular day. It is on this fundamental principle that the whole 
work of biological standardization is based. 

In the Report of the Council for 1920 we find the remark that at every point the 
work of the departments at the National Institute for Medical Research has concern 
with biological standards, and it is noted that a department of biological standards 
had been set up at the Institute under H. H. Dale with appointment of an additional 
member of the staff for special work in connexion with it. 

The main types of substances for which biological standards are needed are 
serological preparations such as antitoxins, vitamins, hormones, and certain drugs 
such as digitalis. In the early years at the Institute much work was done on the 
preparation of standards for diphtheria and tetanus antitoxins, digitalis, and 
posterior pituitary gland. In 1923 steps were taken to correlate the work which 
had been done up to that time on biological standards in this and other countries, 
and international agreement was obtained for the establishment of standards for 
diphtheria antitoxin and posterior pituitary gland. In the same year the League of 
Nations Health Organization assumed responsibility for biological standardization 
as a whole; the international status of the work has been maintained from that 
time; after the recent war responsibility was taken over by an interim commission 
and it is now passing into the hands of the World Health Organization. In 1922 
began the long association with work on standards of Percival Hartley, who was 
appointed to the staff in that year and later directed the Department of Biological 
Standards within the Institute until 1946. 

A further event which was of importance to medicine in this country occurred in 
1925 when the Therapeutic Substances Act was passed. This Act provided for the 
official control of therapeutic substances whose potency had to be determined by 
biological means and in the exercise of this control a new responsibility was placed 
upon the staff of the Department of Biological Standards at Hampstead. In the 
schedules of the Act the units of activity of each of the therapeutic preparations 




are defined in terms of the standards which are preserved in the National Institute 
for Medical Research. The Department there is responsible for the issue of samples 
of the appropriate standards to approved manufacturers who may require them, 
and, furthermore, for ensuring that the products placed on the market by these 
manufacturers satisfy the requirements laid down in this Act. 

As time has gone on and as new biological products have been brought into 
therapeutic use, the work of the Department of Biological Standards has consider- 
ably increased. There are now some thirty-seven standards preserved at Hampstead, 
of which thirty-five have international status, mostly in the groups of vitamins, 
hormones and serological preparations, with the addition recently of antibiotics 
such as penicillin and streptomycin. Samples of these standards are, as I have said, 
issued to manufacturers, who customarily prepare their own substandards for the 
control of their products, and with whom the closest touch is maintained on all 
technical questions arising from the work. The standards are also available to 
research workers in all countries. In many countries National Control Centres have 
been established, and where these exist distribution of the ultimate reference 
standards from Hampstead to the countries concerned is effected through 
them. 

The actual way in which the work on standards is done at the Institute is perhaps 
worth noting. The amount of technical work required in connexion with the 
establishment of new standards may be considerable, but the demands for such 
work are inevitably intermittent, and the experimental technique required may 
fall in any one of a large variety of fields. For this reason it has not been thought 
wise to establish a large department devoted exclusively to work on standards. The 
director of the Department has the responsibility for the co-ordination of the whole 
of the work, and for expert advice and assistance he is able to call on his colleagues 
in the other departments of the Institute who possess the relevant special knowledge. 
This arrangement has the great advantage that the necessary technical work 
becomes an occasional demand on the time of members of the scientific staff who 
are otherwise engaged on their own research problems, instead of being a full-time 
occupation of a somewhat restrictive character. In actual fact many members of 
the staff who have given most help with standards, including those in the Depart- 
ment itself, have themselves been responsible for some of the more important 
contributions in research. One need only recall the researches of Hartley himself on 
general problems of immunology and those of Bruce White on the cholera vibrio to 
realize the truth of this statement, whilst among workers less closely connected 
with the Department, the physiologists and the endocrinologists have given much 
valuable assistance to this work. 

Biological standards, besides forming an essential part of the system of control of 
biological therapeutic products, offer a substantial service to research ; moreover, the 
work which arises in connexion with them itself stimulates research particularly in 
the important field of biological assay. The existence of the Department is further- 
more of great value to the Institute itself; through the department valuable contacts 
are maintained with the research laboratories of industrial firms, and, since the 
preparatory work for the establishment of new standards is always done so far as 




possible with international co-operation, contacts are also close with research 
workers in other countries. 

There is no part of a research institute which is of more importance to the general 
progress of the work than the Library; I cannot conclude my description of the 
National Institute for Medical Research without mentioning that as a result of the 
liberal policy of the Council and of the devoted efforts of the librarians the Institute 
now possesses a library which, having started from very small beginnings, has 
become one of the best medical research libraries in the country. The facilities of 
this library are of course not restricted to workers within the Institute itself, but 
are available to other workers under the Council, and, by special arrangement, to 
medical research workers elsewhere. 

This brings me to the end of my outline of the main activities of the National 
Institute for Medical Research. It has been an account which makes no claim to 
completeness, but in which selected themes have been briefly discussed as illustrations 
of the type of work which is done and of the main trends of development. 
The story which I have told is one of an expanding effort, the expansion relating 
both to the numbers of the scientific staff and to the range of scientific work which 
is covered. This expansion has been continuous, not even being completely interrupted 
by the recent war. During the war the activities of the Institute were fully 
maintained, although there was inevitably and properly a considerable temporary 
redirection of the research effort to problems of immediate importance; among 
these may be mentioned in passing the work of the physiologists on underwater 
physiology in connexion with submarine warfare, work on measures of protection 
of troops against flamethrower attack, and studies of protective measures against 
rickettsial diseases of military importance. With all this it was possible to keep pace 
with new technical developments and to maintain a background of fundamental 
research so that resumption of normal activities on an increased scale was not too 
long delayed. 

Looking in more detail at the process of expansion of the work at the Institute 
from the early days of the three laboratory departments to the present very much 
larger and more complex establishment we see an increasing emphasis on bio- 
chemistry and biophysics on the one hand together with a broadening of the 
biological research through endocrinology to more general aspects of experimental 
biology on the other. In these respects the development is a true reflexion of the 
general trend during the past thirty years of medical research in the laboratory, 
which must make ever wider demands on the whole field of scientific effort if it is 
to progress. The very increase in the range of work which has to be covered, with 
its accompaniment of multiplication of departments with differing immediate 
interests, adds weight to the principle laid down by the Council in the early days 
of the Institute that there must be the freest contact between the different 
laboratories. 

If the organization which I have depicted seems to some to be ill-defined and 
loose in structure, this is because of the pre-eminent importance which is attached to 
the principle of flexibility and avoidance of departmentalization. It is, indeed, in 
my view only with the strictest adherence to such a principle, and within the 




framework of the early declaration of research policy which I quoted at the beginning 
of this lecture, that we can hope for the Institute to continue to fulfil its proper 
function of producing new knowledge in science and medicine, and in doing 
so to live up to the achievements of the past, of which it may reasonably be 
proud. 


Kinetics of the base-catalyzed halogenation of some 
ketones and esters 

By R. P. Bell, F.R.S., E. Gelles and Eva Moller 
(Received 1 March 1949) 


Kinetic measurements have been made at 25° C on the halogenation of benzoylacetone, 
acetylacetone, ethyl acetoacetate, ethyl α-bromoacetoacetate and diethyl bromomalonate. 
A method is described for analyzing the kinetic data and obtaining the rates of substitution 
of successive halogen atoms without isolating the partly halogenated derivatives. 

The reaction velocities measured are all independent of the halogen concentration, and 
represent the rates of ionization of the ketonic substances in presence of basic catalysts. The 
results obtained conform in general to the regularities previously found for this type of 
reaction, but anomalies are found in some instances; these are related to the interaction of 
large substituent groups in both substrate and anion catalyst. The effect of bromine 
substitution in increasing the reaction velocity is shown to decrease as the reactivity of the 
ketone increases, and this is explained in terms of the charge distribution in the anion of 
the substrate. 


Introduction 

The original object of this work was to extend to benzoylacetone the investigations 
of base-catalyzed prototropy which have already been made with a series of ketonic 
substances (Bell & Lidwell 1940; Bell 1943; Bell, Everett & Longuet-Higgins 1946; 
Bell, Smith & Woodward 1948). However, measurements with this ketone led to 
two more general points of interest. In the first place a new method was devised for 
obtaining the rates of substitution of successive halogen atoms without isolating the 
partly halogenated derivatives, and this method was used to re-examine the bromina- 
tion of acetylacetone and acetoacetic ester. In this connexion measurements were 
also made on the bromination of a-bromoacetoacetic ester and bromomalonic ester. 
In the second place it was found that certain anion bases showed an unexpectedly 
great catalytic effect in the bromination of benzoylacetone and bromoacetylacetone, 
and the same anions were therefore investigated as catalysts in the bromination of 
acetoacetic ester. 

Measurements with acetylacetone and benzoylacetone 
(a) Treatment of successive reactions 

As with all the substrates mentioned in this paper, the rate of bromination of 
these two ketones is independent of the halogen concentration, and thus depends on 
the rate of ionization of the ketone in presence of the basic catalyst, usually an anion 



Kinetics of base-catalyzed halogenation of ketones and esters 309 

$\mathrm{B^-}$. In each case two of the hydrogens of the group -CO.CH2.CO- are eventually 
substituted by bromine, and the general kinetic scheme may be written as follows: 

$$
\begin{aligned}
\mathrm{RH_2 + B^-} &\xrightarrow{\;k_1\;} \mathrm{RH^- + HB} \\
\mathrm{RH^- + Br_2} &\longrightarrow \mathrm{RHBr + Br^-} \\
\mathrm{RHBr + B^-} &\xrightarrow{\;k_2\;} \mathrm{RBr^- + HB} \\
\mathrm{RBr^- + Br_2} &\longrightarrow \mathrm{RBr_2 + Br^-}
\end{aligned}
$$

In this scheme $\mathrm{RH_2}$ and $\mathrm{RHBr}$ represent the keto-forms of the ketonic substances 
concerned, and $k_1$ and $k_2$ are first-order velocity constants, the catalyst concentration 
remaining effectively constant. In previous work with acetylacetone (Bell & Lidwell 
1940) it was assumed that $k_2 \gg k_1$, leading to simple first-order kinetics, but we shall 
see that this assumption is not justified. The more general case ($k_1 \sim k_2$) has been 
treated by Pedersen (1933), who has also taken into account the equilibrium fraction 
of enol, $e$, present in the original ketone solution, which will be brominated 
instantaneously. The resulting kinetic expression is 

$$2(r-1)(c-\tfrac12 x)/c = (1-e)(2r-1)\,10^{-k_1 t} - (1-re)\,10^{-k_2 t}, \qquad (1)$$

where $c$ = initial total ketone concentration, 

$x$ = equivalents of bromine present at time $t$ in $\mathrm{RHBr} + \mathrm{RBr_2}$, 
$k_1$, $k_2$ = first-order constants in terms of decadic logarithms, 

$r = k_2/k_1$. 

This expression reduces to a simple first-order change only if $k_2 \gg k_1$ or $k_1 = 2k_2$. 

In other cases it should be possible in principle to deduce the values of $k_1$ and $k_2$ 
from the observed course of the bromination, but in practice this can only be done 
by a laborious process of trial and error, and the resulting values of $k_2$ are not of high 
accuracy (cf. Pedersen 1933, 1934). 
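
Equation (1) can be checked numerically against the kinetic scheme. The following is a minimal sketch, assuming that x counts one unit per bromine atom introduced (so that x = 2c at complete bromination) and that the enol fraction e is converted to RHBr at zero time; the function names and the numerical values are illustrative and are not taken from the paper.

```python
def uptake(t, c, e, k1, k2):
    """Bromine x taken up at time t for RH2 -> RHBr -> RBr2.

    k1 and k2 are decadic first-order constants; the enol fraction e
    is assumed to be present as RHBr at t = 0.
    """
    E1 = 10.0 ** (-k1 * t)
    E2 = 10.0 ** (-k2 * t)
    rh2 = c * (1 - e) * E1                                        # RH2 left
    rhbr = c * e * E2 + c * (1 - e) * k1 / (k2 - k1) * (E1 - E2)  # RHBr
    rbr2 = c - rh2 - rhbr                                         # RBr2
    return rhbr + 2 * rbr2


def eq1_sides(t, c, e, k1, k2):
    """Left- and right-hand sides of equation (1) at time t."""
    r = k2 / k1
    x = uptake(t, c, e, k1, k2)
    lhs = 2 * (r - 1) * (c - 0.5 * x) / c
    rhs = (1 - e) * (2 * r - 1) * 10 ** (-k1 * t) - (1 - r * e) * 10 ** (-k2 * t)
    return lhs, rhs


if __name__ == "__main__":
    for t in (0.0, 10.0, 40.0):
        print(t, eq1_sides(t, c=4e-4, e=0.17, k1=0.0074, k2=0.0100))
```

With k1 = 2k2 the first term on the right of equation (1) vanishes and the uptake follows a single exponential in k2, which is the second of the two special cases mentioned in the text.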

If it is possible to study separately the bromination of $\mathrm{RHBr}$, thus obtaining $k_2$, 
then $k_1$ can be obtained fairly easily from the observed bromination kinetics of $\mathrm{RH_2}$. 
This is most readily done by a process of interpolation. A value is assumed for $r$ 
($= k_2/k_1$), and values of the expression 

$$X = \log_{10}\{2(r-1)(c-\tfrac12 x)/c + (1-re)\,10^{-k_2 t}\} \qquad (2)$$

are plotted against $t$. If the correct value has been chosen for $r$, the resulting plot 
will be a straight line whose slope is numerically $k_1$ ($= k_2/r$), while an incorrect value 
for $r$ will give a line which may be noticeably curved, and will in any case have a slope 
differing from the value assumed for $k_1$. By taking a series of values for $k_1$ the correct 
value can be obtained by interpolation. (If $k_2$ is appreciably greater than $k_1$ an 
approximate value for $k_1$ can be obtained by plotting $\log_{10}(c-\tfrac12 x)$ against $t$.) 
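
The interpolation procedure can be sketched as follows. This is an illustration under stated assumptions, not the authors' working: the data are synthetic and free from error, the slope is taken by least squares rather than by eye, bisection stands in for graphical interpolation, and all names and numbers are hypothetical. The fitted slope of X is negative, so its sign is reversed before comparison with k2/r.

```python
import math


def uptake(t, c, e, k1, k2):
    """Bromine x taken up at time t for RH2 -> RHBr -> RBr2 (decadic constants)."""
    E1, E2 = 10 ** (-k1 * t), 10 ** (-k2 * t)
    rh2 = c * (1 - e) * E1
    rhbr = c * e * E2 + c * (1 - e) * k1 / (k2 - k1) * (E1 - E2)
    return rhbr + 2 * (c - rh2 - rhbr)


def slope(ts, ys):
    """Least-squares slope of ys against ts."""
    n = len(ts)
    mt, my = sum(ts) / n, sum(ys) / n
    num = sum((t - mt) * (y - my) for t, y in zip(ts, ys))
    return num / sum((t - mt) ** 2 for t in ts)


def mismatch(r, ts, xs, c, e, k2):
    """k1 read from the plot of X (equation (2)) minus the assumed k1 = k2/r."""
    X = [math.log10(2 * (r - 1) * (c - 0.5 * x) / c + (1 - r * e) * 10 ** (-k2 * t))
         for t, x in zip(ts, xs)]
    return -slope(ts, X) - k2 / r


def best_r(ts, xs, c, e, k2, lo, hi, steps=60):
    """Bisect between two trial values of r whose mismatches differ in sign."""
    f_lo = mismatch(lo, ts, xs, c, e, k2)
    for _ in range(steps):
        mid = 0.5 * (lo + hi)
        f_mid = mismatch(mid, ts, xs, c, e, k2)
        if (f_mid < 0) == (f_lo < 0):
            lo, f_lo = mid, f_mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


# Synthetic, error-free data (illustrative values, not measurements).
c, e, k2, r_true = 4e-4, 0.17, 0.0100, 1.35
ts = [4.0 * i for i in range(11)]                  # 0 to 40 sec
xs = [uptake(t, c, e, k2 / r_true, k2) for t in ts]
r_found = best_r(ts, xs, c, e, k2, 1.2, 1.5)
```

The quantity returned by `mismatch` corresponds to the difference between a value of k1 read from the plot and the value calculated from r and k2; the best r is that for which it vanishes.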

This procedure is satisfactory only if $r$ differs considerably from unity. If we put 
$r = 1$ in equation (2) the first term vanishes, and for any value of $r$ close to unity the 
plot of $X$ against $t$ is insensitive to the experimental values of $x$. In the range 
$0.7 < r < 1.3$ the best method is to write $r = 1 + \eta$ in equation (1) and to expand in 





310 R. P. Bell, E. Gelles and Eva Moller 

powers of $\eta$. Correct to terms in $\eta^2$ (which is sufficient for present purposes) equation 
(1) then becomes 

$$(c-\tfrac12 x)/c = 10^{-k_1 t}\{1 - \tfrac12 e + 1.15\,k_2 t\,(1 - e - \eta + \eta^2) - 1.33\,k_2^2 t^2\,\eta\,(1 - e - 2\eta + e\eta) + 1.02\,(1-e)\,k_2^3 t^3\,\eta^2\}. \qquad (3)$$

Defining an expression $Y$ by 

$$Y = \log_{10}\{(c-\tfrac12 x)/c\} - \log_{10}\{1 - \tfrac12 e + 1.15\,k_2 t\,(1 - e - \eta + \eta^2) - 1.33\,k_2^2 t^2\,\eta\,(1 - e - 2\eta + e\eta) + 1.02\,(1-e)\,k_2^3 t^3\,\eta^2\}, \qquad (4)$$

$Y$ is plotted against $t$ for a series of values of $\eta$. When the right value of $\eta$ is chosen, 
the resulting plot will be a straight line of slope $k_1 = k_2/r = k_2/(1+\eta)$. Examples 
of these procedures are given later in the paper. 
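
The series form can be compared with the exact expression numerically. The sketch below assumes the reading of equation (3) in which the factor outside the bracket is 10^(-k1 t) and the coefficients 1.15, 1.33 and 1.02 are the rounded values of 2.303/2, 2.303²/4 and 2.303³/12; the parameter values are illustrative only.

```python
def exact_fraction(t, e, k2, eta):
    """(c - x/2)/c from equation (1) with r = 1 + eta."""
    r = 1.0 + eta
    k1 = k2 / r
    num = (1 - e) * (2 * r - 1) * 10 ** (-k1 * t) - (1 - r * e) * 10 ** (-k2 * t)
    return num / (2 * (r - 1))


def series_fraction(t, e, k2, eta):
    """(c - x/2)/c from the expansion in powers of eta, correct to eta**2."""
    k1 = k2 / (1.0 + eta)
    u = k2 * t
    bracket = (1 - 0.5 * e
               + 1.15 * u * (1 - e - eta + eta ** 2)
               - 1.33 * u ** 2 * eta * (1 - e - 2 * eta + e * eta)
               + 1.02 * (1 - e) * u ** 3 * eta ** 2)
    return 10 ** (-k1 * t) * bracket


if __name__ == "__main__":
    for t in (0.0, 10.0, 20.0, 40.0):
        print(t, exact_fraction(t, 0.17, 0.01, 0.1), series_fraction(t, 0.17, 0.01, 0.1))
```

For |η| of about 0.1 and times within the first half of the reaction the two agree to a fraction of one per cent, the residual arising from the neglected terms in η³ and from the rounding of the coefficients.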

The monobromo-derivatives of acetylacetone and benzoylacetone are not well 
characterized, and no attempt was made to prepare them in a pure state. Their 
bromination in solution can, however, be studied by the following procedure. Both 
acetylacetone and benzoylacetone are sufficiently strong acids (pK ~ 9) to be 
converted almost completely to the enolate ion by a slight excess of alkali. Both 
the ion and the enol are known to react instantaneously with bromine, so that on 
adding an excess of bromine solution (if necessary containing buffer solution) to 
a solution of ketone containing a slight excess of alkali the monobromoketone is 
formed instantaneously and is then brominated further at a measurable rate, 
enabling $k_2$ to be measured directly. 

(b) Experimental procedure 

Acetylacetone was redistilled twice and had b.p. 139° C. Benzoylacetone had 
m.p. 67° C, unchanged by recrystallizing from aqueous alcohol. The acids used for 
preparing buffer solutions were either A.R. samples or were purified by recrystallization 
or distillation, their purity being checked by titration against standard sodium 
hydroxide solution which was in turn standardized against constant-boiling hydrochloric 
acid. The glycollate buffer solutions were made from recrystallized sodium 
glycollate. All solutions were made up with boiled-out distilled water using grade A 
volumetric apparatus. The automatic pipette used for delivering bromine solutions 
was of the type described by Bell, Lidwell & Vaughan-Jackson (1936). 

The kinetic measurements were carried out at 25 ± 0.01° C, and the ketone concentration 
in the reaction mixtures was about 4 × 10⁻⁴ throughout. Owing to the 
limited solubility of benzoylacetone in water some of the measurements with this 
ketone were made in solutions containing 3 % of alcohol or acetone, it being found 
that 7 % of organic solvent had no detectable effect upon the reaction velocity. In 
measuring the two-stage bromination process, 20 ml. of a solution was prepared 
containing ketone, buffer, and enough sodium chloride to make the ionic strength of 
the final solution up to 0.1. Approximately 1 ml. of 0.6 M-bromine in potassium 
bromide solution was transferred by an automatic pipette into a thin-walled bulb 
which rested inside the flask containing the ketone solution. When the whole had 
reached thermostat temperature the reaction was started by breaking the bulb 




while shaking the flask and simultaneously starting a stop-clock by depressing a key 
with the foot. After a suitable interval the reaction was stopped by adding a few 
drops of 10 % allyl alcohol solution, and the clock simultaneously stopped, recording 
the reaction time to the nearest 0.1 sec. The mono- and dibromoketone were then 
converted into iodine by adding a crystal of potassium iodide and 2 ml. 2N-hydrochloric 
acid; liberation of iodine was complete within a few minutes, and it was 
estimated by titration with N/100 or N/200 thiosulphate using a microburette and 
sodium starch glycollate indicator (Peat, Bourne & Thrower 1947). A series of 
experiments (normally ten to fifteen) with different times was carried out for each 
reaction mixture. The second stage of the reaction was followed in a similar manner, 
except that the ketone solution contained slightly over 1 equivalent of sodium 
hydroxide. The anion catalyst was contained in the same solution, and the admixture 
of the bromine solution (containing, if necessary, some strong acid) produced a 
buffer solution of suitable ratio. Experiments with varying concentrations of strong 
acid (and hence no basic catalyst other than water) gave identical results. There is 
therefore no detectable catalysis by acids, and the buffer ratio in the experiments 
with buffer solution is not in itself important, though it must be known in order to 
calculate the correction due to hydrolysis of the bromine (see below). 

The above technique gave reliable results with reaction times as low as 2 sec., and 
the half-times of the reactions studied varied between about 10 and 150 sec. The 
observed titres for a typical experiment (the water reaction for acetylacetone) are 
shown in figure 1. It will be seen that the titres for long reaction times do not approach 
closely to the theoretical end-point, and in fact show a tendency to decrease again. 
Benzoylacetone behaved in the same manner, though to a less marked extent, and 
a similar decrease in reactive halogen content has been observed in other cases (Bell 
& Lidwell 1940; Pedersen 1933, 1934); it is probably due to hydrolytic fission of the 
halogenated ketone. This complication does not affect the earlier part of the reaction, 
and in calculating velocity constants only the first half of the reaction was used for 
acetylacetone, and the first two-thirds for benzoylacetone. 

In the overall bromination experiments the extrapolated initial titres agree closely 
with the equilibrium enol contents of the aqueous solutions (17 % for acetylacetone, 
34 % for benzoylacetone; see Eidinoff 1945), assuming one bromine atom to be 
introduced instantaneously. When the second stage of bromination was being studied 
the initial value corresponded closely with 100 % enol, and was independent of the 
interval (up to 20 min.) which elapsed between making the solution alkaline and 
adding the bromine solution, thus showing that the alkali had not caused any 
appreciable fission of the ketone. In either case, if $v_\infty$ is the calculated titre for 
complete bromination, and $v_t$ the observed titre at time $t$, it was found that 
$\log_{10}(v_\infty - v_t)$ is a linear function of $t$. This is illustrated by figure 2, which gives 
four specimen plots for experiments with acetylacetone in monochloracetate buffers. 
The arrows indicate calculated initial titres. For the second stage of the reaction the 
slope of these plots gives $k_2$ directly, and the values of $k_2$ thus obtained are a linear 
function of the anion concentration, as shown in figure 3. For the overall reaction, 
however, the slope has no simple significance, and the data must be treated by the 
method described in the last section. The values of the ratio $r$ differ sufficiently from 
unity to justify the 



application of the first method described (cf. equation (2)). As an example of the 
results obtained the data for monochloracetate solutions are given in table 1. In 
this and in all subsequent tables the values of $c$, the anion concentration, have been 



[Figure 1: abscissa, time (min.)]

[Figure 2: abscissa, time (sec.)]


corrected for the acid produced and for the hydrolysis of bromine as described in 
earlier papers (Bell et al. 1948, p. 483). The values of $k_2$ are interpolated from the 
plot shown in figure 3. 



[Figure 3]


In the experiments with monochloracetate solutions recorded in table 1 a considerable 
proportion of the observed reaction is due to water catalysis, and the observed 
value of $r$ is therefore intermediate between the value for water ($r_w = 1.35$) and the 
value for catalysis by monochloracetate. If $k_2^{w}$ and $k_2$ are the observed velocity 
constants for the bromination of bromoacetylacetone in water and in monochloracetate 
solution respectively, the value of the ratio $r$ obtained by the above procedure 
is given by 

$$\frac{k_2}{r} = \frac{k_2^{w}}{r_w} + \frac{k_2 - k_2^{w}}{r_A}, \qquad (5)$$

whence the value of $r_A$, the ratio for catalysis by the anion (given in the last column of 
table 1), can be obtained in terms of measured quantities. 
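
The separation of the water and anion contributions can be illustrated with the figures of table 1. The sketch assumes only that the two contributions to each velocity constant are additive, so that the anion ratio is the quotient of the anion parts of k2 and k1; the function name is hypothetical, and the inputs are the values of 10⁴k2 and the best r as read from the table.

```python
def anion_ratio(k2_obs, r_obs, k2_water, r_water):
    """Ratio k2/k1 for the anion alone, the water reaction being subtracted.

    k1 is not measured directly; it is taken as k2/r for both the buffer
    solution and the water reaction.
    """
    k1_obs = k2_obs / r_obs
    k1_water = k2_water / r_water
    return (k2_obs - k2_water) / (k1_obs - k1_water)


# (10^4 k2, best r) for the three monochloracetate solutions of table 1;
# water reaction: 10^4 k2 = 100, r = 1.35.
rows = [(150.0, 1.5), (250.0, 1.6), (430.0, 1.6)]
ratios = [anion_ratio(k2, r, 100.0, 1.35) for k2, r in rows]
mean_ratio = sum(ratios) / len(ratios)
```

Rounded to one decimal place these inputs give 1.9, 1.8 and 1.7, with a mean of 1.8.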

The data for the other anions are similar, and only a summary will be given (table 
2). The catalytic constants ($k_c$) for bromoacetylacetone were obtained from the plot 
of $k_2$ against $c$ (see figure 3), and those for acetylacetone derived by using the values 
of $r$ calculated as in table 1. To facilitate comparison with the results of Bell & 
Lidwell (1940) the catalytic constants are expressed in l./mole min., i.e. the velocity 
constants of table 1 have been multiplied by 2.303 × 60. The catalytic constants for 





bromoacetylacetone are probably not more accurate than ± 6 % because of the 
very high rates studied, and those for acetylacetone will be subject to somewhat 
greater errors in view of the indirect method used for deriving them. 
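
The change of units is a matter of simple arithmetic, shown here for the water reaction. The sketch takes 10⁴k2 = 100 and r = 1.35 from table 1 and applies the factor 2.303 × 60 quoted in the text; the names are illustrative.

```python
LN10 = 2.303  # factor used in the text for decadic -> natural logarithms


def per_minute_natural(k_decadic_per_sec):
    """Convert a decadic first-order constant in sec^-1 to natural units, min^-1."""
    return k_decadic_per_sec * LN10 * 60.0


k2_water = per_minute_natural(100e-4)          # bromoacetylacetone, water reaction
k1_water = per_minute_natural(100e-4 / 1.35)   # acetylacetone, taking k1 = k2/r
```

The results, about 1.38 and 1.02, agree to within rounding with the entries for water in table 2.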

Table 1. Monochloracetate catalysis in the bromination of acetylacetone

c = anion concentration, buffer ratio (acid/anion) = 0.60.
k1, k2 = first-order constants, sec.⁻¹, log10.
10⁴k1: (a) calculated from r and k2, (b) from plot of equation (2).

                     value          10⁴k1
                    assumed     -------------             best value    r for
  c       10⁴k2     for r       (a)      (b)     diff.      of r        anion

  0        100       1.2         83       80      -3
                     1.3         77       76      -1        1.35
                     1.4         71       72      +1
                     1.5         67       70      +3

  0.018    150       1.3        115      106      -9
                     1.4        107      103      -4        1.5          1.9
                     1.5        100      100       0

  0.055    250       1.3        193      167     -26
                     1.4        179      172      -7
                     1.5        167      163      -4        1.6          1.8
                     1.6        156      155      -1
                     1.7        147      152      +5

  0.116    430       1.6        268      269      +1        1.6          1.7

                                                                  mean = 1.8


Table 2

                              catalytic constant for
                         ---------------------------------
  catalyst               acetylacetone  bromoacetylacetone    ratio

  water                       1.03*           1.39*            1.35
  chloroacetate              20              36.6              1.8
  m-nitrobenzoate            45             196
  glycollate                 56             100                1.8
  β-chloropropionate         77             225                2.9
  phenylacetate              90             400                4.5
  benzoate                   96             280                2.9
  acetate                   160             270                1.7
  β-phenylpropionate        180             780                4.2
  propionate                190             370                1.9
  trimethylacetate          220             720                3.3
  isovalerate               250             600                2.35

  * Velocity constant in slightly acid solution.



The bromination of acetylacetone has been previously studied by Bell & Lidwell 
(1940), but their results are less accurate than those obtained here, since they 
assumed that the second bromine atom is introduced much more quickly than the 




first. In our present units their values for $k_1$ were: water 1.37, monochloracetate 19, 
glycollate 47, acetate 154, trimethylacetate 254. Our present values differ somewhat 
from these, but do not involve any change in their general conclusions. 



[Figure 4]


Table 3. Catalytic constants for the bromination of benzoylacetone
(l./mole min.)

  catalyst                 k_c        catalyst                 k_c

  water                    0.66*      phenylacetate            207
  monochloracetate        22          acetate                  222
  glycollate              75          trimethylacetate         400
  β-chloropropionate     141          isovalerate              420
  m-nitrobenzoate        144          β-phenylpropionate       680
  benzoate               146

  * Velocity constant in slightly acid solution.


In all the experiments with benzoylacetone the plot of $\log_{10}(v_\infty - v_t)$ against $t$ for 
the overall reaction had the same slope as the corresponding plot for the second 
stage of the reaction. This is illustrated by figure 4, which shows the slope $k$ as a 
function of anion concentration $c$ for three catalysts. Most of the points represent 
the overall reaction, but for glycollate catalysis the solid circles refer to the second 
stage; similar results were obtained for the other catalysts, where the points for the 
two stages lie too closely together to be plotted separately. 




From equation (1) this identity of slopes can arise only if $k_2 = \tfrac12 k_1$, i.e. $r = \tfrac12$. This 
appears to be so within experimental error for all the catalysts studied, and we have 
therefore tabulated only the catalytic constants for benzoylacetone (table 3); those 
for bromobenzoylacetone are half as great. 


Measurements with ethyl acetoacetate, ethyl 
α-bromoacetoacetate and other substrates 


The bromination of ethyl acetoacetate has been studied by Pedersen (1933, 1934), 
who concluded from a laborious analysis of the rate of overall bromination that the 
α-bromoester was brominated about 12 times as fast as the original ester, this ratio 
varying little from one catalyst to another. He considered the possibility of bromination 
experiments with the α-bromoester, but stated that it was insufficiently stable 
for quantitative measurements. We have made further measurements on the rate 
of bromination of ethyl α-bromoacetoacetate, partly by preparing it in situ and 
partly by using the pure ester; our results differ considerably from those of Pedersen. 

Ethyl acetoacetate is a considerably weaker acid than acetylacetone or benzoylacetone, having a dissociation constant of 2 × 10⁻¹¹ (Eidinoff 1945), and it therefore requires a very large excess of sodium hydroxide to convert it completely to the enolate ion. If bromination experiments are carried out with these solutions the initial titres are much less than those calculated for 100 % conversion to enol; this is due to the alkaline cleavage of the acetoacetic ester, and precludes the direct use of the method described in the last section for measuring the rate of bromination of the monobromoester. It is, however, possible to prepare sufficiently stable solutions with hydroxyl-ion concentration up to about 6 × 10⁻³, in which the enol content is known either by calculation or by extrapolating the observed titres to zero time.
If the fraction of enol thus found is $E$, the course of the bromination is given by equation (1) as

$$\frac{r_\infty - r_t}{r_\infty} = \frac{(1-E)(2r-1)}{2(r-1)}\,10^{-k_1 t} + \frac{Er-1}{2(r-1)}\,10^{-k_2 t}, \tag{6}$$

where $r_t$ is the titre after time $t$. For the bromination of acetoacetic ester to which no alkali had been added, the corresponding equation is

$$\frac{r_\infty - r'_t}{r_\infty} = \frac{2r-1}{2(r-1)}\,10^{-k_1 t} - \frac{1}{2(r-1)}\,10^{-k_2 t}, \tag{7}$$

neglecting the very small proportion of enol (0.4 %) in the aqueous solution. Combining equations (6) and (7) we find

$$\{(r_\infty - r_t) - (1-E)(r_\infty - r'_t)\}/r_\infty = \tfrac{1}{2}E\,10^{-k_2 t}, \tag{8}$$

so that a plot of $\log_{10}\{(r_\infty - r_t) - (1-E)(r_\infty - r'_t)\}$ against $t$ should be a straight line of slope $k_2$.
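The cancellation behind this straight-line plot can be checked numerically. The following is a minimal sketch, assuming two consecutive first-order brominations (ester to monobromoester to dibromoester) with decadic constants `k1` and `k2`, and assuming that the enolate fraction is brominated instantly at zero time; the function names and the value of `E` are illustrative and are not taken from the paper.

```python
import math

def remaining_fraction(t, k1, k2, enol_fraction):
    """Return (r_inf - r_t) / r_inf for ester -> monobromo -> dibromo.

    Assumptions: both steps are first order with decadic constants k1, k2
    (decay as 10**(-k*t)), and a fraction `enol_fraction` of the ester is
    converted to the monobromoester instantly at t = 0.
    """
    ester = (1.0 - enol_fraction) * 10 ** (-k1 * t)
    monobromo = (
        enol_fraction * 10 ** (-k2 * t)
        + (1.0 - enol_fraction) * k1 / (k2 - k1)
        * (10 ** (-k1 * t) - 10 ** (-k2 * t))
    )
    # Each ester molecule still needs 2 Br2, each monobromoester 1 Br2;
    # the final titre corresponds to 2 Br2 per molecule.
    return (2.0 * ester + monobromo) / 2.0

def combined(t, k1, k2, enol_fraction):
    """Left-hand side of the combined relation: alkaline run minus (1 - E) x plain run."""
    with_alkali = remaining_fraction(t, k1, k2, enol_fraction)
    without_alkali = remaining_fraction(t, k1, k2, 0.0)
    return with_alkali - (1.0 - enol_fraction) * without_alkali

# Illustrative constants of the magnitude quoted in the text (min^-1, log10).
K1, K2, E = 0.030, 0.16, 0.5

# Slope of log10(combined) per minute over the first three minutes.
slopes = [
    math.log10(combined(t + 1.0, K1, K2, E)) - math.log10(combined(t, K1, K2, E))
    for t in (0.0, 1.0, 2.0)
]
```

Under these assumptions every entry of `slopes` equals `-K2`, whatever value is chosen for `K1`, which is why the combined plot isolates the second-stage constant.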





expt.   initial ester      concentration of     % enolate
        concentration      NaOH added           calc.    obs.
1       3.56 × 10⁻⁴        8.1 × 10⁻⁴           54       50
2       3.65 × 10⁻⁴        56.0 × 10⁻⁴          90       88
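The calculated enolate percentages can be approximated from the acid-base equilibrium alone. This is a rough sketch, assuming concentrations in mole/l., a dissociation constant of 2 × 10⁻¹¹ for the ester, an ionic product of water of 10⁻¹⁴ (an assumed value, not given in the paper) and unit activity coefficients; `enolate_fraction` is a hypothetical helper name.

```python
import math

def enolate_fraction(ester, hydroxide, k_acid=2e-11, k_water=1e-14):
    """Fraction of ester present as enolate after adding hydroxide.

    Solves x / ((ester - x) * (hydroxide - x)) = k_acid / k_water for the
    enolate concentration x (the physically meaningful smaller root).
    """
    equilibrium = k_acid / k_water
    a = equilibrium
    b = -(equilibrium * (ester + hydroxide) + 1.0)
    c = equilibrium * ester * hydroxide
    enolate = (-b - math.sqrt(b * b - 4.0 * a * c)) / (2.0 * a)
    return enolate / ester

# Concentrations as read from the two experiments listed above.
fraction_1 = enolate_fraction(3.56e-4, 8.1e-4)
fraction_2 = enolate_fraction(3.65e-4, 56.0e-4)
```

With these assumed constants the two fractions come out a little above one-half and a little above nine-tenths, of the same order as the calculated column.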




Kinetics of base-catalyzed halogenation of ketones and esters

The acetoacetic ester was purified by two distillations under reduced pressure, and the technique used was exactly the same as for benzoylacetone and acetylacetone. The bromine solution contained enough acid to neutralize all the alkali present, so that the value of $k_2$ represents the rate of bromination of α-bromoacetoacetic ester in the absence of catalysts other than the water molecule. Some cleavage of acetoacetic ester or its bromination products takes place in the later stages of the bromination, and the titres for long reaction times fall short of the theoretical value. However, this complication does not affect the first 2 to 3 min. of the reaction, and figure 5 shows the plots of $\log_{10}\{(r_\infty - r_t) - (1-E)(r_\infty - r'_t)\}$ against $t$ for the two experiments listed above. The plots are linear and parallel within the experimental error. In further confirmation of the value of $k_2$ we have carried out experiments with pure α-bromoacetoacetic ester, prepared by the method of Brühl (1903) and purified by distillation under reduced pressure (b.p. 97 to 98 °C/8 mm.). An aqueous solution prepared from freshly distilled ester liberated the theoretical amount of iodine from potassium iodide, and the titre remained constant for some hours, after which it fell slowly, probably because of the transformation of α-bromoester into γ-bromoester (see Macbeth 1923; Kharasch, Sternfeld & Mayo 1937). The bromination experiments were carried out with freshly prepared solutions and lasted only a few minutes. The plot marked 3 in figure 5 refers to such an experiment in slightly acid solution; it is parallel to the other two plots within experimental error.



The slope of the lines in figure 5 corresponds to $k_2 = 0.16$ (min.⁻¹, log₁₀) compared with the value of 0.38 given by Pedersen (1933). We are unable to account for this discrepancy, especially since later work by the same author using a different experimental technique (Pedersen 1948) also gives … . However, the method of calculation used by Pedersen is insensitive to variations in $k_2$, and may have been affected by a side-reaction taking place to a small extent. In any case the value obtained for $k_1$ (corresponding to the introduction of the first bromine atom into acetoacetic ester) will depend very little on whether $k_2/k_1 \approx 5$ (as we have found) or $k_2/k_1 \approx 10$ (as given by Pedersen); in either case a direct plot of $\log_{10}(r_\infty - r_t)$ against $t$ for the overall reaction will give $k_1$ without serious error. Thus for the bromination in acid solution we find by this method $k_1 = 0.030$, while Pedersen gives $k_1 = 0.0312$.

In investigating the effect of anion catalysts we have therefore not attempted to evaluate $k_2$, but have determined $k_1$ (and the corresponding catalytic constants) from logarithmic plots of the overall bromination rate. The results obtained are given in table 4.


Table 4. Catalytic constants for the bromination of ethyl acetoacetate
(l./mole, min.)

catalyst             k_b        catalyst              k_b
water                0.072*     propionate            32
monochloracetate     2.28†      β-phenylpropionate    34
m-nitrobenzoate      6.5        isobutyrate           39
glycollate           8.0        isovalerate           41
benzoate             15         trimethylacetate      43
phenylacetate        20         n-valerate            44
acetate              29†

* Velocity constant in slightly acid solution. † Values due to Pedersen (1934).

For comparison with α-bromoacetoacetic ester some experiments were also made with bromomalonic ester. Previous work (Bell et al. 1946) had indicated that this ester was brominated considerably more rapidly than malonic ester itself, but no quantitative data were available. Bromomalonic ester was prepared as described in Organic Syntheses (Collective vol. 1, p. 240) and had b.p. 121 to 122 °C/14 mm. Bromination experiments were carried out in slightly acid solutions containing about 4 × 10^… M ester, using the technique described by Bell et al. (1946). The plots of $\log_{10}(r_\infty - r_t)$ against $t$ were linear over approximately the first half of the reaction, and two experiments gave for the first-order constant (in min.⁻¹, log₁₀) $10^3 k = 8.0$ and 7.7; mean $k = 7.9 \times 10^{-3}$. This represents the velocity of the water-catalyzed reaction, and the corresponding velocity for malonic ester itself is 6.3 × 10⁻⁴.


Discussion

In previous work on base-catalyzed reactions relations have been found to exist between the catalytic constant of an anion and its basic strength, conveniently measured by $1/K_a$, where $K_a$ is the dissociation constant of the corresponding acid. For halogenation reactions these relations have only been tested for a very limited number of catalysts, and it is of interest to see how far they are valid for the more extensive range of catalysts studied here. Figure 6 shows a plot of $\log_{10} k_b$ against $\log_{10} K_a$ for the bromination of acetoacetic ester and acetylacetone (see tables 2 and 4), and it will be seen that the relation is obeyed reasonably well in each case. This is not the case in the bromination of bromoacetylacetone and benzoylacetone (see figure 7), where there is a general trend towards an increase of $k_b$ with decreasing $K_a$ but no quantitative relation.




In seeking an explanation of these divergencies, it is noteworthy that the points for catalysis by acetate, glycollate and monochloracetate ions lie accurately on a straight line for each of the substrates studied, while the remaining catalysts, which contain large groups (alkyl, aryl or bromo-groups) attached to the carbonyl, all give reaction velocities greater than those interpolated from this line. These positive deviations are detectable for acetoacetic ester and acetylacetone, and become large (up to nearly 300 %) for bromoacetylacetone and benzoylacetone, which also contain large groups. It therefore seems reasonable to attribute such deviations to the proximity of a large group in the catalyst to a similar group in the substrate, which will affect the transition state of the reaction, but will not contribute anything to the dissociation constant which is used as a measure of the basic strength of the catalyst. There will be some direct interaction between the two groups, but a more important factor is probably the necessity of making a cavity in the solvent, thereby doing away with some of the interactions between the water molecules. When the two groups are close together they will cause the separation of fewer water molecules than when they are apart, and this factor will tend to make the transition state more stable. The order of magnitude of such an effect can be estimated from the work of Butler (1937) on the solubilities of homologous series of molecules in water: he found that the addition of each CH2 group caused on the average an increase of 160 cal./mole in the energy of solution, and of 1570 cal./mole in the heat of solution. An effect of this kind would therefore be quite adequate to account for the increases of reaction velocity referred to above, the largest of which corresponds to a decrease of 800 cal./mole in the free energy of the transition state.

[Figure 7]

Key to figures 6 and 7.

1. monochloracetate      6. phenylacetate         11. isovalerate
2. m-nitrobenzoate       7. acetate               12. trimethylacetate
3. glycollate            8. propionate            13. n-valerate
4. β-chloropropionate    9. β-phenylpropionate
5. benzoate             10. isobutyrate
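The correspondence between a rate increase of nearly 300 % and a free-energy change of about 800 cal./mole follows from the usual exponential dependence of rate on activation free energy. The sketch below assumes a temperature of 25 °C and the gas constant in cal./mole K; both helper names are illustrative.

```python
import math

GAS_CONSTANT_CAL = 1.987  # cal / (mole K)

def free_energy_lowering(rate_ratio, temperature_k=298.15):
    """Lowering of transition-state free energy (cal/mole) for a given rate ratio."""
    return GAS_CONSTANT_CAL * temperature_k * math.log(rate_ratio)

def rate_ratio_for(lowering_cal, temperature_k=298.15):
    """Rate ratio produced by a given lowering of transition-state free energy."""
    return math.exp(lowering_cal / (GAS_CONSTANT_CAL * temperature_k))

# A 300 % increase means the rate is 4 times the interpolated value.
lowering_for_fourfold = free_energy_lowering(4.0)

# Conversely, the ratio implied by 800 cal/mole.
ratio_for_800 = rate_ratio_for(800.0)
```

A fourfold rate gives a little over 800 cal./mole, and 800 cal./mole gives a ratio slightly below 4, consistent with "nearly 300 %".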

It seems justifiable to consider that catalysis by the acetate, glycollate and chloracetate ions does not involve effects of this kind, since they do not contain large groups, and they conform accurately to an equation of the type

$$k_b = G\,(1/K_a)^{\alpha} \tag{9}$$


for each reaction studied. We have therefore used the results for these three ions (and for water catalysis) in making a comparison between the various ketonic substances whose halogenation has been studied. Previous work has shown that there is a correlation between the value of α in equation (9) and the reactivity of the substrate. The latter is conveniently measured by $R$, which is a measure of the rate of ionization of the ketone in presence of unit concentration of a catalyst with $K_a = 10^{-4}$, corrected for statistical differences between the different ketones. In previous papers $R$ has not always been given in the same units, and we have therefore collected all the available data in table 5, in which $R$ is in l./mole, min. We have also modified the method of making a statistical correction for the presence of more than one ionizable hydrogen in the substrate molecule. In previous papers (Bell & Lidwell 1940; Bell 1943) several hydrogen atoms on the same carbon were reckoned as one hydrogen, while in table 5 they are counted separately; e.g. the velocity for acetone has been divided by 6 rather than by 2. We have made this modification because bromobenzoylacetone has been found to brominate only half as fast as benzoylacetone itself, and it seems unlikely that the introduction of a bromine atom will decrease the tendency to ionize. The choice of statistical correction does not affect any of the main conclusions drawn. The values of α and $R$ for α-bromoacetoacetic ester … described in the preceding section makes it probable that the values given by Pedersen (1933) for … values for acetylacetone have been substituted for the earlier values of Bell & Lidwell. It will be seen that the decrease in α with increasing reactivity, originally suggested on the basis of less complete data, is fully borne out by the fuller and more accurate results now available. The only marked exception to this regularity is now acetoacetic acid, and the value of α for this substance is not known with certainty, being based on two catalysts only, and on extrapolation from 18 to 25 °C. On the other hand, the explanation originally advanced to explain the decrease in α (Lidwell & Bell 1940) is probably not wholly

Table 5

R = catalytic constant in l./mole, min.; v0 = water velocity in min.⁻¹ (both corrected statistically)

substrate                        log10 R    α       log10 v0    source
CH3COCH3                         −6.78      0.88    −8.33       a
CH3COC6H5                        −6.07      –       –           b
CH3COCH2CH2COCH3                 −5.46      0.89    −7.45       a
CH3COCH2Cl                       −3.61      0.82    −6.79       a
CH3COCH2Br                       −3.25      0.82    −5.10       a
CH3COCHCl2                       −2.00      0.82    −4.36       a
CH2(COOC2H5)2                    −0.76      0.79    −3.14       c
CH3COCH2COOC2H5                  +0.72      0.59    −1.44       a
CO(CH2)3CHCOOC2H5 (cyclic)       +1.18      0.68    −1.86       d
CHBr(COOC2H5)2                   –          –       −1.74       e
CH3COCH2COC6H5                   +1.33      0.62    −0.48       e
CH3COCHBrCOC6H5                  +1.33      0.52    −0.48       e
CH3COCHBrCOOC2H5                 –          –       −0.46       e
CH3COCH2COCH3                    +1.54      0.48    −0.29       e
CH3COCHBrCOCH3                   +2.04      0.42    +0.11       e
CH3COCH2COOH                     +2.27      0.48    +0.30       a

References: a, Bell & Lidwell 1940; b, Bell 1943; c, Bell et al. 1946; d, Bell et al. 1948; e, present paper.


correct. This explanation assumed that the potential energy curves for proton-transfer had the same shape over the whole range of ketones concerned, and that the varying value of α corresponded to the change of slope on moving up and down this curve. It was difficult to account on this basis for the large changes in α found experimentally, and more recently (Bell & Higginson 1949) both experimental and theoretical evidence has been presented to show that the shape of the potential energy curves will depend on the extent to which a change of electronic structure takes place on the transfer of a proton. This will account for the low values of α found for the β-diketones and ketonic esters in table 5, for which the charge in the anion is spread fairly evenly over two oxygen atoms. It is of course this same spread of charge (or mesomerism) which makes these ketones relatively strong acids and gives them high rates of ionization.
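The way α and log₁₀ R follow from the catalytic constants can be illustrated for ethyl acetoacetate with the three "small" anions of table 4. This is only a sketch: the pK values of the parent acids are approximate literature values supplied here as assumptions (they are not quoted in this section), the reference catalyst is taken to have a dissociation constant of 10⁻⁴, and the only statistical correction applied is division by the two hydrogens of the CH2 group.

```python
import math

# (catalyst, k_b from table 4 in l./mole min., ASSUMED pK of the parent acid)
POINTS = [
    ("monochloracetate", 2.28, 2.86),
    ("glycollate", 8.0, 3.83),
    ("acetate", 29.0, 4.76),
]
ACTIVE_HYDROGENS = 2  # statistical correction for the CH2 group

def bronsted_fit(points, hydrogens):
    """Least-squares line log10(k_b / hydrogens) = log_g + alpha * pK."""
    xs = [pk for _, _, pk in points]
    ys = [math.log10(k / hydrogens) for _, k, _ in points]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    s_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    s_xx = sum((x - mean_x) ** 2 for x in xs)
    alpha = s_xy / s_xx
    log_g = mean_y - alpha * mean_x
    return alpha, log_g

alpha, log_g = bronsted_fit(POINTS, ACTIVE_HYDROGENS)

# Value of the fitted line for a catalyst with dissociation constant 1e-4.
log_r = log_g + alpha * 4.0
```

Under these assumptions the fitted slope and the value at pK 4 fall close to the entries for CH3COCH2COOC2H5 in table 5.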

Other points of structural interest emerge from table 5, notably the effect on the ionization velocity of substituting one hydrogen by bromine in the active >CH2 group. The data are most complete for the water reaction, though essentially the same picture is given by the results for anion catalysis. The relevant figures are given in table 6. The effect of bromine substitution decreases in a striking manner as the reactivity of the ketone increases, and is relatively small in the β-diketones. This can again be related to the spread of charge in the anions of the latter. If the bromine can increase the stability of the anion by the attraction between the negative charge and the positive end of the C⁺-Br⁻ dipole, then this stabilization will be diminished when the charge is removed almost completely from the carbon by being spread more or less evenly over two oxygens.

Table 6. Effect of bromine substitution
r′ = (v0 for bromoketone)/(v0 for unsubstituted ketone), using values corrected statistically.

ketone               v0              r′
acetone              4.5 × 10⁻⁹      1700
diethylmalonate      7 × 10⁻⁴        25
acetoacetic ester    3.6 × 10⁻²      10
benzoylacetone       3.3 × 10⁻¹      1
acetylacetone        5 × 10⁻¹        2.7
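The ratios r′ can be recomputed from the statistically corrected water velocities of table 5. The following short sketch uses the log₁₀ v0 entries as read from that table; the dictionary layout and function name are illustrative only.

```python
# (log10 v0 of unsubstituted ketone, log10 v0 of bromoketone), from table 5
LOG_V0 = {
    "acetone": (-8.33, -5.10),
    "diethylmalonate": (-3.14, -1.74),
    "acetoacetic ester": (-1.44, -0.46),
    "benzoylacetone": (-0.48, -0.48),
    "acetylacetone": (-0.29, 0.11),
}

def bromine_effect(log_parent, log_bromo):
    """Ratio r' of the water velocities of bromoketone and parent ketone."""
    return 10 ** (log_bromo - log_parent)

ratios = {name: bromine_effect(*pair) for name, pair in LOG_V0.items()}
```

The recomputed ratios fall steeply from about 1700 for acetone to values near unity for the β-diketones; small differences from the rounded figures in table 6 reflect the two-decimal logarithms used here.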


Our thanks are due to the International Federation of University Women for 
a grant to one of us (E.M.). 


References

Bell, R. P. 1943 Trans. Faraday Soc. 39, 253.
Bell, R. P., Everett, D. H. & Longuet-Higgins, H. C. 1946 Proc. Roy. Soc. A, 186, 443.
Bell, R. P. & Higginson, W. C. E. 1949 Proc. Roy. Soc. A, 197, 141.
Bell, R. P., Lidwell, O. M. & Vaughan-Jackson, M. W. 1936 J. Chem. Soc. p. 1794.
Bell, R. P. & Lidwell, O. M. 1940 Proc. Roy. Soc. A, 176, 88.
Bell, R. P., Smith, R. D. & Woodward, L. A. 1948 Proc. Roy. Soc. A, 192, 479.
Brühl, J. W. 1903 Ber. dtsch. chem. Ges. 36, 1731.
Butler, J. A. V. 1937 Trans. Faraday Soc. 33, 229.
Eidinoff, M. L. 1945 J. Amer. Chem. Soc. 67, 2073.
Kharasch, M. S., Sternfeld, E. & Mayo, F. R. 1937 J. Amer. Chem. Soc. 59, 1655.
Lidwell, O. M. & Bell, R. P. 1940 Proc. Roy. Soc. A, 176, 114.
Macbeth, A. K. 1923 J. Chem. Soc. p. 1122.
Peat, S., Bourne, E. J. & Thrower, R. D. 1947 Nature, 159, 811.
Pedersen, K. J. 1933 J. Phys. Chem. 37, 751.
Pedersen, K. J. 1934 J. Phys. Chem. 38, 601, 999.
Pedersen, K. J. 1948 Acta Chem. Scand. 2, 252.



The catalytic hydrogenation of methyl elaeostearate, and of 
mixtures of elaeostearic with other polyethenoid 
long-chain esters 

By T. P. Hilditch, F.R.S., and S. P. Pathak

Department of Industrial Chemistry, University of Liverpool 
{Received 16 March 1949) 


The hydrogenation (using Raney nickel catalyst at 110 and 170 °C) of the conjugated triene system present in methyl elaeostearate (octadeca-9,11,13-trienoate) has been compared with that of the pentadiene systems -CH:CH.CH2.CH:CH- present in methyl linolenate and methyl linoleate.

The course of the hydrogenation action in methyl elaeostearate itself has been followed spectrophotometrically, and it has been established that, for the most part, the primary action is the simultaneous addition of two molecules of hydrogen to one molecule of elaeostearate and formation of very large proportions of octadec-11-enoic esters. Production of stearate does not occur to any marked extent until over 80 % of elaeostearate has been converted to mono-ethenoid ester.

The selectivity of the hydrogenation process has been studied in the cases of equimolecular binary mixtures of methyl elaeostearate with, respectively, methyl oleate, linoleate or linolenate. In a mixture of methyl elaeostearate and oleate, hydrogenation of the latter does not set in until nearly all the elaeostearate has been converted to octadecenoate. With a mixture of methyl elaeostearate and linoleate, the diene ester remains almost unattacked until 60 % or more of the conjugated triene ester present has been converted into mono-ethenoid ester, after which it also commences to be transformed into mono-ethenoid esters. In a mixture of methyl elaeostearate and linolenate, both esters undergo hydrogenation from the outset, but conjugated triene ester disappears about twice as rapidly as the non-conjugated isomer. The activity to hydrogen of the double pentadiene system

-CH:CH.CH2.CH:CH.CH2.CH:CH-

is thus comparable with that of the conjugated triene group in the elaeostearate molecule.

The hydrogenation of methyl linoleate in presence of Raney nickel either at 110 or 170 °C is extremely selective, no methyl stearate being produced until over 90 % of the linoleate has been transformed into octadecenoates.


Introduction

The phenomenon of selective hydrogenation (the preferential attachment of hydrogen in presence of a catalyst to one of two or more centres of unsaturation) has been recognized for many years, both in compounds belonging to the terpene series and in the long-chain unsaturated acyl groups present in the mixed glycerides of the fats (for a summary of the earlier observations, see Armstrong & Hilditch 1925). In the aliphatic series, pronounced selectivity of the hydrogenation process was first observed in certain vegetable fatty oils (H. K. Moore, Richter & van Arsdel 1917; C. W. Moore & Hilditch 1923), in which the linoleic glycerides pass almost wholly to the mono-ethenoid state before any saturated stearic glycerides are formed. Richardson, Knuth & Milligan (1925) reported that the multi-ethenoid glycerides present in whale and other marine animal oils behave differently, in that when saturated glycerides have been produced in quantity, the remaining unsaturated glycerides still contain notable amounts of di-ethenoid compounds; this was later confirmed by Hilditch & Terleski (1937) and Harper & Hilditch (1937).

This seeming difference in behaviour remained unexplained until a detailed study of the hydrogenation of the triene methyl linolenate (octadeca-9,12,15-trienoate) by Bailey & Fisher (1946) showed that the primary products are octadeca-9,15-dienoate with about an equal amount of a mixture of octadeca-9,12- and -12,15-dienoates, and that to a minor extent two molecules of hydrogen are added simultaneously to linolenate with direct formation of mono-ethenoid esters. Further, these workers gave the relative reactivities of the different unsaturated esters as approximately:

methyl oleate
    CH3.[CH2]7.CH:CH.[CH2]7.CO2Me                               1
methyl octadeca-9,15-dienoate
    CH3.CH2.CH:CH.[CH2]4.CH:CH.[CH2]7.CO2Me                     3
methyl linoleate
    CH3.[CH2]4.CH:CH.CH2.CH:CH.[CH2]7.CO2Me                     20
methyl linolenate
    CH3.CH2.CH:CH.CH2.CH:CH.CH2.CH:CH.[CH2]7.CO2Me              40

These results lead unequivocally to the conclusion (Hilditch 1946) that selective hydrogenation in long-chain unsaturated compounds is essentially that of the specific pentadiene system -CH:CH.CH2.CH:CH- in which a 'reactive' methylene group occurs between two double bonds. The apparent persistence of di-ethenoid unsaturation during hydrogenation of a compound which initially contained a multi (4, 5, or 6)-ethenoid system (as in the marine animal oils) is now explicable, since primary attack of hydrogen at one or more of the pentadiene groupings originally present must result in the production of di- or even tri-ethenoid derivatives, in which the remaining double bonds are separated by four or more methylene groups, and which consequently react to catalytic hydrogenation little more rapidly than a mono-ethenoid compound.

The specific properties of the pentadiene grouping towards hydrogen in presence of a catalyst, thus made clear, rendered it of interest to compare the behaviour of the corresponding conjugated triene (elaeostearic) system

CH3.[CH2]3.CH:CH.CH:CH.CH:CH.[CH2]7.CO2Me

with that of methyl linolenate, linoleate and oleate. The hydrogenation of methyl elaeostearate itself evidently required further study, since the course of the action had been reported differently by earlier workers. Böeseken & Hoogland (1927) and Böeseken, van Krimpen & Blanken (1930) stated that reduction of elaeostearic glycerides (tung oil) proceeds in accordance with the Thiele mechanism, molecules of hydrogen being successively added to produce first a conjugated octadeca-10,12-dienoate and then octadec-11-enoate, before any saturated derivatives appear. Steger, van Loon & van Vlimmeren (1944), however, found that conjugated unsaturation disappeared steadily from the commencement of hydrogenation of the oil, i.e. no conjugated di-ethenoid reduction products are formed. They confirmed the observation that no saturated compounds are produced until the reaction product consists practically entirely of mono-ethenoid compounds, but considered that in the first phase of the hydrogenation non-conjugated diene derivatives are produced. Groot, Kentie & Knol (194?) later recommended the partial hydrogenation of tung oil as a means of obtaining almost pure octadec-11-enoic acid.

Methods 

The hydrogenations were carried out (on 50 to 120 g. of esters) in a three-necked round-bottomed flask fitted with a mechanical stirrer (1500 r.p.m.) and maintained at either 110 or 170 °C by an oil bath. Before commencing an experiment the apparatus was evacuated and then filled with hydrogen; during the actual hydrogenation the volume of hydrogen absorbed was followed approximately by delivering it to the reaction flask from a graduated gas container. Samples of the hydrogenated product were withdrawn for examination at successive intervals corresponding approximately with the absorption of about one-sixth of the volume of hydrogen necessary for complete conversion to saturated ester (stearate).
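To give a sense of scale for the sampling interval, the hydrogen requirement can be estimated as follows. This is a rough sketch, assuming pure methyl elaeostearate (C19H32O2, three double bonds), ideal-gas behaviour and gas volumes reduced to 0 °C and 1 atm; none of these conditions is stated in the paper, and the helper names are hypothetical.

```python
ATOMIC_MASS = {"C": 12.011, "H": 1.008, "O": 15.999}
MOLAR_VOLUME_STP = 22.414  # litres per mole of ideal gas at 0 C and 1 atm

def molar_mass(carbons, hydrogens, oxygens):
    """Molar mass in g/mole of a compound C_x H_y O_z."""
    return (carbons * ATOMIC_MASS["C"]
            + hydrogens * ATOMIC_MASS["H"]
            + oxygens * ATOMIC_MASS["O"])

def hydrogen_litres(grams_ester, ester_molar_mass, double_bonds):
    """Litres of hydrogen (STP) needed to saturate every double bond."""
    return grams_ester / ester_molar_mass * double_bonds * MOLAR_VOLUME_STP

elaeostearate_mass = molar_mass(19, 32, 2)

# Smallest charge mentioned in the text: 50 g of ester.
total_litres = hydrogen_litres(50.0, elaeostearate_mass, 3)

# One sampling interval corresponds to about one-sixth of the total.
litres_per_sample = total_litres / 6.0
```

On these assumptions a 50 g. charge needs roughly 11.5 l. of hydrogen in all, so that a sample is taken after each 1.9 l. or so.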

The catalyst used was Raney nickel, which was prepared from powdered nickel-aluminium alloy as described by Pavlic & Adkins (1946) and kept under alcohol until required for use. The concentration of catalyst employed was approximately 2 % of the esters.

Preparation of the unsaturated esters 

The requisite fatty acids were (except in the case of methyl linolenate) prepared 
from natural sources by means of low-temperature crystallization from appropriate 
solvents. In no case were they of 100 % purity, but consisted of materials containing 
high concentrations (usually over 90 %) of the desired compound accompanied by 
minor proportions of known constituents which did not interfere with the object of 
the experiments. In view of the liability of long-chain polyethenoid compounds to 
suffer change during chemical handling, it was decided that this course was to be 
preferred to the more lengthy procedure necessary in order to obtain a completely 
individual ester. 


Methyl elaeostearate 

Since elaeostearic acid tends to polymerize during handling and especially in presence of reagents such as sulphuric acid, and since methyl elaeostearate also commences to undergo polymerization during distillation at 0.2 mm. pressure through a fractionation column, the method finally adopted was to esterify the mixed acids from tung oil (isolated therefrom by rapid hydrolysis with a very small excess of alkali) with methyl alcohol at room temperature in presence of about 0.5 % of anhydrous hydrogen chloride. Esterification was complete after about 24 hr., and the elaeostearic content (77 %) of the mixed esters (after removal of any small amounts of free acids present) was found to be the same as that of the tung oil from which they had been prepared. The mixed methyl esters (365 g.) were then crystallized from 10 % solution in acetone at −40 °C, when 212 g. remained in solution; these esters, when further crystallized from 10 % solution in acetone at −60 °C, deposited 103 g. of esters which contained 94 % of methyl elaeostearate (with 5 % of oleate and 1 % of saturated esters). This (and analogous batches of similarly prepared methyl elaeostearate) was employed in the present hydrogenation studies.

Methyl oleate 

Neat's foot oil is a convenient source of oleic acid, since the latter (64 %) is accompanied in this oil by relatively small proportions of linoleic (3 %) and other polyethenoid acids (2 %). The bulk of the saturated acids present in the mixed fatty acids from the oil were removed in the form of lead salts insoluble in alcohol. The acids from the soluble lead salts (e.g. 217 g., iodine value 91.6) were first crystallized from 10 % solution in acetone at −40 °C, when 121 g. (iodine value 86.8) were deposited and recrystallized from acetone at the same temperature, when 102 g. (iodine value 83.5) were deposited. These acids were converted into methyl esters which were fractionally distilled in order to remove methyl esters of lower acids (mainly palmitic and hexadecenoic); 66 g. of methyl oleate (iodine value 84.0; equivalent 295.0; calc. iodine value 85.8, equivalent 296.0) were obtained and used in the present experiments. Linoleic ester was found to be absent from this ester, which contained 98 % of methyl oleate and 2 % of methyl stearate.

Methyl linoleate

Sunflower-seed oil with a content of 70 % of linoleic acid was used as the source of concentrates of methyl linoleate accompanied by methyl oleate. When the mixed acids of sunflower-seed oil are crystallized from acetone at −60 °C, the material left in solution (iodine value 166 or above) is rich in linoleic acid (iodine value 181.4). Further crystallization of the methyl esters of this concentrate from acetone at −66 or −70 °C usually leads to the deposition of esters somewhat richer in linoleate (e.g. iodine value 162, calc. 172.8). Esters of similar iodine value were finally distilled at 0.2 mm. through a fractionation column, and the fractions of highest iodine value were used in the hydrogenation experiments. In all, 163 g. of material of iodine value 163.3 was obtained (methyl linoleate 88 %, methyl oleate 12 %).

Methyl linolenate

It is not yet possible to separate linolenic acid from linoleic acid sufficiently for the present purpose by crystallization at low temperatures. Linolenic acid was therefore prepared by debromination of hexabromostearic acid (m.p. 180 °C) with zinc and pyridine (Kaufmann & Mestern 1936) and converted into crude methyl linolenate, which when distilled at 0.2 mm. pressure yielded methyl linolenate (iodine value 258, calc. 262.0).


(i) Spectrophotometric methods. Conjugated triene systems in long-chain acids absorb ultra-violet light in the region of 266 to 276 mμ with a well-defined absorption band head at 268 mμ (extinction coefficient $E^{1\%}_{1\,\mathrm{cm.}}$ 1780); analogous conjugated diene acids give an absorption band in the ultra-violet spectrum at 234 mμ ($E^{1\%}_{1\,\mathrm{cm.}}$ 1200). Spectrophotometric measurement of extinction coefficients at these wave-lengths thus leads directly to the determination of the proportions of conjugated triene and diene compounds present.

The spectrophotometric method can also be applied to the determination of linolenic and linoleic compounds (Hilditch, Morton & Riley 1945). These, although non-conjugated, contain the system -CH:CH.CH2.CH:CH-, which, on treatment with a high concentration of alkali hydroxide in an appropriate solvent at about 180 °C under standardized conditions, undergoes rearrangement to conjugated isomers (diene from linoleic compounds, diene and also triene from linolenic compounds). In the present work the analytical conditions recommended by Hilditch et al. (1945) were observed, namely, for linoleic compounds, isomerization with alkali in glycol solution at 180 °C for 60 min. and for linolenic compounds similar isomerization at 170 °C for 15 min. The amount of elaeostearic, linolenic, linoleic and any conjugated diene compounds present in a mixture containing some or all of these unsaturated compounds can thus be determined from the spectrophotometric observations described, due allowance being made, when determining linolenic acid, for the contribution to the extinction coefficient observed at 268 mμ after alkali-isomerization due to elaeostearic acid; whilst, when determining linoleic acid, similar deductions from the observed extinction coefficient at 234 mμ after alkali-isomerization must be made for the contributions due to elaeostearic, linolenic or any conjugated diene acid.

The reference values employed for the extinction coefficients $E^{1\%}_{1\,\mathrm{cm.}}$ for the respective pure acids, whether as such or after alkali-isomerization under the specified conditions, were as follows:

                           unisomerized           after alkali-isomerization
                           268 mμ     234 mμ      268 mμ     234 mμ
elaeostearic acid          1780       208         1690       197
linolenic acid             –          –           532        669
conjugated diene acid      –          1200        –          1140
linoleic acid              –          –           –          906
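The successive deductions described above amount to solving a small triangular system. The sketch below is one way to set it out, assuming that extinction coefficients are additive in the weight percentages and, for simplicity, that a single set of isomerized readings is used (the text specifies different isomerization conditions for linoleic and linolenic compounds); the function names and the sample composition are purely illustrative.

```python
def readings(elaeostearic, linolenic, diene, linoleic):
    """Extinction coefficients expected for given weight percentages."""
    raw268 = 1780.0 * elaeostearic / 100.0
    raw234 = (208.0 * elaeostearic + 1200.0 * diene) / 100.0
    iso268 = (1690.0 * elaeostearic + 532.0 * linolenic) / 100.0
    iso234 = (197.0 * elaeostearic + 669.0 * linolenic
              + 1140.0 * diene + 906.0 * linoleic) / 100.0
    return raw268, raw234, iso268, iso234

def composition(raw268, raw234, iso268, iso234):
    """Weight percentages recovered from the four extinction coefficients."""
    # Conjugated triene directly from the unisomerized 268 mu reading.
    elaeostearic = 100.0 * raw268 / 1780.0
    # Conjugated diene after deducting the triene contribution at 234 mu.
    diene = (100.0 * raw234 - 208.0 * elaeostearic) / 1200.0
    # Linolenic from the isomerized 268 mu reading, less the triene share.
    linolenic = (100.0 * iso268 - 1690.0 * elaeostearic) / 532.0
    # Linoleic from the isomerized 234 mu reading, less all other shares.
    linoleic = (100.0 * iso234 - 197.0 * elaeostearic
                - 669.0 * linolenic - 1140.0 * diene) / 906.0
    return elaeostearic, linolenic, diene, linoleic

# Round trip on a hypothetical mixture (percentages by weight).
sample = (50.0, 20.0, 5.0, 15.0)
recovered = composition(*readings(*sample))
```

The round trip returns the starting percentages, which confirms that the order of deductions (triene first, then diene and linolenic, then linoleic) is sufficient to separate the four components.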


(ii) Determination of total unsaturation (iodine value). The total amount of ethenoid unsaturation in long-chain aliphatic compounds can be determined by various methods which depend upon the quantitative addition of iodine mono-halides to the double bonds when the necessary conditions are observed. With non-conjugated unsaturated compounds quantitative addition takes place readily in the course of 30 min. in presence of a large excess of iodine monochloride in glacial acetic acid solution (Wijs 1898), but conjugated double bonds react only incompletely with the Wijs solution. When these are present, however, a reasonably accurate measure of the total unsaturation is obtained if the bromine vapour absorption method of Toms (1928) is employed, the result being expressed for convenience in terms of 'iodine value', i.e. the amount of halogen (calculated as iodine) which has combined with 100 parts of the unsaturated compound, or mixture of compounds. Whilst the accuracy of determination of unsaturation in mixtures containing conjugated unsaturated long-chain acids is not equal to that obtainable when only non-conjugated derivatives are present, the Toms method has proved sufficiently reliable in the present investigation to enable us to follow the course of hydrogenation of methyl elaeostearate and of mixtures of this ester with analogous non-conjugated esters.
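The definition of iodine value given above fixes the theoretical value for any pure ester or acid. The following sketch computes it from the molecular formula, using present-day atomic weights as an assumption; the results therefore differ slightly from the calculated values quoted in the text, and the function names are illustrative.

```python
ATOMIC_MASS = {"C": 12.011, "H": 1.008, "O": 15.999, "I": 126.904}

def molar_mass(carbons, hydrogens, oxygens):
    """Molar mass in g/mole of a compound C_x H_y O_z."""
    return (carbons * ATOMIC_MASS["C"]
            + hydrogens * ATOMIC_MASS["H"]
            + oxygens * ATOMIC_MASS["O"])

def theoretical_iodine_value(compound_molar_mass, double_bonds):
    """Grams of iodine (I2 per double bond) taken up by 100 g of compound."""
    iodine_per_mole = double_bonds * 2.0 * ATOMIC_MASS["I"]
    return 100.0 * iodine_per_mole / compound_molar_mass

methyl_oleate_iv = theoretical_iodine_value(molar_mass(19, 36, 2), 1)
methyl_linoleate_iv = theoretical_iodine_value(molar_mass(19, 34, 2), 2)
linoleic_acid_iv = theoretical_iodine_value(molar_mass(18, 32, 2), 2)
```

These come out close to the calculated values 85.8, 172.8 and 181.4 quoted for methyl oleate, methyl linoleate and linoleic acid.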

In general in a mixture of unsaturated long-chain esters, in which elaeostearate, 
linolenate and linoleate have been determined spectrophotometrically (as in (i) 
above), the proportion of oleic or other mono-ethenoid compounds then follows 
from the difference between the observed (Toms) iodine value and the sum of the 
increments of iodine value due to the observed amounts of the polyethenoid esters. 
Any saturated compounds present are then determined by difference.
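The calculation by difference described above can be sketched as follows. This is an illustration only: it assumes theoretical iodine values computed from approximate molecular weights of the methyl esters (the increments actually used are not stated here), and the function names are hypothetical. The figures in the usage note are those of the second product at 110° C in table 1.

```python
I2_MOLAR_MASS = 253.81  # g/mol, iodine (I2)


def theoretical_iodine_value(double_bonds, molar_mass):
    """Parts of iodine absorbed by 100 parts of ester, one I2 per double bond."""
    return 100.0 * double_bonds * I2_MOLAR_MASS / molar_mass


# Approximate molecular weights of the methyl esters (assumed values).
IV_MONO = theoretical_iodine_value(1, 296.49)    # methyl oleate
IV_DIENE = theoretical_iodine_value(2, 294.48)   # methyl linoleate
IV_TRIENE = theoretical_iodine_value(3, 292.46)  # methyl linolenate / elaeostearate


def mono_and_saturated(total_iv, triene_pct=0.0, linoleate_pct=0.0):
    """Return (mono-ethenoid %, saturated %) by difference.

    The iodine-value increments of the polyethenoid esters are deducted from
    the observed iodine value; the remainder is attributed to mono-ethenoid
    esters, and whatever is then left of the 100 % is taken as saturated.
    """
    poly_increment = (triene_pct * IV_TRIENE + linoleate_pct * IV_DIENE) / 100.0
    mono_pct = 100.0 * (total_iv - poly_increment) / IV_MONO
    saturated_pct = 100.0 - triene_pct - linoleate_pct - mono_pct
    return mono_pct, saturated_pct


if __name__ == "__main__":
    mono, saturated = mono_and_saturated(133.1, linoleate_pct=55.4)
    print(round(mono, 1), round(saturated, 1))
```

For an observed iodine value of 133.1 with 55.4 % of linoleate the sketch returns roughly 44 % of mono-ethenoid esters and under 1 % of saturated esters, in fair agreement with the tabulated figures.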

(iii) Determination of saturated acids in partially hydrogenated mixtures of esters.
In the special case of hydrogenation of methyl elaeostearate, or mixtures of this
with other unsaturated esters, it was desired to determine the proportions of mono-
and di-ethenoid esters present at progressive stages of the hydrogenation. This
could be effected if the mean iodine value of the mono- and di-ethenoid esters could
be calculated. For this purpose it was necessary separately to determine the
proportion of saturated esters present, and the procedure of Bertram (1925) was
employed. This method, although not of the highest order of analytical accuracy,
gives results accurate to within about 1 %. The proportions of triene esters
(elaeostearate or linolenate) having been determined spectrophotometrically, and the
proportion of saturated esters having been determined, the approximate iodine
value of the remaining components of the system (mono- or di-ethenoid esters)
can be calculated from the iodine value (Toms) of the original mixture, whence the
general proportions of the mono- and of di-ethenoid components can be
arithmetically calculated.
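The arithmetic referred to above amounts to two linear equations in two unknowns, and may be sketched as below. The default iodine values (260.3, 172.4 and 85.6 for triene, di-ethenoid and mono-ethenoid methyl esters) are assumed theoretical figures, not values stated in the text, and the function name is hypothetical.

```python
def split_mono_di(total_iv, triene_pct, saturated_pct,
                  iv_triene=260.3, iv_di=172.4, iv_mono=85.6):
    """Return (di-ethenoid %, mono-ethenoid %, mean residual iodine value).

    Solves the pair of equations
        di + mono = 100 - triene - saturated
        di * iv_di + mono * iv_mono = 100 * (total_iv - triene increment)
    where the triene increment is triene_pct * iv_triene / 100.
    """
    remainder_pct = 100.0 - triene_pct - saturated_pct
    residual_iv = total_iv - triene_pct * iv_triene / 100.0
    di_pct = (100.0 * residual_iv - remainder_pct * iv_mono) / (iv_di - iv_mono)
    mono_pct = remainder_pct - di_pct
    mean_iv = 100.0 * residual_iv / remainder_pct
    return di_pct, mono_pct, mean_iv


if __name__ == "__main__":
    # Hypothetical mixture: 30 % triene, 10 % saturated, 15 % di-, 45 % mono-ethenoid.
    iv = (30 * 260.3 + 15 * 172.4 + 45 * 85.6) / 100.0
    print(split_mono_di(iv, triene_pct=30.0, saturated_pct=10.0))
```

Because the residual iodine value is a small difference between two large quantities when much triene ester is present, small errors in the inputs produce large swings in the calculated di-ethenoid figure; a negative result simply indicates that no di-ethenoid ester is detectable within the analytical error.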

Results and discussion

Hydrogenation of methyl linoleate with Raney nickel catalyst 
The behaviour of methyl linoleate with the partly dispersed form of catalyst known 
as Raney nickel has not previously been recorded, and it seemed well to ascertain 
whether the selectivity observed with other forms of catalytic nickel was also shown 
in presence of Raney nickel, especially at the lower (110° C) of the two temperatures 
employed throughout this work. As the data in table 1 show, the hydrogenation was
extremely selective both at 110 and 170° C, production of methyl stearate being
negligible until 90 % or more of the linoleate had passed to the mono-ethenoid
condition. The first lines of figures in both parts of table 1 refer to the original
concentrate of methyl linoleate (cf. p. 326) used in the hydrogenations, succeeding
lines referring to samples withdrawn as the hydrogenations proceeded. In this
series of experiments the composition of the ester mixtures was determined from the
iodine value and the extinction coefficient at 234 mμ after alkali-isomerization
(for details see table 6).

Table 1. Hydrogenation of methyl linoleate

 iodine value      linoleate      mono-ethenoid      saturated
    (Wijs)            (%)           esters (%)          (%)

                    hydrogenation at 110° C
    163.4            88.3             11.7               —
    133.1            55.4             44.0              0.6
    101.7            18.4             81.4              0.2
     78.7             1.7             88.1             10.2

                    hydrogenation at 170° C
    163.4            88.3             11.7               —
    138.2            61.5             37.2              1.3
     94.1            11.2             87.1              1.7
     75.3             0.6             86.5             12.9


Hydrogenation of methyl elaeostearate 

A concentrate of methyl elaeostearate, prepared as described (p. 325) and
containing 93 % of methyl elaeostearate with minor proportions of oleate and (possibly)
linoleate, was hydrogenated with Raney nickel catalyst at 110 and at 170° C with
the results summarized in table 2. In table 2 the data for elaeostearate were
determined spectrophotometrically, saturated acids were determined by the Bertram
method (1925), and the figures for mono- and di-ethenoid compounds were based
upon the calculated iodine value of this group (i.e. after correcting the iodine value
of the ester mixtures for the observed proportions of elaeostearic and saturated
compounds).


Table 2. Hydrogenation of methyl elaeostearate

                                             di-ethenoid       mono-ethenoid
 iodine     elaeostearate     saturated     (from residual    (from residual
 value     (spectrographic)   (Bertram)      iodine value)     iodine value)
 (Toms)       (% found)       (% found)        (% calc.)         (% calc.)

                          hydrogenation at 110° C
 251.5           93               —                3                  4
 218.0           86               4                —                 10
 199.1           73               7                —                 20
 169.2           47              10               14                 29
 128.0           27              13                7                 53
  76.3            6              22                —                 72
  50.5            —              50                9                 41

                          hydrogenation at 170° C
 251.5           93               —                3                  4
 219.0           81               4                —                 15
 203.0           67               5                4                 24
 183.3           40               6               37                 17
 113.3           16               9                9                 66
  68.7            —              30                3                 67


Rigidly accurate determination of the iodine value of esters containing high
proportions of elaeostearate is difficult to accomplish (even by the bromine vapour
absorption procedure), whilst the same applies in perhaps lesser degree to the
quantitative determination of the saturated acids present. Moreover, the greater part
of the (calculated) iodine value of the more unsaturated products is contributed by
elaeostearate, so that the residual iodine value (which represents the mono- and
di-ethenoid components of the mixture) is relatively small by comparison with the
total iodine value observed by bromine vapour absorption. Consequently, no high
degree of accuracy is claimed for the calculated proportions of mono- and
di-ethenoid esters, although these figures serve to point unequivocally to the course
which the hydrogenation has for the most part followed.

The data in table 2 show that the proportion of saturated esters increases but
little (especially at 170° C) until practically all the elaeostearate has disappeared.
At the same time, the calculated figures in the two final columns indicate that
(subject to one or two exceptions referred to below) little or no di-ethenoid esters
are produced, the first stage of hydrogenation of the elaeostearate being very largely
the simultaneous addition of four atoms of hydrogen with production of a molecule
of a mono-ethenoid ester. The absorption spectra in the range 230 to 300 mμ of the
original ester and of its four products of partial hydrogenation at 110° C with iodine
values 218.0, 199.1, 169.2 and 128.0 (figure 1) exhibit no indication, moreover, of
the formation at any intermediate stage of any conjugated di-ethenoid ester (as
postulated by Boeseken et al. (1927, 1930) and subsequently disputed by Steger et al.
(1944)). Since an individual conjugated diene ester of a C18 acid has an extinction
coefficient of 1200 at 234 mμ, the production of any significant proportion of
such an ester would be readily seen from the graphs relating extinction coefficient to
wave-length, whereas all that is apparent in figure 1 is a steady fall in the magnitude
of the elaeostearate absorption bands at 268 to 290 mμ unaccompanied by any
significant effect in the region 230 to 240 mμ.

The significance, if any, of the minor proportions of di-ethenoid esters which appear
erratically at some stages (e.g. table 2, iodine values 169.2 (110° C) and 183.3 (170° C))
is uncertain. It would not be unreasonable to attribute such instances to chance
combinations of accumulated analytical error, but it is curious that direct
spectrographic examination of the partly hydrogenated esters in question has also shown
smaller but still significant proportions of conjugated diene unsaturation (cf. table 7).
It may therefore be that at certain stages of the hydrogenation process some
(relatively minor) production of di-ethenoid unsaturation from elaeostearate (or
from its mixture with mono-ethenoid esters already produced from it) may take
place; this point is again considered later (p. 333).

The chief primary products (mono-ethenoid esters) of hydrogenation of methyl
elaeostearate are probably not homogeneous, although we can confirm the
statement of Groot et al. that methyl octadec-11-enoate is produced in major proportions.
We are of the opinion, however, that mono-ethenoid esters with the double bond in
other positions (e.g. …) are almost certainly present in addition
to the main product, methyl trans-octadec-11-enoate. We have examined in some
detail the product obtained by hydrogenating methyl elaeostearate to an iodine
value of about 76, when almost all elaeostearate has disappeared. The mixed acids
obtained by hydrolysis of a hydrogenated ester of this type include a certain
proportion of saturated acids, and it is not possible to effect a quantitative separation
of the latter from the mono-ethenoid acids by crystallization. Nevertheless,
systematic crystallization from acetone or ether at temperatures down to -46° C has
enabled us to isolate from the mixed fatty acids about 36 to 40 % of an acid which
melted at 39 to 40° C and which appeared to consist largely of trans-octadec-11-enoic
(vaccenic) acid. Undoubtedly this represented only a part of the total amount of
this acid present, but concurrently there were definite indications of the presence of
other isomeric mono-ethenoid acids.



[Figure 1: graphs of extinction coefficient against wave-length, 230 to 300 mμ.]

Figure 1. Hydrogenation of methyl elaeostearate. O = original ester, I to IV = hydrogenated
ester. Iodine values (Toms) as follows: O, 251.5; I, 218.0; II, 199.1; III, 169.2; and IV, 128.0.

Hydrogenation of mixtures of methyl elaeostearate with methyl oleate,
linoleate, or linolenate

The courses taken during the hydrogenation of equimolecular mixtures of methyl
elaeostearate with either methyl oleate, methyl linoleate, or methyl linolenate were
next examined under the same conditions (Raney nickel catalyst at 110 and 170° C)
with the results illustrated in tables 3, 4 and 5.

Methyl elaeostearate and methyl oleate (table 3). The partly hydrogenated products
were examined as in the case of methyl elaeostearate alone, i.e. the proportions of
elaeostearate and of saturated esters were determined analytically, and the residual
iodine value of each product calculated to a mixture of mono- and di-ethenoid esters.

Table 3 shows very clearly that no oleate is hydrogenated to stearate until all the
elaeostearate has been converted into the mono-ethenoid condition, and confirms
that this is the main primary process in the hydrogenation of the conjugated triene
ester, although (as in the case of methyl elaeostearate alone) erratic appearances of
(calculated) di-ethenoid esters occur, apparently at an iodine value of about 120. In
these mixtures, however, no corresponding appearance of conjugated diene esters
was observed by direct spectrographic examination of the hydrogenated products
(cf. table 8).

Methyl elaeostearate and methyl linoleate (table 4). In this series of experiments
elaeostearate was determined spectrophotometrically, linoleate by similar
spectrophotometric analysis after the mixtures had been isomerized with alkali at 180° C
for 1 hr., and saturated components by oxidation (Bertram), the mono-ethenoid
esters being determined by difference.


Table 3. Hydrogenation of methyl elaeostearate with methyl oleate

                                             di-ethenoid       mono-ethenoid
 iodine     elaeostearate     saturated     (from residual    (from residual
 value     (spectrographic)   (Bertram)      iodine value)     iodine value)
 (Toms)       (% found)       (% found)        (% calc.)         (% calc.)

                          hydrogenation at 110° C
 163.6           45               4                —                 51
 132.0           28               4                1                 67
 120.8           12               5               21                 62
  80.1            1               9                —                 90
  60.1            —              30                —                 70

                          hydrogenation at 170° C
 163.6           45               4                —                 51
 135.3           28               4                3                 65
 119.3           11               5               21                 63
  93.7            1               6               14                 79
  56.7            —              32                —                 68


Table 4. Hydrogenation of methyl elaeostearate with methyl linoleate

 iodine     elaeostearate        linoleate        mono-ethenoid     saturated
 value     (spectrographic)   (spectrographic)   (by difference)    (Bertram)
 (Toms)       (% found)          (% found)          (% calc.)       (% found)

                          hydrogenation at 110° C
 193.3           45                 48                  7               —
 159.7           25                 48                 22               5
 131.0            8                 33                 54               6
  92.0            1                 11                 81               7
  61.3            —                  1                 71              28

                          hydrogenation at 170° C
 193.3           45                 48                  7               —
 151.3           20                 49                 28               3
 121.3            4                 32                 60               4
  86.9            —                 11                 82               7
  50.7            —                  1                 57              42


At both 110 and 170° C linoleate remained unattacked until the concentration of
elaeostearate had fallen to about 20 % or less of the total esters present; thereafter,
it underwent hydrogenation concurrently with elaeostearate, but when the latter
had practically disappeared the mixtures still contained about 25 % or more of
unchanged methyl linoleate. Since the primary hydrogenation products of both
elaeostearate and linoleate are mono-ethenoid esters, and hydrogenation in both
cases is extremely selective with reference to the mono-ethenoid stage, the
concentration of the latter esters rises to over 80 % (or probably higher) before any
substantial production of stearate sets in.

Direct spectrophotometric examination of the hydrogenated esters (cf. table 9)
showed only negligible amounts of conjugated diene ester throughout the series at
110° C, but somewhat over 10 % at 170° C by the time that the linoleate was about to
commence to undergo hydrogenation. This accords with the observation of Waterman
& van Vlodrop (1936) that linoleate undergoes partial isomerization to conjugated
forms when exposed at temperatures of about 200° C or higher to catalytically active
nickel under conditions which preclude its actual combination with hydrogen.

Methyl elaeostearate and methyl linolenate (table 5). In this series elaeostearate
and any small proportions of conjugated diene compounds were determined by
direct measurement of the extinction coefficients at 268 and 234 mμ respectively,
and linolenate was determined spectrophotometrically from the value of E(1%, 1 cm.) at
268 mμ after alkali-isomerization at 170° C for 15 min. (Hilditch et al. 1945). The
proportions of mono- and di-ethenoid esters in the products were then estimated
from the residual iodine value (cf. table 10) calculated after deducting from the total
iodine value the increments due to elaeostearate and linolenate.


Table 5. Hydrogenation of methyl elaeostearate with methyl linolenate

                                                   di-ethenoid       mono-ethenoid
 iodine     elaeostearate       linolenate        (from residual    (from residual
 value     (spectrographic)   (spectrographic)     iodine value)     iodine value)
 (Toms)       (% found)          (% found)           (% calc.)         (% calc.)

                          hydrogenation at 110° C
 256.7           47                 48                   5                 —
 194.8           21                 33                  17                29
 159.9            7                 20                  31                42
 108.1            —                  3                  19                78

                          hydrogenation at 170° C
 256.7           47                 48                   5                 —
 191.4           18                 38                   8                36
 155.6            3                 24                  26                47
 112.4            —                  3                  24                73


Mixtures of linolenate and elaeostearate esters differ strikingly from those
previously discussed (elaeostearate with linoleate or oleate) in that both the
non-conjugated and the conjugated triene esters combine with hydrogen from the outset
of the reaction. Nevertheless, the rate of hydrogenation of elaeostearate is at least
twice that of the non-conjugated linolenate, the former having almost completely
disappeared when about half of the linolenate has been attacked. Little if any
conjugated diene ester was detected during the course of hydrogenation at either
110 or 170° C.
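The statement that elaeostearate reacts at least twice as fast as linolenate can be checked roughly from table 5. The sketch below is not the method used in this paper: it assumes that the two esters compete for hydrogen by first-order processes, so that the ratio of rate constants equals the ratio of the logarithms of the fractions remaining. The function name is hypothetical.

```python
import math


def relative_rate(a_initial, a_final, b_initial, b_final):
    """Ratio k_a / k_b for two esters hydrogenated competitively.

    Assumption: each ester disappears by a first-order process in the same
    mixture, so that ln(a0/a) / ln(b0/b) = k_a / k_b.
    """
    return math.log(a_initial / a_final) / math.log(b_initial / b_final)


if __name__ == "__main__":
    # Table 5, 110° C: elaeostearate falls from 47 to 7 %, linolenate from 48 to 20 %.
    print(round(relative_rate(47.0, 7.0, 48.0, 20.0), 2))
```

On the figures for the third product at 110° C the ratio comes out a little above 2, which is consistent with the estimate given in the text.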


A possible explanation of the appearance of small proportions of diene esters during
hydrogenation of methyl elaeostearate might conceivably be the donation of
hydrogen by oleate or other mono-ethenoid ester to the conjugated triene ester, the diene
compounds thus arising from dehydrogenation of a mono-ethenoid ester. Methyl
elaeostearate (1 part) and methyl oleate (2 parts) were therefore stirred in the
hydrogenation vessel for 2 hr. with Raney nickel in an atmosphere of nitrogen.
Whilst small increases in the linoleate content (determined spectrophotometrically
after alkali-isomerization, table 11) were observed both at 110 and at 170° C, there
was little corresponding diminution in the proportion of elaeostearate present at
the conclusion of the experiments:

                                     elaeostearate     linoleate
                                       (% found)       (% found)
 original                                  32               1
 in nitrogen for 2 hr. at 110° C           31               6
 in nitrogen for 2 hr. at 170° C           30               8


On the other hand, after heating methyl oleate alone with Raney nickel in an
atmosphere of nitrogen at 180° C for 3 hr., no formation of diene esters was
detectable (linoleate in original ester 0.6 %, after the experiment 0.4 %).

The cause of the occasional appearance of diene unsaturation in minor proportions
at certain stages of the hydrogenation of elaeostearic esters has thus not been
ascertained during the course of the present work.

Conclusions

We believe that the most interesting feature revealed by this investigation is the
comparable reactivity to hydrogen in presence of nickel of the unsaturated systems

—CH:CH.CH2.CH:CH.CH2.CH:CH— and —CH:CH.CH:CH.CH:CH—.

It may be deduced from the present data together with those of Bailey & Fisher
(1946) for oleate, linoleate and linolenate that the relative reactivities to hydrogen
of elaeostearate, linolenate, linoleate and oleate are of the order 80:40:20:1. The
pentadiene carbon system in linoleic ester and especially the doubled pentadiene
arrangement in the linolenic series are thus relatively much more akin to the
conjugated triene grouping than to an isolated ethylenic group in their behaviour to
hydrogen in presence of catalytic nickel.

The present studies have also defined more clearly the course of hydrogenation of
the elaeostearic conjugated triene system, in which the primary phase has been
shown to consist almost wholly in the simultaneous addition of four atoms of
hydrogen to the elaeostearate molecule, with the production in one stage of a
mono-ethenoid, or more probably a mixture of isomeric mono-ethenoid, esters. It has also
been made clear that selectivity of hydrogenation is as well defined at the relatively
low temperature of 110° C as at higher temperatures, if the nickel catalyst employed
is in a form which is suitably active at a low temperature.

Analytical data 

The analytical determinations upon which are based the results discussed in the
preceding section and enumerated in tables 1 to 5 are summarized for convenience
at this point (tables 6 to 11). The methods employed in the various determinations
have been described earlier in this paper (cf. pp. 326 to 328).



Table 6. Hydrogenation of methyl linoleate

                   linoleate (spectrophotometric,
                    after alkali-isomerization)
 iodine value      E(1%, 1 cm.)
    (Wijs)           234 mμ         % linoleate

                       at 110° C
    163.4             800              88.3
    133.1             502              55.4
    101.7             167              18.4
     78.7              16               1.7

                       at 170° C
    163.4             800              88.3
    138.2             557              61.5
     94.1             101              11.2
     75.3               5               0.6



Table 7. Hydrogenation of methyl elaeostearate

             spectrophotometric analyses (unisomerized)
 iodine    E(1%, 1 cm.)    conj.    E(1%, 1 cm.)    conj.                  calc. iodine value
 value       268 mμ       triene      234 mμ        diene     saturated    of mono- +
 (Toms)                    (%)                       (%)     (Bertram) (%)  di-ethenoid

                                    at 110° C
 251.5        1659         93.2        155            —           —            125
 218.0        1534         86.2        259           6.6         4.3            —
 199.1        1294         72.7        228           6.4         7.6            50
 169.2         841         47.3        212           9.4        10.1           120
 128.0         477         26.8        138           6.8        13.0           101
  76.3         101          5.7        162          12.0        22.2            90
  50.5           1           —          —             —         50.1           106

                                    at 170° C
 251.5        1659         93.2        155            —           —            125
 219.0        1438         80.8        272           8.7         3.8            56
 203.0        1194         67.1        277          11.4         4.7           104
 183.3         716         40.2        270          15.5         5.9           153
 113.3         279         15.7        191          13.2         9.4           101
  68.7           8           —          32           2.6        29.6            94

Table 8. Hydrogenation of methyl elaeostearate with methyl oleate

             spectrophotometric analyses (unisomerized)
 iodine    E(1%, 1 cm.)    conj.    E(1%, 1 cm.)    conj.                  calc. iodine value
 value       268 mμ       triene      234 mμ        diene     saturated    of mono- +
 (Toms)                    (%)                       (%)         (%)       di-ethenoid

                                    at 110° C
 163.6         809         45.4        155           5.0         4.2            90
 132.0         500         28.1        114           4.6         4.2            91
 120.8         219         12.3         67           3.4         5.2           113
  80.1          23          1.3         20           1.4         8.9            90
  60.1           5          0.3          8           0.6        29.5            89

                                    at 170° C
 163.6         809         45.4        155           5.0         4.2            90
 135.3         506         28.4        146           7.2         3.9            95
 119.3         201         11.3         88           5.4         4.8           112
  93.7          12          0.7         20           1.5         6.4           104
  56.7           3           —           7           0.6        31.9            84



Table 9. Hydrogenation of methyl elaeostearate with methyl linoleate

                        spectrophotometric analyses
                        unisomerized                       after alkali-isomerization
 iodine    E(1%, 1 cm.)   conj.    E(1%, 1 cm.)   conj.    E(1%, 1 cm.)
 value       268 mμ      triene      234 mμ       diene      234 mμ      diene     saturated
 (Toms)                   (%)                      (%)                    (%)     (Bertram) (%)

                                    at 110° C
 193.3         806        45.3         70           —          526        48.2         —
 159.7         445        25.0        102          4.1         484        48.0        4.9
 131.0         136         7.6         45          2.4         309        32.6        6.4
  92.0          11         0.6         17          1.3         101        11.0        7.4
  61.3           2          —           8          0.6           5         0.6       28.2

                                    at 170° C
 193.3         806        45.3         70           —          526        48.2         —
 151.3         350        19.7        180         11.6         484        49.2        2.9
 121.3          83         4.6         90          6.7         296        31.7        3.8
  86.9           3          —          12          1.0         104        11.4        6.9
  50.7           2          —           9          0.8          11         1.2       42.3


Table 10. Hydrogenation of methyl elaeostearate with methyl linolenate

                              spectrophotometric analyses
                   unisomerized                       after alkali-isomerization
 iodine   E(1%, 1 cm.)  conj.   E(1%, 1 cm.)  conj.   E(1%, 1 cm.)  lino-   E(1%, 1 cm.)         calc. iodine value
 value      268 mμ     triene     234 mμ      diene     268 mμ     lenate     234 mμ      diene   of mono- +
 (Toms)                 (%)                    (%)                  (%)                    (%)    di-ethenoid

                                         at 110° C
 256.7        839       47.1        80          —        1052       48.2        386         2.1       177
 194.8        374       21.0        93         4.1        529       32.8        321        10.3       124
 159.9        123        6.9        80         5.6        222       19.8        221        10.4       129
 108.1          2         —          8         0.6         19        3.2        121        11.3       108

                                         at 170° C
 256.7        839       47.1        80          —        1052       48.2        386         2.1       177
 191.4        328       18.4        80         3.5        513       37.8        339         9.7       107
 155.6         59        3.3       108         8.4        182       23.7        242        11.1       122
 112.4          2         —          5         0.4         18        3.1        106         9.6       113


Table 11. Methyl elaeostearate (1 part) and methyl oleate (2 parts)
with Raney nickel in nitrogen atmosphere

                                       spectrophotometric analyses
                                          unisomerized                   after alkali-isomerization
                              E(1%, 1 cm.)  conj.   E(1%, 1 cm.)  conj.    E(1%, 1 cm.)
                                268 mμ     triene     234 mμ      diene      234 mμ      diene
                                            (%)                    (%)                    (%)
 elaeostearate-oleate mixture:
   original                       565       31.7        60          —           76        1.5
   after 2 hr. at 110° C          555       31.2        86         2.7         118        6.3
   after 2 hr. at 170° C          538       30.2       161        10.9         131        7.8
 methyl oleate:
   original                        —         —          —           —            5        0.5
   after 3 hr. at 180° C           —         —          —           —            4        0.4



References

Armstrong, E. F. & Hilditch, T. P. 1925 Proc. Roy. Soc. A, 108, 121.
Bailey, A. E. & Fisher, G. S. 1946 Oil & Soap, 23, 14.
Bertram, S. H. 1925 Z. dtsch. Öl- u. Fettindustr. 45, 733.
Boeseken, J. & Hoogland, J. 1927 Rec. trav. Chim. Pays-Bas, 46, 629.
Boeseken, J., van Krimpen, J. & Blanken, P. L. 1930 Rec. trav. Chim. Pays-Bas, 49, 247.
Groot, E. H., Kentie, A. & Knol, H. W. 1947 Rec. trav. Chim. Pays-Bas, 66, 633.
Harper, D. A. & Hilditch, T. P. 1937 J. Soc. Chem. Ind., Lond., 56, 322.
Hilditch, T. P. 1946 Nature, 157, 686.
Hilditch, T. P., Morton, R. A. & Riley, J. P. 1945 Analyst, 70, 68.
Hilditch, T. P. & Terleski, J. T. 1937 J. Soc. Chem. Ind., Lond., 56, 315.
Kaufman, H. P. & Mestern, H. E. 1936 Ber. dtsch. chem. Ges. 69 [B], 2684.
Moore, C. W. & Hilditch, T. P. 1923 J. Soc. Chem. Ind., Lond., 42, 15T.
Moore, H. K., Richter, G. A. & van Arsdel 1917 J. Industr. Engng Chem. 9, 451.
Pavlic, A. A. & Adkins, H. 1946 J. Amer. Chem. Soc. 68, 1471.
Richardson, A. S., Knuth, C. A. & Milligan, C. H. 1925 Industr. Engng Chem. 17, 80.
Steger, A., van Loon, J. & van Vlimmeren, P. J. 1944 Fette u. Seifen, 51, 49.
Toms, H. 1928 Analyst, 53, 69.
Waterman, H. I. & van Vlodrop, C. 1936 J. Soc. Chem. Ind., Lond., 55, 320T.
Wijs, J. J. A. 1898 Z. angew. Chem. 12, 291; Z. anal. Chem. 37, 277.


Initiation of solid explosives by impact and friction: 
the influence of grit 

By F. P. Bowden, F.R.S. and O. A. Gurton

Laboratory for the Physics and Chemistry of Rubbing Solids,
Department of Physical Chemistry, Cambridge

(Received 17 January 1949)

This paper describes an experimental study of the initiation of solid explosives, and in
particular the effect of artificially introducing transient hot spots of known maximum
temperature. This was done by adding small foreign particles (or grit) of known melting-point.

The minimum transient hot-spot temperature for the initiation of a number of secondary
and primary explosives has been determined in this way. It is shown that the melting-point
of the grit is the determining factor, and all the grits which sensitize these explosives to
initiation either by friction or impact have melting-points above a threshold value which lies
between 400 and 550° C. Grit particles of lower melting-point do not sensitize the explosives.

The same explosives initiated by the adiabatic compression of air required, for initiation,
minimum transient temperatures of the same order as the threshold melting-point values.

The results provide strong evidence that the initiation of solids as well as of liquids by
friction and impact is thermal in origin and is due to the formation of localized hot spots.
There is evidence that in the case of the majority of secondary explosives which melt at
comparatively low temperatures, intergranular friction is not able to cause explosion and
the hot spots must be formed in some other way. With the primary explosives which explode
at temperatures below their melting-points, hot spots formed by intergranular friction can
be important.

Introduction

An experimental study of the initiation of explosion in liquids by impact and by
friction has recently been described (Bowden, Stone & Tudor 1947). It has been
shown (Bowden, Mulcahy, Vines & Yoffe 1947) that in general the initiation is not
a tribochemical one. That is to say, initial chemical decomposition is not produced
by a direct rupture of the molecule, nor is the chemical excitation due to the rapid
shearing of adjacent molecular layers. The mechanical energy of the blow (or of
the rubbing) must first be degraded into heat to form a hot spot of small but finite
size. Thermal initiation then occurs at this hot spot. It was found that with liquid,
gelatinous or plastic explosives these hot spots were formed most readily by the
adiabatic compression of minute included gas bubbles (Bowden, Mulcahy, Vines
& Yoffe 1947), but they may also be formed by the friction between rubbing
surfaces, and in extreme cases by the viscous heating of the rapidly flowing explosive
itself. If small gas bubbles are present the explosive is rendered extremely
sensitive to impact, and ignition may occur with the gentlest of blows. There is some
indication that initiation may occur when the pressure ratio for the adiabatic
compression of a small air bubble is c. 30:1 (Eirich & Tabor 1948). This would give
a temperature for the gas in the bubble which is about 500° C.
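The figure of about 500° C can be recovered from the ordinary relation for the reversible adiabatic compression of an ideal gas, T2 = T1 (P2/P1)^((γ-1)/γ). The sketch below is an illustration under stated assumptions (air treated as an ideal gas with γ = 1.4, initially at 20° C); neither value is given in the text, and the function name is hypothetical.

```python
def adiabatic_temperature_c(pressure_ratio, initial_c=20.0, gamma=1.4):
    """Final temperature (° C) of an ideal gas after reversible adiabatic compression.

    Assumptions: ideal gas, constant ratio of specific heats `gamma`,
    no heat lost to the surrounding liquid during the compression.
    """
    t1 = initial_c + 273.15                                   # absolute temperature
    t2 = t1 * pressure_ratio ** ((gamma - 1.0) / gamma)
    return t2 - 273.15


if __name__ == "__main__":
    print(round(adiabatic_temperature_c(30.0)))
```

A compression ratio of 30:1 gives a final temperature close to 500° C on these assumptions; the result is only moderately sensitive to the initial temperature chosen.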

If the hot spot is formed by friction on a solid surface, there is evidence that the
maximum temperature rise is usually limited by the melting-point of the surface
(Bowden & Ridler 1936). By using surfaces of known melting-point, it is therefore
possible to fix and to limit the transient temperature which can be readily generated
by friction or impact. It has been found that for a liquid explosive, such as
nitroglycerine, being rubbed between metal surfaces, initiation does not occur unless
the melting-point of the metals is c. 480° C or higher. Direct measurements of the
surface temperature between different metals rubbing together have also been
made by using them as a thermocouple. If nitroglycerine is present between the
surfaces it is again found that initiation does not occur until the local temperature
at the points of rubbing contact is greater than 460° C.

Measurements by a variety of experimental methods show that the hot spots 
generated by friction and impact are transient and last for a short time, which
depends on the experimental conditions but which is usually of the order of 10"^ 
to 10”-^ sec. For this reason, and also probably because of their small size (recent 
measurements by Thomas (1949) indicate that the frictional hot spots on rubbing 
solids may be 10“® to 10“^ cm. in diameter) the hot-spot temperature ( > 450° C) 
necessary to ignite the explosive is appreciably higher than the conventional 
ignition temperatures, which for nitroglycerine, heated in bulk, is about 200 to
250° C. 

These hot spots also play an important part in the initiation of decomposition
in solid explosives. This paper will describe some simple experiments on the
initiation of solid explosives by friction and in particular the effect of artificially
introducing into the explosive small solid foreign particles of known melting-point.
It is well known of course that the introduction of solid particles ('grit') has a
sensitizing effect on explosives. The classical experiments on this are those of
Taylor & Weale (1938), who studied the effect of glass and of carborundum particles
on the impact sensitivity of a number of explosives. They concluded that the
initiation mechanism was a tribochemical one, and it is usually considered that it
is the hardness of the particle which is its most important property in determining
the sensitizing effect. Recently, Copp, Napier, Nash, Powell, Skelly, Ubbelohde &
Woodward (1948) have made some measurements on the friction sensitivity in
the presence of grit. They did not observe any effect unless the hardness of the
particle was greater than 4 on the Mohs scale.

However, if the sensitization is really due to the formation of a hot spot on the
surface of the grit particle, we should expect the melting-point of the grit particle
itself to be of primary importance (Bowden & Gurton 1948). The work described
in this paper shows that this is indeed so for solid as well as for liquid explosives,
and that unless the melting-point of the grit particle exceeds 400 to 600° C it does
not, under the experimental conditions used, initiate explosion.

A second paper describes a high-speed camera study of explosions of liquids and
solids initiated at a hot spot. It shows how these explosions develop from a
comparatively slow burning into a high-speed detonation. A third paper, by Dr Yoffe,
deals more fully with the initiation of liquid explosives by the adiabatic
compression of gas. It also discusses the formation of hot spots by the rapid compression
of air and other gases entrapped between the crystals of a solid and the part this
may play in the initiation of solid explosives.


1. Initiation of explosion by friction
Experimental 

An apparatus was constructed in which a thin layer of solid explosive could be 
subjected to rapid shear while held under a considerable load. The apparatus is 
shown in figure 1. 

A thin layer of explosive was placed on the upper surface of the bar A, under the
steel cylinder B. By screwing S, a suitable load was applied to the explosive, and
was measured by the bending of the beam B, or by using a loaded lever arm in
place of B. The pendulum bob P was then raised to a suitable height and allowed
to fall, turning about the ball-race pivot. The impact on A forced the bar along,
since it was free to slide on the lubricated surface of the fixed block O. In this way
the explosive was suddenly subjected to a rapid shearing action. Under these
conditions the explosive was highly compressed before the experiment, and the
possibility of including gas was small.

Results 

When P.E.T.N. (penta-erythritol tetranitrate) was subjected to this experiment
no explosions were obtained even when the highest loads and rates of shear were
employed. However, if a few small particles of glass were introduced into the explosive
before the test was carried out, explosion was obtained in every experiment. The
initial size of the particles used was of the order of 100 μ. This experiment was
repeated with other grits of known melting-point. The materials used as grits were
crushed minerals and crushed inorganic salts. Wherever possible the salts were
fused before crushing in order to remove occluded moisture. The minerals chosen
were among the softer ones (hardness on the Mohs scale 2 to 3.5), so that the
difference in hardness between most of the materials used was not very great.




Figure 1. Apparatus for the initiation of explosion by friction.

Table 1 gives the explosion efficiencies obtained with a number of grits together
with the hardnesses and melting-points of the materials. For convenience the
impact results, which will be discussed later in the paper, are included in the last
two columns of tables 1 to 7. If the friction explosion efficiencies set out in column 4
are compared with the hardness values in column 2, no obvious correlation can
be observed, but if they are compared with the melting-points of the grits as set
out in column 3, a remarkably sharp division is apparent. All grits of melting-point
greater than 430° C were effective in causing explosion, while all grits of
melting-point less than about 400° C were ineffective. Since the highest temperature
reached by rubbing two solids together is limited by the melting-point of the lower
melting solid, it follows that the hottest spot on a piece of grit rubbed on steel, or
on another piece of grit, will not, in general, have a temperature above the
melting-point of the grit. Thus these experiments show that when hot spots of 430° C
and upwards are produced in P.E.T.N., explosion usually follows. In the absence of
any hot spot hotter than 400° C no explosion occurred at the highest rates of shear
used in these experiments.
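The sharp division described above can be checked directly from the friction columns of Table 1. The following is a minimal sketch in Python; the function names and data layout are illustrative and not part of the original paper. It computes the explosion efficiency from the recorded counts and brackets the critical melting-point between the highest-melting inert grit and the lowest-melting effective grit.

```python
# Friction results for P.E.T.N. transcribed from Table 1:
# (grit, melting-point in deg C, explosions, trials).
# The "nil" row is omitted because it has no melting-point.
TABLE_1_FRICTION = [
    ("ammonium nitrate", 169.6, 0, 5),
    ("potassium bisulphate", 210, 0, 5),
    ("silver nitrate", 212, 0, 5),
    ("sodium dichromate", 320, 0, 5),
    ("sodium acetate", 324, 0, 5),
    ("potassium nitrate", 334, 0, 5),
    ("potassium dichromate", 398, 0, 5),
    ("silver bromide", 434, 5, 10),
    ("lead chloride", 501, 6, 10),
    ("silver iodide", 550, 5, 5),
    ("borax", 560, 5, 5),
    ("bismuthinite", 685, 5, 5),
    ("glass", 800, 5, 5),
    ("rock salt", 804, 5, 10),
    ("chalcocite", 1100, 5, 5),
    ("galena", 1114, 5, 5),
    ("calcite", 1339, 5, 5),
]


def efficiency(explosions, trials):
    """Explosion efficiency in per cent."""
    return 100.0 * explosions / trials


def threshold_bracket(rows):
    """Return (highest m.p. giving no explosion, lowest m.p. giving any)."""
    inert = [mp for _, mp, n, _ in rows if n == 0]
    active = [mp for _, mp, n, _ in rows if n > 0]
    return max(inert), min(active)


low, high = threshold_bracket(TABLE_1_FRICTION)
print(f"critical melting-point lies between {low} and {high} deg C")
```

Because every inert grit in the table melts below every effective one, the bracket is unambiguous for these data; with noisier data the two sets could overlap and a different summary would be needed.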


Cyclonite gave results which were essentially similar to those obtained with
P.E.T.N. Again grits of high melting-point were much more effective than grits of
low melting-point, and no explosions were obtained if the explosive was pure,
or mixed with any grit of melting-point less than 400° C. A few results have been
set out in table 2.


Table 1. Initiation of P.E.T.N. by friction and by impact in the presence of grit

                                                 friction initiation   impact initiation
                                                 explosion efficiency  explosion efficiency
grit added              hardness      m.p. (°C)  no.        (%)        no.        (%)
nil                     —             —          0/10       0          2/97       2
ammonium nitrate        2 to 3        169.6      0/5        0          1/39       2.5
potassium bisulphate    3             210        0/5        0          1/41       2.5
silver nitrate          2 to 3        212        0/5        0          1/49       2
sodium dichromate       2 to 3        320        0/5        0          0/77       0
sodium acetate          1 to 5        324        0/5        0          0/20       0
potassium nitrate       2 to 3        334        0/5        0          0/52       0
potassium dichromate    2 to 3        398        0/5        0          0/50       0
silver bromide          2 to 3        434        5/10       50         4/65       6
lead chloride           2 to 3        501        6/10       60         8/29       27
silver iodide           2 to 3        550        5/5        100        —          —
borax                   3 to 4        560        5/5        100        6/20       30
bismuthinite            2 to 2.5      685        5/5        100        5/14       42
glass                   7             800        5/5        100        6/6        100
rock salt               2 to 2.5      804        5/10       50         3/50       6
chalcocite              3 to 3.5      1100       5/5        100        6/12       50
galena                  2.5 to 2.7    1114       5/5        100        6/10       60
calcite                 3             1339       5/5        100        6/14       43

Table 2. Initiation of cyclonite by friction

                                                 friction initiation
grit                    hardness      m.p. (°C)  no.        (%)
nil                     —             —          0/6        0
sodium dichromate       2 to 3        320        0/6        0
potassium nitrate       2 to 3        334        0/5        0
potassium dichromate    2 to 3        398        0/5        0
silver bromide          2 to 3        434        4/10       40
lead chloride           2 to 3        501        2/10       20
silver iodide           2 to 3        550        6/6        100
borax                   3 to 4        560        6/6        100
glass                   7             800        6/6        100



Experiments have been carried out on this apparatus with four initiating
explosives, namely, lead azide, lead trinitroresorcinate, tetrazene and mercury
fulminate, and there was one marked difference in the results. When these explosives
were subjected to the stringent conditions of test used in the experiments with
P.E.T.N. and cyclonite, they exploded in the absence of any grit. In consequence
the load applied was reduced to 64 kg., and the maximum height of fall of the
pendulum to give no explosions was determined. Table 3 shows the relative
sensitivities of the four explosives.




Table 3. Initiation of explosives by friction in the absence of grit

                                          friction experiment
                                          height        explosion efficiency
                    m.p.       load       of fall
explosive           (°C)       (kg.)      (cm.)         no.     (%)
P.E.T.N.            141        1600       70            0/10    0
cyclonite           200        1600       70            0/5     0
lead azide          >335       1600       70            3/3     100
                               64         70            1/10    10
                               64         60            0/12    0
lead styphnate      >250       64         60            8/10    80
                                          45            3/5     60
                                          40            0/15    0
mercury fulminate   >145       64         5             1/10    10
                                          [illegible]   0/20    0

A fundamental difference between these explosives and the secondary explosives
(P.E.T.N. and cyclonite) is shown in their behaviour when heated slowly. The
secondary explosives melt before decomposing, while the primary explosives
investigated cannot be melted. They decompose explosively while they are still in
the solid state. It is not unlikely, therefore, that hot spots which develop on the
surfaces of these explosive crystals, or between the crystals and the steel surfaces,
could reach temperatures above the minimum necessary for explosion. With the
secondary explosives this will not occur, since the temperature rise is limited by
the melting of the explosives.

The effect of added grits on two of these explosives was studied, and again it
was found that only grits of melting-point 500° C and greater were effective
sensitizers for lead azide and lead styphnate. Some results are set out in tables 4 and 5.
Mercury fulminate and tetrazene were not treated in this way, since they were
considered to be too sensitive to friction even in the absence of grit.

Table 4. Initiation of lead azide by friction

                                               friction initiation
                                               (64 kg., 60 cm.)
grit                  hardness      m.p. (°C)  no.      (%)
nil                   —             —          0/12     0
silver nitrate        2 to 3        212        0/5      0
silver bromide        2 to 3        434        0/10     0
lead chloride         2 to 3        501        3/10     30
silver iodide         2 to 3        550        6/6      100
borax                 3 to 4        560        5/5      100
bismuthinite          2 to 2.5      685        5/5      100
chalcocite            3 to 3.5      1100       5/5      100
galena                2.5 to 2.7    1114       5/5      100
calcite               3             1339       5/5      100

Table 5. Initiation of lead styphnate

                                               friction initiation   impact initiation
                                               (64 kg., 40 cm.)      (240 g., 40 cm.)
grit                  hardness      m.p. (°C)  no.      (%)          no.      (%)
nil                   —             —          0/15     0            2/59     3.5
silver nitrate        2 to 3        212        0/17     0            0/20     0
potassium nitrate     2 to 3        334        0/30     0            —        —
potassium dichromate  2 to 3        398        0/20     0            —        —
silver bromide        2 to 3        434        1/35     3            0/20     0
lead chloride         2 to 3        501        4/19     21           1/55     2
silver iodide         2 to 3        550        5/6      83           5/5      100
borax                 3 to 4        560        5/7      72           5/5      100
bismuthinite          2 to 2.5      685        5/5      100          5/12     40
chalcocite            3 to 3.5      1100       5/5      100          10/10    100
galena                2.5 to 2.7    1114       5/5      100          8/10     80
calcite               3             1339       14/15    93           9/11     80

2. Initiation of explosion by impact

The effect of added grits on the impact sensitivity of various explosives has also 
been studied. 

Experimental 

In the first series of experiments P.E.T.N. was used. About 25 mg. of this
explosive was spread as a continuous layer of crystals on the surface of a J in.
diameter Hoffman steel roller, and a similar roller was placed on top of the explosive.
The two rollers were inserted in a in. diameter hole in a brass block mounted on
a rigid table. A 226 g. Hoffman steel ball was suspended vertically above the top
roller, and then allowed to fall freely. By repeating this experiment many times at
various heights, a height (47 cm.) was found at which only very occasional
explosions occurred. Further experiments were then carried out under these conditions,
but before placing the top roller on the layer of explosive a little grit (1 or 2 mg.)
was sprinkled over the explosive.
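To give a feel for the severity of this test, the energy and speed of the falling ball can be estimated from its mass and height of fall. This is a rough sketch only: it assumes free fall from rest under standard gravity and neglects air resistance and rebound, none of which is discussed in the text. The function names are illustrative.

```python
import math

G = 9.81  # m/s^2, standard gravity (assumed value)


def drop_energy_joules(mass_g, height_cm):
    """Potential energy released by a mass (grams) falling a height (cm)."""
    return (mass_g / 1000.0) * G * (height_cm / 100.0)


def impact_speed_m_per_s(height_cm):
    """Speed on arrival after free fall from rest through height (cm)."""
    return math.sqrt(2.0 * G * height_cm / 100.0)


# The 226 g. ball falling 47 cm., as used for P.E.T.N.
energy = drop_energy_joules(226, 47)
speed = impact_speed_m_per_s(47)
print(f"energy = {energy:.2f} J, speed = {speed:.2f} m/s")
```

Under these assumptions the ball delivers about one joule at roughly 3 m/sec. The same two functions apply to the 240 g. striker used with the primary explosives if the height of fall is taken as the vertical drop of the swinging arm.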

Results

The results for P.E.T.N. are recorded in table 1, columns 5 and 6. It is apparent
that they are very similar to those obtained in the friction experiments. Again
there is a sharp division; grits of melting-point 430° C and higher are effective
sensitizers, while grits of lower melting-point are in general unable to cause any
increase in sensitivity.

Similar experiments were carried out with the primary explosives lead azide,
lead styphnate, mercury fulminate and tetrazene. For convenience, similar rollers
were used in the confining surfaces, and the hammer used was attached to an
arm swinging freely about a point. Again the effective grits were those whose
melting-points exceeded a certain value.

The results for lead styphnate, mercury fulminate and tetrazene are set out in
tables 5, 6 and 7. It will be seen that there is again a clear-cut relation between the
melting-point of the particle and its influence on sensitivity. Experiments were
also carried out with lead azide. Again it was found that the melting-point was
important. Silver nitrate (m.p. 212° C), silver bromide (m.p. 434° C) and lead
chloride (m.p. 501° C) had no appreciable sensitizing effect when the impact was
provided by a 240 g. striker falling 29 cm. Under similar conditions borax (m.p.
560° C) and chalcocite (m.p. 1100° C) gave 100 % explosion efficiency. Bismuthinite
and galena had only small sensitizing effects in spite of their high melting-points,
but both of these substances were soft compared with the azide.


Table 6. Initiation of mercury fulminate

                                               impact initiation
                                               (240 g., 35 cm.)
grit                  hardness      m.p. (°C)  no.      (%)
nil                   —             —          0/20     0
silver nitrate        2 to 3        212        0/20     0
silver bromide        2 to 3        434        0/20     0
lead chloride         2 to 3        501        0/20     0
silver iodide         2 to 3        550        7/10     70
borax                 3 to 4        560        5/5      100
bismuthinite          2 to 2.5      685        5/5      100
chalcocite            3 to 3.5      1100       5/5      100
galena                2.5 to 2.7    1114       5/5      100
calcite               3             1339       5/5      100

Table 7. Initiation of tetrazene by impact

                                                 impact initiation
                      hardness                   (240 g., 10 cm.)
grit added            (Mohs' scale)   m.p. (°C)  no.      (%)
nil                   —               —          0/101    6
silver nitrate        2 to 3          212        0/30     0
potassium nitrate     2 to 3          334        1/40     2.5
potassium dichromate  2 to 3          398        0/30     0
silver bromide        2 to 3          434        5/16     31
lead chloride         2 to 3          501        4/14     30
silver iodide         2 to 3          550        4/5      80
borax                 3 to 4          560        4/4      100
bismuthinite          2 to 2.5        685        4/4      100
chalcocite            3 to 3.5        1100       3/8      38
galena                2.5 to 2.7      1114       4/4      100
calcite               3               1339       3/8      38


The chemical nature of the grit particle 

An examination of the results set out in the various tables shows that the chemical
nature of the grit particle has surprisingly little effect compared with its
melting-point. The chemical properties of the substances used differ very widely, but in
each case it is the melting-point of the particle which is the decisive factor. The one
exception to this which has so far been observed is potassium chlorate. The
hardness of potassium chlorate is 2 to 3 on the Mohs' scale and its melting-point is
c. 370° C. This is below the critical hot-spot temperature of 400 to 500° C.
Experiments showed, however, that the addition of potassium chlorate could have some
sensitizing effect on the initiation of certain explosives. When used with P.E.T.N.,
for example, no sensitization was observed for the friction experiments, but the
explosion efficiency under impact was 18 %. This should be compared with the
results given in table 1. When mixed with lead azide and investigated under the
conditions obtaining in table 4 it was found that again it had no apparent
sensitizing effect on the friction initiation, but the explosion efficiency under
impact was 6 %. With lead styphnate (compare table 5) the explosion efficiency
under friction was 7 % and with mercury fulminate under impact (compare table 6)
it was 11 %. Particles of potassium chlorate (m.p. 370° C) can therefore have some
sensitizing effect when substances such as potassium dichromate (m.p. 398° C)
have none. When we consider the strong oxidizing properties of potassium
chlorate and the fact that when mixed even with stable substances like carbon, it
will readily explode on heating, this observation is not surprising. It is probable
that extended and more detailed observations would yield further information
on the influence of the chemical properties of the grit on sensitizing (or, for that
matter, on desensitizing) the explosive. It is clear, however, from tables 1 to 7
that the overriding factor is the melting-point.

Influence of size, thermal conductivity and hardness of the grit 

We should expect that the size, the hardness, and thermal conductivity of the
grit particle would all play an important part because of the influence they have
on the ease of the formation of hot spots. If the particles are too small and too
numerous the energy of impact will be dissipated over many points of contact
which are distributed over a large area, so that no single hot spot reaches the
required temperature for ignition. Experiments on the frictional initiation of
nitroglycerine rubbed between metal surfaces in the presence of carborundum have,
in fact, shown that larger grit particles (100 μ in size) were more effective in producing
hot spots than smaller particles (0.6 to 10 μ).

It is clear that although the maximum hot-spot temperature is fixed by the
melting-point of the particle, the ease with which the hot spot is formed will be
very dependent upon the hardness. With a hard sharp particle the stresses will be
concentrated at one or two points so that it will require a much smaller energy,
under conditions both of impact and friction, to produce a localized temperature
rise of the necessary magnitude. If the particle is soft it will be plastically deformed
or crushed so that this local concentration of the energy is not possible. For this
reason we should expect that hard particles would be much more effective than soft
ones provided the melting-point of the particles is above the critical value.

Equally, we should expect that the thermal conductivity of the particles would
be of some importance. The general relation between the thermal conductivity
and the formation of hot spots and the incidence of explosion has already been
established in the earlier work (Bowden, Stone & Tudor 1947). It is much more
difficult to get visible hot spots on rubbing surfaces and also to initiate the explosion
of nitroglycerine if the surfaces are good thermal conductors. An interesting
indication of the importance of the thermal conductivity of the grit particle was
provided by rock salt. The hardness of rock salt is 2 to 2.5 and its melting-point
is c. 800° C. It should therefore sensitize the explosives to impact and friction.
Experiments with P.E.T.N. and with lead styphnate showed that this was so: grit
particles of rock salt initiated the explosion. In general, however, the explosion
efficiency was appreciably lower than with the other grit particles of similar
melting-point but lower thermal conductivity. For example, its explosion efficiency
with P.E.T.N. under the conditions of table 1 was 50 % for friction initiation and
6 % for impact initiation. With other grits of similar melting-point but lower
thermal conductivity, this explosion efficiency was about 100 % for friction
initiation and about 30 to 60 % for impact.

Support for this view was given by simple experiments in which a small block
of rock salt was rubbed against a rotating glass plate under a considerable load
(c. 30 lb.). Any hot spots of 560° C or upwards would have been clearly visible.
No such visible hot spots were detected, and the whole block soon became too hot
to hold. Other relatively poor conducting materials such as calcite and glass gave
distinct visible hot spots under the same conditions. It may be noted that the
glass, which is the hardest grit used in these experiments and which possesses a low
thermal conductivity, gave with most of the explosives the highest explosion
efficiency.

3. Initiation of explosion by the adiabatic compression of air

It is apparent that the hot-spot temperature necessary to initiate explosion
under the conditions of these experiments varies somewhat with the different
explosives investigated, but for all of them lies in the region of 400 to 560° C. This
temperature is appreciably higher than the conventional ignition temperature
obtained by dropping the explosive into a hot atmosphere or on to a hot plate.
Some of these values are given in table 8, column 2. However, the explosions
obtained in these determinations occurred after induction periods of at least
several minutes, while the frictional hot spots were short-lived and were unlikely
to last more than 10“® sec. An attempt was made therefore to estimate the minimum 
ignition temperature of the explosives nitroglycerine, P.E.T.N., lead azide, lead
styphnate, mercury fulminate and tetrazene when the high-temperature source
was applied for a very short time. The apparatus employed is shown in figure 2.

Table 8. Ignition temperature under various conditions

                                             min. hot-spot temp. (°C)          min. temp. (°C)
                                             for            for initiation     for ignition
                   ignition     induction    initiation     by impact in       by adiabatic
                   temp.        period       by friction    presence of grit   compression
explosive          (°C)         (sec.)                                         of air
nitroglycerine     200          —            450 to 480 (4) —                  450 to 480
P.E.T.N.           [illegible]  —            400 to 430     400 to 430         460 to 500
lead styphnate     250          90           430 to 500     500 to 560         570 to 600
lead azide         336          10           430 to 500     600 to 650         570 to 600
mercury fulminate  146          400          —              600 to 660         630 to 690
tetrazene          160          5            —              400 to 430         400 to 450

(1) Belayev & Yuzefovich (1940).
(2) Kast & Haid (1924).
(3) Rinkenbach & Burton (1931).
(4) Bowden, Stone & Tudor (1947).


A small quantity of explosive E was placed on the piston P which was mounted
in the block B and could be moved up and down by the screw S. A larger piston R
was placed in the cavity A, which was of exactly the same shape as the piston.
A 226 g. ball was dropped from 180 cm. on to the piston R. The air in A was thereby
compressed into the small volume E in about 2 msec. and then rapidly expanded.
The total time of compression and rarefication did not exceed 10“® sec. In the 
adiabatic compression of an ideal gas,

$$T_2 = T_1 \left(\frac{V_1}{V_2}\right)^{\gamma - 1},$$

where $V_1$ and $V_2$ are the initial and final volumes, $T_1$ and $T_2$ the initial and final
temperatures and $\gamma$ the ratio of the specific heats. In these experiments $V_1$ and $T_1$
were kept constant, while $V_2$ was varied by raising and lowering the piston P by
means of the screw S. Consequently the maximum gas temperature could be
calculated. The correction for the covolume of air was found to be insignificant,
since the highest volume compression ratios used were only about 20 : 1.
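The ideal-gas adiabatic relation, in which the final temperature equals the initial temperature multiplied by the volume compression ratio raised to the power (γ − 1), can be evaluated numerically as follows. This is a sketch only: the initial temperature of 293 K and the value γ = 1.4 for air are assumptions, since neither is stated in the text, and the function names are illustrative.

```python
def adiabatic_temperature_K(t1_K, volume_ratio, gamma=1.4):
    """Final temperature (K) after adiabatic compression of an ideal gas.

    volume_ratio is initial volume divided by final volume.
    """
    return t1_K * volume_ratio ** (gamma - 1.0)


def ratio_for_temperature(t1_K, t2_K, gamma=1.4):
    """Volume compression ratio needed to reach t2_K starting from t1_K."""
    return (t2_K / t1_K) ** (1.0 / (gamma - 1.0))


# Highest compression ratio quoted in the text, about 20 : 1,
# starting from an assumed room temperature of 293 K.
t_max = adiabatic_temperature_K(293.0, 20.0)
print(f"maximum gas temperature = {t_max:.0f} K ({t_max - 273.0:.0f} deg C)")
```

With these assumed values a 20 : 1 compression gives a gas temperature a little under 700° C, which is of the same order as the highest ignition temperatures recorded in the last column of table 8.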



Figure 2. Apparatus for the initiation of explosion by the rapid compression of air.

The minimum volume ratio to give an explosion in three trials was determined
as well as the maximum volume ratio to give no explosions. From these values
the ignition temperature was calculated. The results have been set out in column 6,
table 8, and may be compared with the values obtained for the hot-spot
temperatures necessary for the initiation of explosives by friction and impact in the
presence of grit which are shown in columns 4 and 5. It is interesting to note that
the ignition temperatures are of a slightly higher order than the minimum hot-spot
temperatures, but there is a much closer agreement between these figures than
between the minimum hot-spot temperatures and the conventional thermal
ignition temperatures. Copp et al. (1948) calculated the order of temperature which
would be required to initiate explosion by the following method.



With most explosives the thermal decomposition obeys a first-order law

$$k = \frac{1}{t}\ln\frac{1}{1-a},$$

where $t$ is the time in seconds, $a$ the fraction decomposed, and $k$ the
velocity constant. In order that the explosion may propagate, a finite quantity of
the explosive must be decomposed within the time of application of the hot spot.
Since the velocity constant must also obey an Arrhenius expression, a relationship
between the time $t$ and the temperature $T$ can be obtained for a constant fraction
decomposed. Alternatively, if $t$ can be estimated, $a$ can be calculated for various
hot-spot temperatures.
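The relationship between time, temperature and fraction decomposed mentioned above can be written out explicitly. The derivation below is a sketch which assumes the usual Arrhenius form for the velocity constant; the symbols $A$ (pre-exponential factor), $E$ (activation energy) and $R$ (gas constant) are generic and are introduced here for illustration.

```latex
\begin{align}
  % first-order law, solved for the fraction decomposed after time t
  a &= 1 - e^{-kt}, \\
  % Arrhenius expression assumed for the velocity constant
  k &= A\,e^{-E/RT}, \\
  % eliminating k: temperature needed to decompose a fraction a in time t
  T &= \frac{E}{R\,\ln\!\left[\dfrac{A\,t}{\ln\{1/(1-a)\}}\right]}.
\end{align}
```

For a fixed fraction $a$, a shorter time $t$ makes the logarithm in the denominator smaller and so requires a higher temperature, which is why a short-lived hot spot must be much hotter than the conventional ignition temperature.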

The following values of the velocity constants have been given for the thermal
decomposition of nitroglycerine, P.E.T.N., cyclonite and mercury fulminate (the
pre-exponential factors, written here as $A$, are not legible in this copy):

nitroglycerine      $k = A\,e^{-48{,}000/RT}$   (Roginsky 1932),
P.E.T.N.            $k = A\,e^{-47{,}000/RT}$   (Robertson 1948),
cyclonite           $k = A\,e^{-47{,}500/RT}$   (Robertson 1948),
mercury fulminate   $k = A\,e^{-25{,}400/RT}$   (Vaughan & Phillips 1949).

In the next paper it will be shown that initiation of explosion may begin at 
5 × 10⁻⁵ sec. from the first instant of impact. The delay between impact and the
first appearance of light from the explosion includes the time of compression during 
which the explosive is made to flow, seal off a gas pocket, and compress the gas, 
as well as the time between the development of the hot spot and the onset of
explosion. However, some further experiments reported in the next paper suggest
that the explosion of P.E.T.N. occurs less than 10⁻⁵ sec. after the collapse of the
gas pocket, that is, after the production of the hot spot. Using the above
expressions for velocity constants, table 9 has been drawn up to show the expected
proportion of each explosive decomposed in 10⁻⁵ sec. at various temperatures.

Table 9. Fraction of explosive decomposed in 10⁻⁵ sec.

temperature
(°K)     (°C)     nitroglycerine   P.E.T.N.   cyclonite     mercury fulminate
600      327      0.014            0.006      2 × 10⁻⁴      7.2 × 10⁻⁴
650      377      0.240            0.120      0.005         0.004
700      427      0.980            0.620      0.056         0.015
750      477      1.000            1.000      0.430         0.051
800      527      1.000            1.000      0.980         0.14
850      577      1.000            1.000      1.000         0.31
900      627      1.000            1.000      1.000         0.57
950      677      1.000            1.000      1.000         0.95


Since these calculations involve extrapolation far outside the range of temperature
over which determinations of the velocity constant have been carried out, they
cannot be expected to give accurate results, but the calculated values are
nevertheless in fairly good agreement with the values determined by adiabatic
compression of air and by friction at solid surfaces. For 50 % decomposition the
necessary temperature would appear to be about 400° C for nitroglycerine and
P.E.T.N., about 480° C for cyclonite and a little over 600° C for mercury fulminate,
which compare with the determined values shown in table 8.
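The temperatures for 50 % decomposition quoted above can be recovered approximately from table 9 by interpolation. The sketch below uses straight-line interpolation between adjacent table rows, which is an assumption of this example and not necessarily how the authors obtained their figures; the first-row entries for cyclonite and mercury fulminate are read as multiples of 10⁻⁴, which is also an assumption, although it does not affect the 50 % result.

```python
TEMPS_K = [600, 650, 700, 750, 800, 850, 900, 950]

# Fractions decomposed, transcribed from table 9.
FRACTIONS = {
    "nitroglycerine": [0.014, 0.240, 0.980, 1.0, 1.0, 1.0, 1.0, 1.0],
    "P.E.T.N.": [0.006, 0.120, 0.620, 1.0, 1.0, 1.0, 1.0, 1.0],
    "cyclonite": [2e-4, 0.005, 0.056, 0.430, 0.980, 1.0, 1.0, 1.0],
    "mercury fulminate": [7.2e-4, 0.004, 0.015, 0.051, 0.14, 0.31, 0.57, 0.95],
}


def temperature_for_fraction(name, target=0.5):
    """Temperature in deg C at which `target` is reached, by linear interpolation."""
    values = FRACTIONS[name]
    for i in range(len(values) - 1):
        lo, hi = values[i], values[i + 1]
        if lo <= target <= hi and hi > lo:
            step = TEMPS_K[i + 1] - TEMPS_K[i]
            t_kelvin = TEMPS_K[i] + step * (target - lo) / (hi - lo)
            return t_kelvin - 273.0  # the table itself uses deg C = deg K - 273
    raise ValueError(f"target {target} not bracketed for {name}")


for explosive in FRACTIONS:
    print(f"{explosive}: about {temperature_for_fraction(explosive):.0f} deg C")
```

The interpolated values fall close to 400° C for nitroglycerine and P.E.T.N., near 480° C for cyclonite and a little over 600° C for mercury fulminate, in line with the figures given in the text.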




The fairly close agreement between the determined hot-spot temperatures for
initiation by friction or adiabatic compression of air and the values calculated by
extrapolating thermal decomposition data provides additional evidence for the
suggestion that the chemical change which constitutes the onset of explosion
induced mechanically is the same change as occurs at lower temperatures during
slow decomposition.

It must be concluded that the mechanical initiation of these solid and liquid 
explosives is due to the development of local high temperatures by friction between 
solid surfaces, or by adiabatic compression of gas, and it is not due to direct 
mechanical activation of surface molecules. 

We thank the Ministry of Supply (Air) for support and equipment and also 
Dr James Taylor (Research Manager) and Imperial Chemical Industries (Nobel 
Division), from whose department one of us (O.A.G.) was seconded to this 
laboratory. 


References

Belayev, A. F. & Yuzefovich, N. A. 1940 Acad. Sci. U.R.S.S. 27, 133.
Bowden, F. P. & Gurton, O. A. 1948 Nature, 162, 654.
Bowden, F. P., Mulcahy, M. F. R., Vines, R. G. & Yoffe, A. 1947 Proc. Roy. Soc. A, 188, 291.
Bowden, F. P. & Ridler, K. E. W. 1936 Proc. Roy. Soc. A, 154, 640.
Bowden, F. P., Stone, M. A. & Tudor, G. K. 1947 Proc. Roy. Soc. A, 188, 329.
Copp, J. L., Napier, S. E., Nash, T., Powell, W. J., Skelly, H., Ubbelohde, A. R. &
Woodward, P. 1948 Phil. Trans. A, 241, 280.
Eirich, F. & Tabor, D. 1948 Proc. Camb. Phil. Soc. 44, 666.
Kast, H. & Haid, A. 1924 Z. angew. Chem. 38, 43.
Rinkenbach, W. H. & Burton, O. E. 1931 Army Ordn. 12, 120.
Robertson, A. J. B. 1948 J. Soc. Chem. Ind., Lond., 113, 221.
Roginsky, S. Z. 1932 Phys. Z. Sowjet. 1, 640.
Taylor, W. & Weale, A. 1938 Trans. Faraday Soc. 34, 995.
Thomas, P. H. (1949) See Bowden, F. P. & Tabor, D. J. Inst. Mech. Eng. (in the press).
Vaughan, J. & Phillips, L. 1949 J. Chem. Soc. (in press).



Birth and growth of explosion in liquids and solids 
initiated by impact and friction 

By F. P. Bowden, F.R.S. and O. A. Gurton

Laboratory for the Physics and Chemistry of Rubbing Solids,
Department of Physical Chemistry, Cambridge

(Received 17 January 1949)

[Plates 11 to 14] 


The birth and growth of explosions initiated by mechanical and thermal means have been
studied. Liquid and solid explosives show a striking similarity. The point of initiation is
always located at a source of local high temperature, for example, a hot wire, an electric spark,
an impacted grit particle, or at a gas pocket suddenly compressed during impact.

There is an appreciable time lag between the first moment of impact and the first
appearance of light from the explosion. With secondary explosives the time lag depends on the
conditions of impact (it could be varied from 60 to 150 μsec.), but for all the explosives
studied the delays under similar conditions are approximately the same. For the primary
explosives the time lags are usually much shorter, indicating that a different mechanism of
initiation may be operative.

The first stage of explosion in liquids is a burning which begins slowly and accelerates to
speeds of 600 or even 1000 m./sec. This speed may represent, in the main, a mass movement
of the gas products away from the centre of explosion.

In most solid explosives (both primary and secondary) the first stage is again a slow burning
which accelerates to speeds of several hundred metres per sec. A second stage of constant
velocity detonation then sets in. The detonation velocity (which varies from 1100 to 2300
m./sec. according to the explosive and the physical conditions of the layer) may be identified
with the low-velocity detonation in large charges, and the correct order of velocity has been
explained on hydrodynamic grounds.

It is suggested that the continued propagation of the low-velocity detonation stage in a
liquid is made possible by the rapid breaking up of the explosive by the detonation shock
front, particularly if the liquid has a low viscosity. In more viscous liquids and solids
propagation is possible only if hot-spot sources are present in the explosive. The hot spots may
be developed by rapid compression of gas pockets, or, if the solid has a high enough
melting-point, by intercrystalline friction.


Introduction

We have seen that the initiation of both liquid and solid explosives occurs at a
hot spot of small but finite size. In this paper we will consider the growth of the
explosion as it develops from the small hot spot to a high-speed detonation. An
account of the development of the explosion in nitroglycerine and other liquids
has been given in earlier papers (Bowden, Mulcahy, Vines & Yoffe 1947; Mulcahy
& Vines 1947; Vines 1947). It was shown that after the explosion had been
initiated at an electric spark or a compressed gas bubble, it developed in two stages.
The first stage was a rapid burning which spread at a velocity of about 400 m./sec.,
but this burning did not travel more than a centimetre before a detonation set in
and attained a speed of 2000 m./sec.

If the hot compressed bubble was trapped in a small hole or cavity, the general
explosion was preceded by a very slow burning spreading at 20 m./sec. or less.
After about 50 μsec. the explosion burst out of the confines of the cavity and fired
the rest of the explosive.

This paper will describe first a more detailed experimental examination of the
behaviour of liquids and then a study of the development of explosions in solids. It
will be shown that the formation of local hot spots may play an important part
in the growth and propagation of the explosion, as well as in its birth.


Experimental 

With the help of Mr J. S. Courtney-Pratt the drum camera was designed and
erected. In principle the camera was similar to those described by earlier workers,
but was very simple in design and both easy and safe to operate. The light alloy
drum 29.2 cm. in diameter was mounted on the shaft of a high-speed electric
motor. The film was mounted on the inside of the drum, and a prism fixed inside
the camera threw the image of the explosion on the film. With this arrangement
the centrifugal forces did not tend to throw the film off the drum, but kept the film
pressed hard against it. By using a short-focus lens (Zeiss Sonnar f/1.9 of 5 cm.
focal length) the magnification was reduced to a minimum, and thus since the
writing speed was unaffected, the slopes of the traces were increased. The camera,
which was contained in a box about 15 in. cube, which could be evacuated if
necessary, was bolted to the floor under a concrete table. On the table above
a 5 in. square hole, there was a block of steel with a straight slit 0.125 mm. wide
cut through it. The camera was focused on this slit which was so adjusted that it
gave an image on the film at right angles to the direction of motion. The explosive
was spread on a transparent material (armour plate glass or mica) which was put
on the slit. The whole arrangement is shown diagrammatically in figure 1.

When an explosion took place the flame spread from the point of initiation, and 
as it progressed along the narrow area viewed through the slit, it made a record 
on the moving film. This film was moving at a constant speed of c. 84.6 m./sec.,
so that each centimetre along the film represented a time of 115 /tsec. The flame 
front therefore drew out a complete distance-time curve of its own movement along 
the slit. The slope of this trace at any point was a measure of the instantaneous 
velocity of propagation at the corresponding point on the slit. 
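
The conversion from the slope of a trace to a flame velocity can be sketched as follows. This is only an illustration and not the authors' own reduction procedure: the function name, the magnification parameter and the sample readings are assumptions, and only the time scale of 115 μsec. per centimetre of film is taken from the text.

```python
# Time represented by 1 cm of film travel, as quoted in the text (microseconds).
US_PER_CM_FILM = 115.0


def flame_velocity_m_per_s(dx_film_cm, dt_film_cm, magnification):
    """Velocity along the slit from two displacements measured on the film.

    dx_film_cm    -- displacement of the trace across the film (image of the slit)
    dt_film_cm    -- displacement of the trace along the direction of film travel
    magnification -- image size / object size (hypothetical; not given in the text)
    """
    distance_m = (dx_film_cm / magnification) / 100.0  # real distance along the slit
    time_s = dt_film_cm * US_PER_CM_FILM * 1e-6        # elapsed time
    return distance_m / time_s


# Hypothetical reading: the trace moves 0.23 cm across the film while the film
# travels 0.10 cm, with an assumed magnification of 0.5.
velocity = flame_velocity_m_per_s(0.23, 0.10, 0.5)
print(f"{velocity:.0f} m./sec.")
```

With these assumed readings the result is about 400 m./sec., of the same order as the burning speeds discussed in the paper. A steeper trace (larger displacement across the film per unit of film travel) corresponds to a faster flame.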

Liquid explosives 

In the first experiments nitroglycerine was spread as a ring on a freshly cleaved 
mica surface with the centre of the ring immediately above the slit. The explosive 
was then hit by a flat brass striker. The fall hammer used for these experiments was 
of the pendulum type and similar to that described by Bowden et al. (1947). During 
impact the ring of explosive flowed outwards and inwards to give a thin film of 
liquid containing a single central air bubble. This bubble was rapidly compressed 
and therefore became a hot spot which constituted the point of initiation. In this 
way the point of initiation was located directly above the slit, and the explosion 
grew radially along the slit. 





A typical photograph obtained with nitroglycerine is shown in figure 2, plate 11. 
A represents the point of initiation from which the explosion spread at an increasing 
speed until it reached the edge of the striker at B. The rate of propagation of the 
flame in this case accelerated from about 180 to 660 m./sec., and there was no 
sign of any sharp discontinuity in the trace between A and B, and no appearance 
of detonation. Undoubtedly the flame corresponded to the 400 m./sec. stage 
observed by Mulcahy & Vines, but showed an acceleration which could not be 
observed at lower writing speeds. It was also obvious that this burning could 
propagate at least 2½ cm. without giving rise to a detonation. 



Figure 1. Diagrammatic representation of the high-speed camera and explosion apparatus. 

When the arrangement of the explosive was changed to a ring with two pro- 
jections extending along the slit, again the prolonged accelerating burning was 
observed, but detonation occurred before the flame reached the edge of the striker. 
Blast patterns on the strikers showed that the detonation usually began at points 
which were not directly above the slit, and this accounted for the irregular nature 
of the explosion photograph. Figure 3, plate 11, shows a photograph obtained this 
way; AB represented the accelerating burning. At B there was a dark space, some 
further burning, and then a detonation. The dark space corresponded to a position 
at which undecomposed nitroglycerine remained on the striker. However, the 
dark space did not necessarily show that there was any real separation between the 
front of the flame and the point at which detonation set in, but merely that the 
transition did not occur exactly on the slit. 

Initiation of a liquid explosive could also be effected by a flat striker if it had 
a small cavity at its centre. The cavity trapped an air bubble which was com- 
pressed by the impact, and the hot spot so produced, initiated the explosion. 
A thin film of nitroglycerine was spread on the transparent anvil and covered by 
a brass block with a hole (4.8 mm. diameter) at its centre. The explosion was then 
initiated by a small cavity striker in the central hole. This explosion began as a 
burning which continued for some time inside the cavity, and then burst out 
into the surrounding film of explosive. When nitroglycerine was used, and the 
surrounding film was confined but unimpacted except for the central 4.8 mm., 
the general behaviour illustrated by figure 4 was observed. In a number of experiments 
the rapid burning was not observed, and it is clear that detonation can occur 
very soon after the flame bursts out of the cavity. This is illustrated in figure 5. 
Similar results were obtained when cavity strikers of hemispherical section were 
used, but the central section of the trace was fogged by the burning of the thick 
layer of nitroglycerine built up round the striker during impact. A typical trace 
is shown in figure 6. 

Initiation of explosion in nitroglycerine by electric sparks gave results similar to 
those obtained by Mulcahy & Vines. The spark was generated under the surface of 
a confined layer of nitroglycerine by bringing together a wire and thin layer of 
silver on the glass. The wire and the silver were previously connected to the two 
terminals of a charged condenser. A typical photograph in figure 7 shows that the 
spark initiated a rapid and accelerating burning which later developed into 
detonation. 

Nitroglycerine could also be set off by a few crystals of lead azide initiated by 
a spark. In this case the explosive crystals detonated and immediately set off the 
nitroglycerine at its detonation velocity. This is illustrated in figure 8. 

Methyl nitrate when spread as a ring and hit by a flat hammer gave photographs 
showing the prolonged rapid and accelerating burning from the point of initiation. 
Figure 9 shows a photograph obtained by the impact initiation of methyl nitrate. 
A novel feature of methyl nitrate photographs was the vibrating flame front of 
the rapid burning. This may well be connected with the high volatility of this 
substance which might allow alternative boiling of the liquid and combustion in the 
vapour phase. Methyl nitrate was also set off by cavity strikers and gave explosion 
traces almost identical with those obtained with nitroglycerine. 

Diglycerol tetranitrate could be set off by impacting a ring of the explosive spread 
on mica, but the explosion which began at the compressed air bubble did not 
propagate far into the liquid explosive. Most of the liquid was undecomposed. 
Explosions initiated by cavity strikers were also very weak, and often gave in- 
sufficient light to record on the film. However, if several air bubbles were put in the 
liquid film, propagating explosions could be initiated by flat impact or by a cavity 
striker. With cavity strikers the explosions showed the usual burning inside the 
cavity. 

Figure 10 shows a trace obtained by initiating aerated diglycerol tetranitrate 
by a hemispherical cavity striker. The continued propagation was obviously 
a detonation, and travelled at about 2050 m./sec. When flat strikers were used the 
rapid burning could be detected but was of short duration and detonation set in 
very early. 

Solid explosives 

The experiments were now extended to the study of solid explosives. In all the 
experiments with solids the explosive layer was so arranged that the centre of the 
area impacted was directly above the slit; the impact was provided by a falling 
ball which struck a flat-ended cylinder (usually ½ in. diameter) resting lightly on 
the explosive. Before most experiments the layer of explosive was compressed 
under a load of 6000 lb./sq.in. in a hydraulic press. This was done in order to 
produce a layer which was uniform in thickness and as free as possible from 
included air. A simple device was used to measure the delay between the first 
instant of impact and the initiation of explosion. A spark in line with the slit was 
triggered by the electric contact between the falling ball and the steel cylinder. 
The vertical distance between the first appearance of a spark and the first light of 
the explosion gave a measure of the delay. 

Initiation of P.E.T.N. by impact 

The first explosive studied was P.E.T.N. (pentaerythritol tetranitrate). If the 
explosive was spread as a uniform layer of small crystals, the point of initiation 
was seldom located on the slit, and in many of the photographs there was evidence 
for several almost simultaneous points of initiation. Figure 11 shows a typical 
photograph obtained by impacting a thin layer of crystals 0.1 mm. thick. The long 
trace to the left of the explosion photograph was caused by a spark which was 
triggered by the impact of the falling ball on the striker (see later). However, it 
was found that there were at least two methods by which a single point of initiation 
could be obtained, and could be located directly over the slit. The first was the 
inclusion of a single particle of grit. A small speck of glass was placed at the centre 
of the continuous layer to be impacted, and then the explosion trace obtained 
showed the first appearance of light at the grit particle. From this point the flame 
spread in both directions at fairly low speeds (100 to 400 m./sec.), but showed 
a tendency to accelerate as it approached the edges of the impacted area. A typical 
trace is shown in figure 12. The fact that the explosion always began at a grit 
particle (which as we have seen is a potential source of a hot spot) provides additional 
evidence that the initiation is thermal. 

The second method by which the point of initiation could be located was the 
employment of an annular distribution of the solid explosive. It has been shown 
that the sensitivity of a circular layer of P.E.T.N., spread on the anvil as a thin 
uniform layer of crystals, may be increased by removing the central portion of the 
layer, thus leaving a complete ring of powder (Yoffe 1948). The effect was ascribed 
to the consequent inclusion of a comparatively large air pocket which was sealed 
off and compressed during impact. In this way a hot spot was developed by the 
adiabatic heating just as it was when bubbles in liquid explosives were suddenly 
compressed. 
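
The order of magnitude of such adiabatic heating can be illustrated with the ideal-gas relation T2 = T1 (P2/P1)^((γ-1)/γ). The sketch below is not taken from the paper: it assumes reversible adiabatic compression of air with a constant γ of 1.4, and the starting temperature and the pressure ratio of 20 are hypothetical values chosen only to show the trend.

```python
GAMMA_AIR = 1.4  # ratio of specific heats for air (assumed constant)


def adiabatic_temperature_k(t_initial_k, pressure_ratio, gamma=GAMMA_AIR):
    """Final temperature of an ideal gas after reversible adiabatic compression.

    t_initial_k    -- starting temperature in kelvin
    pressure_ratio -- final pressure divided by initial pressure
    """
    return t_initial_k * pressure_ratio ** ((gamma - 1.0) / gamma)


# Hypothetical case: air at 293 K compressed to 20 times its initial pressure.
t_final = adiabatic_temperature_k(293.0, 20.0)
print(f"{t_final:.0f} K")
```

Under these assumptions a twentyfold rise in pressure raises the gas to nearly 700 K, which indicates why a small trapped pocket of gas can act as a hot spot. Real compressions are neither perfectly adiabatic nor reversible, so the figure is only a rough guide.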

Further evidence for this explanation has been obtained by the use of the high-speed 
camera. If the ring of explosive was spread on a transparent anvil with the 
gas pocket directly above the slit, and struck by a hammer, the photographic 
trace obtained on a moving film was similar to that obtained when the explosive 
layer contained a grit particle. Figure 13 shows a trace obtained by impacting an 
annular layer of P.E.T.N. spread on mica. The explosive film was 0.1 mm. thick, 
and was compressed before being put in position for impact. The compression of 
the powder removed all large air spaces except the one purposely introduced. The 
energy and velocity of impact were the same as those used in obtaining the photograph 
shown in figure 11, and the delay between impact and initiation (as measured 
by the vertical distance from S to A on the time scale) was 62 μsec., only 2 μsec. 
different from the delay in figure 11. This suggests that the actual mechanism of 
initiation is the same for the continuous and annular layers of explosive. 

Photographic observation of compression of air pocket 

In order to confirm that the explosion began at the air pocket and not at some 
other point in the explosive layer, a new impact apparatus was constructed. This 
is shown in section in figure 14. The striker H was a disk of glass fitting into a hard 
steel holder. Above the striker there was a metallic mirror M, and by means of 
a lens system L, a beam of light from a high intrinsic brilliancy lamp G was directed 
through the glass striker and transparent anvil. The explosive was spread as a ring 
and placed under the striker. By an electrical mechanism the shutter was opened 




Figure 14. Explosion apparatus for observing the compression of the air pocket before explosion. 

just before impact and closed again before a complete revolution of the drum 
had occurred. Thus a vertical line which was obtained on the film showed the exact 
position of the hole in the explosive layer (figure 15, plate 12). Less than 10~® sec. 
before the explosion this line faded out, and the explosion always began from the 
position of the fade-out. The time of the fade-out may not correspond exactly with 
the collapse of the gas pocket, for the glass striker was always broken enough to 
prevent the transmission of sufficient light, and blank experiments showed that this 
breaking occurred soon after the instant of impact. The experiment shows clearly, 
however, that initiation occurs at the gas pocket. 

Delay between impact and initiation 

It has already been shown that a spark trace could be put on the film at the 
instant when contact was made between the falling ball and the steel striker. The 
distance along the film from the beginning of this trace to the appearance of 
explosion flame was therefore a measure of the delay between impact and the first 
appearance of the explosion. In table 1 some delay times determined for P.E.T.N., 
initiated under various conditions, have been set out. It will be seen that there is 
in general an appreciable delay (60 to 140 μsec.). If the explosive was spread as 
a ring the delay time was reduced by increasing the height of fall of the ball and 
thereby increasing the velocity of impact. The delay was halved by increasing the 
height of fall from 60 to 155 cm., in spite of a simultaneous reduction in the striker 
mass from 1860 to 530 g. In other words, the delay time was halved in spite of 
a reduction in impact energy from 11.2 × 10⁴ to 8.2 × 10⁴ g.cm. Thus the faster the 
compression the earlier was the explosion. It is therefore considered that the 
majority of the time between impact and explosion is taken up in crushing and 
compressing the explosive powder and does not represent an induction period during 
which the explosive is undergoing some accelerating decomposition without the 
appearance of light. 
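
The comparison of impact energy and impact velocity made above can be checked with a few lines of arithmetic. This is a sketch under stated assumptions: the masses and heights are those listed in table 1, the energy is taken as mass × height in the paper's own units (g.cm.), and the velocity assumes free fall from rest under g = 981 cm./sec.² with air resistance neglected. The function names are illustrative only.

```python
import math

G_CM_S2 = 981.0  # acceleration due to gravity in cm/sec^2 (assumed value)


def impact_energy_g_cm(mass_g, height_cm):
    """Impact energy in gravitational units (gram-weight x centimetre)."""
    return mass_g * height_cm


def impact_velocity_cm_s(height_cm):
    """Velocity reached in free fall from rest, neglecting air resistance."""
    return math.sqrt(2.0 * G_CM_S2 * height_cm)


# The two ring experiments of table 1: (mass of ball in g, height of fall in cm).
for mass_g, height_cm in [(1860, 60), (530, 155)]:
    energy = impact_energy_g_cm(mass_g, height_cm)
    velocity = impact_velocity_cm_s(height_cm)
    print(f"{mass_g} g. from {height_cm} cm.: {energy} g.cm., {velocity:.0f} cm./sec.")
```

The energies come out at about 11.2 × 10⁴ and 8.2 × 10⁴ g.cm., as quoted, while the estimated impact velocity rises from roughly 3.4 to 5.5 m./sec. The lighter ball therefore delivers less energy but compresses the layer faster, which is the point of the argument.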

It is also significant that the delay with a ring-like layer is about the same as 
the delay with a continuous film (compare rows 2 and 3 in table 1), and as will be 
pointed out later, the delays observed with other explosives (cyclonite and tetryl) 
were very close to the figures quoted in table 1 when the same conditions of impact 
were used. 


Table 1. Delay between impact and explosion 

mass of   height                                                                                  average
ball      of fall                                                 delay                           delay
(g.)      (cm.)     conditions                                    (μsec.)                         (μsec.)

1860      60        P.E.T.N. spread as a ring                     143, 135, 92, 144, 143          131
530       155       do.                                           79, 60, 60, 60                  65
530       155       P.E.T.N. spread as a continuous layer         59, 60, 67, 62, 58              61
250       180       P.E.T.N. spread as a continuous layer         97, 137, 78                     104
250       180       P.E.T.N. containing glass particles           139, 122, 83, 78                106
250       180       P.E.T.N. containing carborundum               91, 71, 113                     90
250       180       P.E.T.N. containing lead chloride particles   103, 74, 134, 66, 140, 136      109
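
The averages in table 1 can be recomputed from the individual delays as a consistency check. The sketch below simply transcribes the values as they appear in the table; the labels and the helper name are illustrative and are not part of the paper.

```python
from statistics import mean

# (conditions, individual delays in microseconds, printed average) as read from table 1
TABLE_1 = [
    ("ring, 1860 g. ball", [143, 135, 92, 144, 143], 131),
    ("ring, 530 g. ball", [79, 60, 60, 60], 65),
    ("continuous layer, 530 g. ball", [59, 60, 67, 62, 58], 61),
    ("continuous layer", [97, 137, 78], 104),
    ("glass particles", [139, 122, 83, 78], 106),
    ("carborundum", [91, 71, 113], 90),
    ("lead chloride particles", [103, 74, 134, 66, 140, 136], 109),
]


def recomputed_means(table):
    """Return (label, recomputed mean, printed average) for every row."""
    return [(label, mean(delays), printed) for label, delays, printed in table]


for label, value, printed in recomputed_means(TABLE_1):
    print(f"{label:32s} recomputed {value:6.1f}   printed {printed}")
```

Every recomputed mean lies within 2 μsec. of the printed average. The largest difference is in the carborundum row, where one of the transcribed figures may be a misprint or a misreading.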

A few experiments with explosive containing grit confirmed this view. There was 
a large scatter in the results, but there was no sign of a significantly shorter delay 
when grit was present. The fourth row of table 1 shows three results obtained with 
flat impact on a continuous layer of P.E.T.N., and rows 5, 6 and 7 show the delay 
times obtained with the same set-up when glass, carborundum and lead chloride 
were included. There was no tendency for the delay to be reduced by the inclusion 
of grit particles. 

Impact of single crystals 

Large single crystals of P.E.T.N. can be made by slowly cooling a hot saturated 
solution of the explosive in acetone. The crystals are elongated plates. It was found 
that these single crystals could be initiated by impact. Crystals of dimensions 
c. 5 × 3 × 1 mm. were placed on thick mica above the slit and covered by steel 
rollers. The explosion was then initiated by an 1860 g. ball falling 60 cm. Explosion 
photographs were obtained, but they were always irregular. One of these is 
shown in figure 16. The velocities of propagation were obviously low, but the most 
important observation was the fact that the delays between impact and initiation 
were 400 to 500 μsec., while under the same conditions, with thin layers of the 
explosive the delays were 100 to 150 μsec. It is to be concluded that initiation of 
the crystals did not occur until they had been crushed and broken by the impact, 
i.e. not until cavities and compressible gas pockets had been introduced. 

Propagation of the explosion in P.E.T.N. 

It has been shown that propagation from the point of initiation achieves speeds 
of a few hundred m./sec., but no evidence of a second stage has been mentioned. 

In a series of experiments, the size of the impacted area was increased by varying 
the diameter of the hardened striker used. In each case the explosive was spread 
as a ring on the transparent anvil, the small air pocket sitting directly above the 
slit. Figures 17, 18 and 19 (plate 12) represent traces obtained with strikers of 
diameter 4.8, 12.7 and 25.4 mm. respectively. It was obvious that the propagation 
velocity increased as the diameter of the striker increased. Of course, there was 
an increase in the minimum energy necessary for initiation as the striker diameter 
increased, and the higher rates of propagation may have been due to the greater 
degree of confinement, i.e. the higher pressure of impact. However, the propaga- 
tions were of a continuous nature, and no sudden change in velocity was indicated 
by any of the traces. 

This did not apply if the area actually impacted was surrounded by a continuous 
layer of the same explosive. If this unimpacted area was unconfined, the 
explosion stopped abruptly at the edge of the striker, but if a steel or brass block 
was placed on top of the unimpacted portion so that it was lightly confined the 
explosion continued. However, it did not continue at the burning speeds of a few 
hundred m./sec. ; there was a sudden transition to a much higher velocity. This 
higher velocity was accompanied by the emission of a greater amount of light. 
Although the velocity of propagation of this second stage of the explosion was only 
about 1400 m./sec., it had all the properties of a detonation wave. A typical example 
of this behaviour is illustrated by figure 20, which shows the explosion in a layer 
of explosive spread as a ring, but continued in both directions along the camera 
slit. Sometimes an apparent dark space appeared where the inception of detonation 
occurred before the rapid burning had reached the edge of the impacted region. 
However, it should again be remembered that the camera observes only the 
phenomena which occur in the section of the explosive above the slit, and if the 
transition occurred at a point slightly off the slit, the detonation wave would 
reach the slit before the deflagration was complete because of the difference in the 
speeds of the two propagations. Figures 21 and 22 illustrate the two extreme 
examples of this. In figure 21 detonation set in immediately and there is little sign 
of a dark space. On the other hand, figure 22 represents the results of an experi- 
ment in which no precautions were taken to initiate the explosion on the slit. The 
P.E.T.N. was distributed as a continuous layer so that initiation could occur 
anywhere. This photograph taken at its face value suggests that the second stage 
or detonation begins well ahead of the advancing front of the first stage or rapid 
burning. However, this is illusory and merely shows that the transition has taken 
place first at a point off the slit, and the fast detonation wave has reached the slit 
before the first stage burning. If the thickness of the explosive layer was increased, 
the propagation velocity in the unimpacted region was increased slightly, but the 
velocity showed no sign of approaching the hydrodynamic detonation velocity. 
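
The geometrical argument about a transition occurring off the slit can be sketched numerically. The model below is a deliberate simplification and is not the authors' analysis: it assumes circular fronts, a constant burning speed of 400 m./sec. and a constant detonation speed of 1400 m./sec. (representative of the figures quoted for P.E.T.N.), initiation at the origin on the slit, and a hypothetical transition point 2 mm. to one side of the slit.

```python
import math

V_BURN_M_S = 400.0   # assumed constant burning speed
V_DET_M_S = 1400.0   # assumed constant detonation speed


def arrival_times_us(x_mm, transition_xy_mm):
    """Arrival times (microseconds) of the two fronts at position x on the slit.

    The slit lies along the x axis and initiation is at the origin. The burning
    front spreads radially; where it reaches the transition point a detonation
    starts and spreads radially from that point.
    """
    x0, y0 = transition_xy_mm
    # mm divided by m/sec gives milliseconds; multiply by 1000 for microseconds.
    t_transition = math.hypot(x0, y0) / V_BURN_M_S * 1e3
    t_burn = abs(x_mm) / V_BURN_M_S * 1e3
    t_det = t_transition + math.hypot(x_mm - x0, y0) / V_DET_M_S * 1e3
    return t_burn, t_det


transition = (3.0, 2.0)  # hypothetical point 2 mm off the slit
for x in (4.0, 5.0, 6.0):
    t_burn, t_det = arrival_times_us(x, transition)
    first = "burning" if t_burn < t_det else "detonation"
    print(f"x = {x:.0f} mm: burning {t_burn:.1f}, detonation {t_det:.1f} μsec. ({first} first)")
```

In this example the burning front is still first at 4 mm. along the slit, but by 5 mm. the detonation wave has overtaken it, so the record on the slit shows detonation ahead of a burning trace that has not yet arrived.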

Figure 23 shows a trace obtained by initiating a layer of loose crystals 0.1 mm. 
thick and figure 24 a trace from a layer 0.5 mm. thick. The fivefold increase in 
thickness caused only a 40 % increase in the detonation velocity. 

The shock wave in air produced by the detonation of a thin film of P.E.T.N. could 
initiate a further film of the explosive. A gap of about 1.5 cm. was left in the 
uncompressed layer of the explosive at some distance from the point of initiation, and 
the whole of the explosive as before was lightly confined by a brass block. Figure 
25 shows that the shock wave in air which travelled faster than the detonation 
itself initiated the explosive as soon as it arrived, but not at the burning speed. 
It is true that the new propagation started at a subdetonation speed, but it rapidly 
accelerated to the steady rate without any sign of discontinuity or transformation 
from one type of propagation to another. 

Initiation by impact and propagation of explosion in cyclonite and tetryl 

High-speed camera studies similar to those described above have been carried 
out with two other secondary explosives, cyclonite (cyclo trimethylene trinitra- 
mine) and tetryl (trinitro phenyl methyl nitramine). If either explosive was 
arranged as an annular layer under a striker of circular section, impact caused 
initiation of the explosive at the air pocket, but the propagation rate of the explo- 
sion was often irregular, and was always considerably slower than the rates 
observed with P.E.T.N. Figure 26 is a photograph obtained by initiating an 
annular layer of cyclonite by the impact of a 530 g. ball falling 164 cm. The flame 
persisted for a considerable time. However, the initiation delay was 61 μsec., 
which was surprisingly close to the delay times observed with P.E.T.N. under the 
same conditions of impact (see figure 13, which shows an initiation delay of 62 μsec.). 
Figure 27 is a photograph obtained by initiating a pressed film of tetryl (with a 
central air pocket) by the impact of an 1860 g. ball falling 60 cm. Again the 
propagation rate AB was much slower, but the initiation delay, 140 μsec., was again 
of the same order as that observed for P.E.T.N. (120 to 160 μsec.) under similar 
conditions of impact. 

In attempting to propagate these explosions beyond the limits of the striker’s 
radius into an unimpacted region, results like those shown in figures 28 and 29 
were obtained. Apparently in neither case was the transformation from rapid 
burning to detonation possible. There was some evidence of the propagation of the 
burning just into the unimpacted area in the case of cyclonite. 

Although detonation of the unimpacted film could not be brought about by 
the rapid burning initiated by impact, it does not follow that thin films of these 
explosives cannot detonate when confined. Another set of experiments was carried 
out in which some lead azide was spread in the centre of the area (1.27 cm. diameter) 
to be impacted, while cyclonite or tetryl was spread as a continuous loose 
layer 0.2 mm. thick in both directions along the slit from the impacted area and 
covered by a steel block. Initiation of the lead azide at the centre of the impacted 
area was assured by the inclusion of a small chip of glass. It was then obvious that 
these explosives could be made to detonate in thin films. Figure 30, plate 13, 
shows the detonation of cyclonite at 2250 m./sec. and figure 31 the detonation 
of tetryl at 1480 and 1420 m./sec. The only apparent difference between these 
explosives and P.E.T.N. was their inability to transform their modes of propagation 
from rapid burning to detonation under these conditions. 

Initiation by impact and propagation of primary explosives 

High-speed camera experiments have been carried out with three primary 
explosives, mercury fulminate (Hg(ONC)₂), lead styphnate (lead trinitroresorcinate, 
PbC₆HN₃O₈.H₂O), and lead azide (PbN₆). Attempts were made to initiate 
these explosives when spread as continuous films under 0.48 cm. diameter strikers. 
These very small strikers were used so that there was a reasonable possibility of 
getting the initiation close to the slit. When thin layers of explosive and only just 
sufficient energy were used the traces obtained with mercury fulminate were of 
the form shown in figure 32. The explosion began as a rapid burning which trans- 
formed into detonation at the edge of the striker. Figure 33 is a photograph of an 
explosion of lead azide initiated by impacting a continuous layer of crystals by 
a small striker using the same conditions of experiment as have been described 
above for obtaining figure 32. It is apparent that the steady detonation velocity 
1930 m./sec. is very rapidly attained. There is no evidence for a burning and no 
sharp change in propagation velocity like the changes observed with P.E.T.N. 
and mercury fulminate. 

Of course in these experiments no special precautions were taken to locate the 
point of initiation exactly above the slit, and consequently many of the traces 
were obscure in the impacted region. Some experiments were therefore designed 
with a view to initiating these primary explosives at known points. 

In the first series of experiments the explosives were spread on the toughened 
glass anvils as ring-like layers with the central space over the camera slit. It was 
thought that this arrangement could bring about initiation by the adiabatic heating 
of the air pocket trapped and compressed during impact. Figure 34 is a typical 
photograph obtained with mercury fulminate initiated by impacting a ring of the 
explosive with a striker 1.27 cm. diameter. There is no sign of a single point of 
initiation in this photograph, and the whole impacted region is obscure. Lead 
azide photographs obtained this way also showed no single point of initiation, but 
usually gave curved boundaries to the detonation traces indicating that initiation 
had occurred at a point off the slit. When lead styphnate was spread as a ring and 
impacted by a 1.27 cm. diameter striker, once again the point of initiation was 
never at the gas pocket. Figure 35 shows the photograph in which the point of 
initiation was nearest to the air pocket, but even here the explosion began at A 
and not at the gas pocket P. Thus there was no evidence for the initiation of these 
primary explosives by the compressional heating of gas pockets. 

The only satisfactory method of ensuring the location of the point of initiation 
above the camera slit was the use of grit particles. Figure 36 shows a photograph 
from the initiation of a layer of mercury fulminate by impact when the explosive 
contained a tiny chip of glass at its centre. In this case the change from burning to 
detonation took place spontaneously at no obvious mechanical discontinuity, for 
the whole area of the explosive was impacted. With lead azide photographs similar 
to that shown in figure 37 were obtained. No burning could be detected. The 
behaviour of lead styphnate is illustrated in figure 38. The point of initiation was 
located at a grit particle on the slit, and the propagation began as an accelerating 
burning, but it continued into the unimpacted area without any very marked change 
in velocity and at an irregular speed. It must be assumed therefore either that the 
deflagration continued without detonation being set up, or that the detonation 
velocity was very low. In a second series of experiments a layer of lead styphnate 
was initiated by a central impacted layer of lead azide (as described for cyclonite 
and tetryl), and again the styphnate did not propagate at a velocity any greater 
than that shown in figure 38. A characteristic photograph obtained for lead azide 
initiation in this way is shown in figure 39. 

The delays between impact and initiation when these explosives were set off by 
hot spots developed on glass particles proved to be irregular. This may have been 
connected with the uncontrolled size of the grit particles used. Here are some values 
obtained when the impact was provided by an arm fall hammer of effective weight 
300 g. falling 50 cm. : 

lead azide 148, 113, 68, 64, 59 

mercury fulminate 107, 83, 50 

lead styphnate 161, 152, 143, 129, 77 

The scatter in these results is very wide, but it can be seen that, although the 
impact energy was only 1.5 × 10⁴ g.cm., the delays were of the same order as those 
obtained with P.E.T.N. using impact energies of 4.5 to 11.2 × 10⁴ g.cm. (see 
table 1). 

Initiation of explosion in larger crystals 

Lead azide was recrystallized from a saturated solution of ammonium acetate, 
but since spontaneous explosions occurred in some experiments the largest crystals 
grown were about 0.6 mm. long. Mercury fulminate crystals of the same size were 
grown by allowing a solution of the explosive in a mixture of alcohol and 880 
ammonia to lose ammonia slowly. A row of lead azide crystals was placed on a sheet 
of armour-plate glass along the line of the camera slit, and struck by a flat brass 
striker weighing 200 g. and falling 40 cm. The instant of impact was recorded by 
an electric spark triggered by the contact of the hammer and a piece of aluminium 
foil. The traces obtained showed irregular intensities along their lengths and very 
short delays between impact and initiation (7 to 16 μsec.). A typical trace is shown 
in figure 40, plate 14. 

When a row of mercury fulminate crystals was spread on glass, and struck by 
a steel striker using the fall of a 225 g. ball through 100 cm. to provide the impact, 
the explosion began as a deflagration. A typical trace is shown in figure 41. The 
burning beginning at A continued for only a short time. The delay between impact 
and initiation was also 100 μsec., while under the same conditions of impact, fine 
crystals exploded and produced light 50 to 70 μsec. after impact. It seems that in 
this case the development of a rapid explosion takes place only when the explosive 
has been crushed and a large surface formed. 


Initiation of solid explosives by hot wires and electric sparks 

The results of experiments on the initiation of explosives by impact were nearly 
all in harmony with the view that hot spots were produced by such methods as 
friction at a grit particle, compression of gas or vapour, or viscous flow of material, 
and that these hot spots were the prime cause of explosion. It was therefore 
important to show that if a hot spot were produced in a layer of explosive, a 
propagating explosion could result. It has long been known that primary explosives 
could be set off by hot wires or by sparks; in fact, it was this property which 
defined them as primary explosives. Unconfined thick trains of mercury fulminate 
initiated by hot wires burn at a low speed before detonation sets in, while no 
initial burning is found in similar experiments with lead azide (Patry 1933). 

An apparatus was designed for the initiation of thin layers of explosives by hot 
wires. A small piece of nichrome wire was soldered at its ends to two flat strips of 
brass. The brass strips were bent to fit on to a piece of armour-plate glass and the 
glass placed on the camera slit so that the wire lay at right angles to the slit. The 
explosive was spread along the glass above the slit, and covered by a sheet of 
laminated Bakelite. By connecting the two brass strips to the terminals of a 6 V 
accumulator the wire was heated to bright red heat and the explosive set off. 

The results obtained were very similar to those obtained in the mechanical 
initiation of the same explosives. Lead azide detonated immediately, mercury 
fulminate burned before detonating and lead styphnate explosions did not develop 
a speed above 700 m./sec. With mercury fulminate and lead styphnate the burning 
stages were of somewhat longer duration than those observed in mechanically 
initiated explosions. 

A parallel series of experiments was carried out in which the explosives were set 
off by electric sparks. The results obtained were identical with the hot wire results, 
a fact which suggests that the main action of the spark is to provide a hot spot and 
bring about thermal initiation. 

A simple apparatus (shown in figure 42) was designed for initiating, by an electric
spark, a thin film of explosive held under confinement. The rod W and the
aluminium foil strip A were connected to the terminals of a 0·06 μF condenser
charged to 1000 V. The explosive film was placed on the glass G on which the
aluminium foil had been stuck, and by applying a load to W the wire and foil were
brought close enough to allow a spark discharge through the explosive above the
camera slit.



The results for lead azide initiated by a hot wire and by a spark are shown in
figures 43 and 44 respectively. The photograph from spark initiation was obtained
by using a very thin layer of this salt, and it revealed a very short but real delay
between the passage of the spark and the onset of detonation. This delay was always
observed, but it was only of the order of 1 to 4 μsec.

The very different behaviour of mercury fulminate is illustrated in figure 45.
Here the propagation began as a slow burning which suddenly accelerated to about
400 m./sec. When the burning reached the discontinuity between the central
Bakelite insulator and the steel block surrounding it, detonation was set up. If
the discontinuity was removed by using a glass insulator in the centre, which was
cemented into the steel, and lapped flat, then traces of the type shown in figure 46
were obtained. The faster deflagration now continued for some distance (2 cm.)
before detonation began. A typical result with the spark initiation of lead styphnate
is shown in figure 47.

If the thin layers of the explosives were unconfined, the burning of mercury
fulminate and lead styphnate was too slow and too weak to be recorded on the
film, but lead azide was detonated immediately and the photographs could only be
distinguished from those obtained with confined layers of the same explosive by the
slightly lower velocity of detonation. Obviously the degree of confinement, i.e. the
pressure during the deflagration stage, determined the rate of propagation of
flame, and this may explain the difference between the photographs obtained by the
impact initiation and spark initiation of mercury fulminate (figures 32 and 45).
During impact the high pressure set up favours an acceleration of the burning
rate, and hence detonation is begun at an earlier stage.

It has long been realized that primary explosives may be initiated by electric
sparks. It has already been shown that nitroglycerine when confined as a thin
film may be initiated by a spark discharge but no evidence was available which
indicated that solid secondary explosives could be initiated in this way. It was
found that P.E.T.N. could not be initiated by heavy sparks unless the explosive
layer was held under compression while the spark was passed. This was done by
turning the screw S shown in figure 42 so that the central portion was held under
compression beneath the insulator H, and the remainder of the explosive film
was left confined but unconstrained. Sparks from a 0·06 μF condenser charged at
100 V were then capable of initiating the explosive. Figure 47 is a photograph
obtained by initiating a film of P.E.T.N. in this way; in this particular experiment
a much heavier spark (1 μF at 2000 V) was used. This photograph may be compared
with the photographs obtained by the impact initiation of the same explosive, 
figures 20 and 21. It is very similar provided the light from the electric spark 
itself is ignored. 
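
As a rough guide to the spark energies involved, the energy stored in a condenser of capacitance C charged to a potential V is E = CV²/2. The sketch below evaluates this for two of the condenser settings quoted in the text (0·06 μF at 1000 V and 1 μF at 2000 V). It assumes that the whole of the stored energy is available to the spark, which is an idealization and not a statement made in the paper; the function name is illustrative only.

```python
def stored_energy_joules(capacitance_farads, voltage_volts):
    """Energy stored in a charged condenser, E = C * V**2 / 2."""
    return 0.5 * capacitance_farads * voltage_volts ** 2


# Condenser settings quoted in the text (capacitance in farads, potential in volts).
light_spark = stored_energy_joules(0.06e-6, 1000.0)
heavy_spark = stored_energy_joules(1.0e-6, 2000.0)

print(f"0.06 uF at 1000 V: {light_spark:.3f} J")
print(f"1 uF at 2000 V:    {heavy_spark:.3f} J")
```

On this idealized estimate the heavier spark stores between one and two orders of magnitude more energy than the lighter one.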

Discussion
Liquid explosives 

The experiments with nitroglycerine have confirmed the earlier work. The 
explosion of a thin film of this substance could be initiated at a rapidly compressed 
gas pocket, or at an electric spark discharge. In each case the explosion began as 
a burning, and detonation developed at a later stage. If the initiation was effected 
by a cavity striker the first stage of the explosion was for some time (up to 60 μsec.)
confined to the small quantity of liquid sealed off by the striker. The burning
velocity was of the order of 10 to 20 m./sec. or less. When the explosion burst out
of the cavity it usually produced a rapid burning in the bulk of the film and the
explosion was later transformed into a detonation at about 2200 m./sec. With spark
initiation or initiation by the compression of an air bubble trapped in the main
body of the film during impact, only two stages of explosion could be distinguished,
the initial rapid burning, and the subsequent detonation. 

It has now been shown that the rapid burning stage does not show a constant
propagation speed, but the flame front accelerates as it advances. The distance through
which the flame travels before detonation begins depends on the experimental
conditions. The high initial pressure in the film during impact does not favour
a transition to detonation, and the rapid burning may continue for at least 2·5 cm.
In spark initiated explosions where the film is at atmospheric pressure, detonation
usually occurs before the flame of the rapid burning stage has propagated more
than 0·6 cm.

Methyl nitrate behaved in a manner similar to nitroglycerine although the rapid 
burning stage showed a vibrating front. Diglycerol tetranitrate was easily initiated 
by impact if gas spaces were present, but the explosion did not propagate far unless 
the whole film was first filled with tiny air bubbles. If this was done the onset of 
detonation soon occurred and the explosion was complete. 

The ‘dark space’ (which was usually observed between the rapid burning and
the detonation, and sometimes corresponded to an area on which undecomposed 
explosive was left after the explosion) became larger as the distance to which the 
burning continued increased. If the transition from burning to detonation actually 
occurred at a single point on the flame front the chances of this point being on the 
slit would be very small, and would decrease as the radius of the flame front in- 
creased. Since the detonation often travelled at speeds five times as great as the 
rapid burning it might be expected to reach the slit first and thus give a dark space 
in the high speed photographs. If the initiation is located on the slit by means of 
a grit particle or an air bubble, and the burning time is short, the dark space is 
negligibly small. 

The burning speed of nitroglycerine at low pressures has been determined by
Andreev (1946) and found to be a linear function of the pressure. At higher pressures
Muraour (1942) has shown that the burning rate of propellants is proportional to the
pressure. During impact very high pressures are developed, but they are unlikely 
to exceed the flow pressure of the metal. Calculating the burning speed of nitro- 
glycerine by extrapolating Andreev’s figures to 10^ atmospheres (the approximate 
flow pressure of hard brass) we obtain a theoretical burning speed of about 10 m./sec. 
which is of the same order as the speed observed inside the brass cavity strikers. 
However, when the explosion flame bursts out into the main film of explosive, it
attains speeds of 400 to 600 m./sec. although blast patterns on the brass confining 
surfaces show that the pressure is not much greater than 10^ atmospheres. This 
suggests that the whole front must be undergoing a mass movement which is 
superimposed on the true burning speed, and attains speeds 40 to 60 times as great.
In other words the products and the explosive are being forced away from the 
centre of the explosion. 
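
The arithmetic behind the factor of 40 to 60 can be set out explicitly. The sketch below uses only the speeds quoted above (a true burning speed of about 10 m./sec. and observed flame speeds of 400 to 600 m./sec.). The estimate of the mass-movement component assumes that the two speeds add simply, which is an illustrative simplification and not a result stated in the paper.

```python
true_burning_speed = 10.0         # m/sec, extrapolated burning speed quoted in the text
observed_speeds = (400.0, 600.0)  # m/sec, flame speeds observed in the main film

# Ratio of the observed flame speed to the true burning speed.
ratios = [v / true_burning_speed for v in observed_speeds]

# Mass-movement component, ASSUMING simple addition of the two speeds.
mass_movement = [v - true_burning_speed for v in observed_speeds]

for v, r, m in zip(observed_speeds, ratios, mass_movement):
    print(f"observed {v:.0f} m/sec: ratio {r:.0f}, mass movement about {m:.0f} m/sec")
```

On this reading almost the whole of the observed flame speed is to be attributed to the bodily movement of the front.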

It has in fact been shown by direct photographic measurement that liquid between
the impacting surfaces may easily attain a speed of 100 m./sec. even when no
explosion occurs (Bowden, Mulcahy, Vines & Yoffe 1947).

As the mass movement accelerates it may attain speeds approaching the velocity 
of sound in the thin film, and will therefore build up a shock front. It also attains 
a speed of the same order and same direction as the streaming velocity of the 
products behind a low velocity detonation wave. The development of low velocity 
detonation from the rapid movement of the flame front is therefore not surprising. 




The solid secondary explosives 

The results described in this paper all show how closely the behaviour of solid 
explosives resembles that of liquids (see also Bowden & Gurton 1948). The point 
of initiation in a layer of P.E.T.N. is located at a hot spot source which may be
a grit particle, a compressed gas pocket, or, under suitable conditions, an electric 
spark. 

The first stage of explosion is a rapid burning similar to that obtained in experi-
ments with liquids. It shows an accelerating front, but no spontaneous change from 
burning to detonation is observed in explosions initiated by impact unless some 
discontinuity is present. However, if the explosive which is impacted is surrounded 
by an unimpacted but confined layer of the same material, detonation occurs in 
the unimpacted zone. The transition from burning to detonation takes place at the 
discontinuity, that is, at the edge of the impacted area. 

Cyclonite and tetryl are similar to P.E.T.N. in their behaviour, but the flame 
rates in the rapid burning stages are much lower. With these explosives the 
transition from burning to detonation was not observed even when a discontinuity 
was present. Experiments showed however that if the initiation is sufficiently
powerful it is possible to get a detonation which continues to propagate. If a layer 
of lead azide crystals is exploded in contact with a thin confined layer of cyclonite 
or tetryl, a propagating detonation is set up. 

Once again in solid explosives the rapid burning is too fast to be accounted for 
by the rate of combustive consumption of the explosive under pressure. It is
suggested therefore that it again represents a mass movement of the flame front,
that is a movement of the gas products away from the centre of explosion. The
combustion in the impacted zone builds up a high pressure, but no steep shock front
is developed since the pressure of impact is itself very high. However, when the
explosion bursts out from the central impacted zone the surrounding explosive
receives a sudden jet of high pressure gas at a very high temperature. This would
be expected to produce an intense shock in the explosive, and apparently in the 
case of P.E.T.N. this shock brings about detonation. 

The solid primary explosives 

The point of initiation in the primary explosives lead azide, mercury fulminate
and lead styphnate is normally located at a hot spot source, e.g. a hot wire, an
electric spark or an impacted grit particle. However, an occluded air pocket does
not appear to act as an initiator in impact. This is an interesting observation.
With the secondary explosives studied the melting point is below the ‘hot spot
decomposition temperature’; they can be melted, and under impact can flow and
seal off air pockets. The primary explosives cannot be melted. Their ignition
temperatures are below their melting points. They are unlikely to melt and flow during
impact, and the air pocket may be difficult to seal off. However, the high rates of
friction and shear may lead to the development of hot spots of sufficient
temperature to initiate explosion, without producing melting. Thus, before the air
pockets can be sealed the explosion may begin from hot spots produced by
friction between crystals or between the confining surfaces and the explosive
crystals. With those secondary explosives which melt without appreciable
decomposition, the temperatures of the hot spots developed by intercrystalline friction
cannot exceed the melting points and therefore will not be able to initiate
explosion.

The first stage of propagation, for both mercury fulminate and lead styphnate, 
is a rapid burning which usually shows an accelerating flame front. Pure lead azide 
does not show this stage of explosion, but detonation begins very near to the hot 
spot source. However, in spark initiation there is a short delay (1 to 4 μsec.)
between the passage of the spark and the beginning of detonation, and it is probable 
that this represents a very short lived burning stage. 

Detonation usually begins in mercury fulminate at a mechanical discontinuity 
such as the junction between the impacted and unimpacted zones, but this is not 
always the case. It is probable that the mechanism by which a detonation wave is 
set up is similar in this case to the mechanism suggested for initiating detonation 
in liquid explosives. The rapid burning may involve the mass movement of material 
away from the centre of explosion which sets up a shock front in the undecomposed 
material. 

In lead styphnate the average velocity with thin films could attain a value of 
about 700 m./sec. and the flame front did not show a steady speed. It is unlikely 
that this is a detonation. 

Delay between impact and explosion 

In the experiments which were designed to measure the delay between the first
moment of impact and the first appearance of explosion flame, it was shown that
the delay depended on the impact energy and was almost identical for P.E.T.N.,
cyclonite, and tetryl if the same conditions of impact were used (figures 13, 26
and 27, plates 11 and 12). For P.E.T.N. for example, under conditions of row 1,
table 1, the delay was 131 μsec., while at higher rates of impact (row 2) delays of
about 65 μsec. were obtained. With other secondary explosives such as cyclonite
and tetryl the delays were very similar. With tetryl, for example, under conditions
of row 1, the delay was 140 μsec., with cyclonite it was 130 μsec. Under the conditions
of row 2 the delay with cyclonite was 61 μsec. This result is different from that
obtained by Rideal & Robertson (1948) who found delay times of the order of
230 to 340 μsec.

They measured the delay between impact and the arrival of ionized gas at a point 
2 mm. outside the explosion film and found that this delay increased as the
sensitivity of the explosive decreased, that is, in the order P.E.T.N., cyclonite, tetryl.
However, the results of Rideal & Robertson are not incompatible with our obser- 
vations, for their delay times included both the mechanical delay between impact 
and onset of explosion, and the time taken for the explosion to reach the edge of the 
impacted zone, and cause the ionizing flame to shoot out to the receiver. It has 
already been shown that the initial speed of propagation is much slower in cyclonite 
and tetryl than it is in P.E.T.N. The values obtained by Rideal & Robertson are
therefore generally higher than the values obtained by high speed photography,
and increase as the flame rate decreases in the explosive investigated. The
considerably shorter delay when grit was added to the explosive (reported by Rideal
& Robertson) was not observed. The delay times which we have observed for the 
primary explosives, lead azide, mercury fulminate and lead styphnate were much 
shorter and more variable than the delays for secondary explosives. This provides 
further evidence for a different mechanism of initiation in these primary ex- 
plosives, and again suggests that intercrystalline friction may be producing hot 
spots before the pressure has been raised high enough to produce any initiating hot 
spot by the adiabatic compression of gas. 
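
The reconciliation with Rideal & Robertson's figures can be put as a simple sum: the delay recorded by an external detector is the mechanical delay before explosion begins plus the time the flame takes to reach the edge of the impacted zone. The sketch below is illustrative only. The 131 μsec. delay and the 435 m./sec. flame speed are values quoted in this paper for P.E.T.N.; the path length and the slower flame speed are hypothetical numbers chosen for illustration, and the time for the ionized gas to cross the gap to the receiver is ignored.

```python
def detector_delay_usec(mechanical_delay_usec, path_cm, flame_speed_m_per_sec):
    """Delay seen by an external detector: mechanical delay plus flame transit time."""
    transit_usec = (path_cm * 1e-2) / flame_speed_m_per_sec * 1e6
    return mechanical_delay_usec + transit_usec


PATH_CM = 0.5  # HYPOTHETICAL distance from the initiation point to the edge of the impacted zone

fast_flame = detector_delay_usec(131.0, PATH_CM, 435.0)  # delay and speed quoted for P.E.T.N.
slow_flame = detector_delay_usec(131.0, PATH_CM, 100.0)  # HYPOTHETICAL slower flame speed

print(f"fast flame: {fast_flame:.1f} usec, slow flame: {slow_flame:.1f} usec")
```

For the same mechanical delay, the detector delay grows as the flame speed falls, which is the trend described above.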

Large crystals of P.E.T.N. were initiated only after delays of the order of 400
to 500 μsec. which were 3 to 4 times as long as the delays observed with thin layers
of small crystals. This indicates that before initiation could take place the explosive
had to be crushed and compressed. The very different result with large crystals 
of lead azide which showed a very short delay may be explained by the hypothesis 
that the explosion can be initiated by intercrystalline friction and consequent hot 
spot formation. 


Low-velocity detonation 

The detonation stages observed in thin films of explosives are characterized by 
velocities varying from 1100 to 2600 m./sec. (neglecting the propagation of lead
styphnate at 700 m./sec. which is probably a rapid burning). These values are 
much lower than those observed in large scale detonations (namely, 4600 to 
8000 m./sec.) and correspond with the low velocity detonation characteristic of 
nitroglycerine explosives which has also been observed in methyl nitrate, nitro- 
glycol and recently in solid explosives like T.N.T. and tetryl (Jones & Mitchell 
1948). If the expansion of the gas product is taken into account, these low velocities 
can be justified on hydrodynamic-thermodynamic grounds without postulating any 
detailed mechanism for the propagation. 

If the Abel equation of state

$$p(V - b) = n_2 R T_2 \tag{1}$$

is applied to the reasoning of the Chapman-Jouguet hydrodynamic theory of detonation
the following expression for the detonation velocity is obtained (Paterson 1948)

$$D = \frac{V_1}{V_1 - b}\,\frac{\gamma + 1}{\gamma}\,\sqrt{\gamma n_2 R T_2} \tag{2}$$

where $V_1$ is the volume of a gram of the explosive, $b$ is the covolume, $n_2$ the number
of moles of explosion products per gram, $R$ the molar gas constant, $T_2$ the detonation
temperature and $\gamma = 1 + n_2 R / C_v$. Here $C_v$ is the specific heat of the products at
constant volume.

This formula would apply only if no expansion of the products took place before
the reaction was complete. It can be shown by simple reasoning that if an expansion
does occur the net effect on equation (2) is to increase the effective value of
$V_1$ by a factor depending on the amount of expansion, and to decrease $T_2$ by a much
smaller factor (never more than 10 %) so that if the expansion is large (as it
probably is when thin layers of explosive detonate), $V_1$ becomes so large that $b$ is
negligible in comparison. Then the term $V_1/(V_1 - b)$ tends to unity, and so equation
(2) becomes

$$D = \frac{\gamma + 1}{\gamma}\,\sqrt{\gamma n_2 R T_2}, \tag{3}$$

that is

$$D = \frac{\gamma + 1}{\gamma}\,c,$$

where $c$ is the velocity of sound in the products. Since $(\gamma + 1)/\gamma$ is slightly less than
2 we should expect to obtain a reasonably stable minimum velocity of detonation 
which is approximately twice the velocity of sound in the products. In addition 
to this there will, of course, be the normal maximum hydrodynamic velocity. 
Calculations based on equation (3) have given the values for the stable minimum 
velocity shown in column 3 of table 2. Experimental values in column 2 were
obtained by plotting a frequency curve of a large number of determinations and
recording the commonest value.

It is interesting to see that the results calculated by this simple theory for the
velocity of detonation in thin films of explosives are in reasonable agreement with
the observed values for both solids and liquids.
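
The limiting behaviour of the formula can be checked numerically. The sketch below takes equation (2) in the form D = [V₁/(V₁ − b)] [(γ + 1)/γ] √(γ n₂ R T₂) and equation (3) as its limit when b is negligible compared with V₁. SI units are used, so n₂ is in moles per kilogram rather than per gram. The numerical values of γ, n₂ and T₂ are hypothetical and chosen only for illustration; they are not the values used to calculate table 2, and the function names are assumptions of this sketch.

```python
import math

R_GAS = 8.314  # molar gas constant, J/(mol K)


def sound_speed(gamma, n2, t2):
    """Velocity of sound in the products, c = sqrt(gamma * n2 * R * T2), in m/sec."""
    return math.sqrt(gamma * n2 * R_GAS * t2)


def detonation_velocity(v1, b, gamma, n2, t2):
    """Equation (2): detonation velocity with the covolume correction V1/(V1 - b)."""
    return v1 / (v1 - b) * (gamma + 1.0) / gamma * sound_speed(gamma, n2, t2)


def minimum_detonation_velocity(gamma, n2, t2):
    """Equation (3): limit of equation (2) when b is negligible compared with V1."""
    return (gamma + 1.0) / gamma * sound_speed(gamma, n2, t2)


# HYPOTHETICAL illustrative values, not taken from the paper.
GAMMA, N2, T2 = 1.25, 30.0, 4000.0  # dimensionless, mol/kg, K

c = sound_speed(GAMMA, N2, T2)
d_min = minimum_detonation_velocity(GAMMA, N2, T2)
print(f"c = {c:.0f} m/sec, D(min) = {d_min:.0f} m/sec, ratio = {d_min / c:.2f}")
```

Since γ is greater than 1, the ratio D/c given by equation (3) always lies between 1 and 2, in keeping with the statement that the minimum velocity is somewhat less than twice the velocity of sound in the products.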

Table 2. Detonation velocities in thin films

explosive         velocity observed    velocity calculated
                  (m./sec.)            (m./sec.)
nitroglycerine    2200                 2240
P.E.T.N.          1450                 2380
cyclonite         2300                 2340
tetryl            1500                 2060

In P.E.T.N. and tetryl it is probable that the explosion products are different
from those obtained in a high velocity detonation. The fact that the velocity in
nitroglycerine (Mulcahy & Vines 1947) and in P.E.T.N. is not greatly increased by
considerable changes in film thickness and in bulk density provides further support
for this theory.
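
The degree of agreement in table 2 can be expressed as the ratio of the observed to the calculated velocity. The sketch below uses only the figures in the table; the variable names are illustrative.

```python
# (observed, calculated) detonation velocities in m/sec, from table 2.
table_2 = {
    "nitroglycerine": (2200, 2240),
    "P.E.T.N.": (1450, 2380),
    "cyclonite": (2300, 2340),
    "tetryl": (1500, 2060),
}

ratios = {name: obs / calc for name, (obs, calc) in table_2.items()}

for name, ratio in ratios.items():
    print(f"{name:15s} observed/calculated = {ratio:.2f}")
```

The ratio is within a few per cent of unity for nitroglycerine and cyclonite, and appreciably lower for P.E.T.N. and tetryl, the two explosives for which different explosion products are suspected.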

Although the modified hydrodynamic theory gives the correct value for the 
detonation velocity, it does not explain how the explosive reactions are completed 
in the short times available. This work provides some evidence for two possible 
mechanisms for this which may operate under different conditions. 

In high velocity detonation the shock wave is sufficiently intense to raise the
temperature of the condensed explosive to a high value by adiabatic compression
(for example 3000° C with nitroglycerine (Ratner 1947)) so that the reaction can
continue. In the low velocity detonation, however, the shock wave could not raise
the temperature more than about 40° C. It is suggested therefore, that in the
detonation of nitroglycerine and methyl nitrate, it is the mechanical action of the 
shock wave which facilitates propagation. That is to say the shock front breaks up 
the liquid into small droplets which are thrown into the reaction zone and present 
a large burning surface and hence a rapid reaction can ensue. Evidence for a 
mechanism of this type is provided by the marked effect which an increase in 
viscosity has on the ease of the propagation. Thus it was not found possible to 



369 


Birth and growth of explosion. 

detonate a thin film of the viscous diglycerol tetranitrate, and it has been shown 
elsewhere that air free nitroglycerine made viscous by the addition of a very small 
amount of nitrocotton will not detonate at low velocity. 

When the explosive is present as a thin film between solid surfaces, its break up 
into droplets may be facilitated by the separation of the surfaces. This is possible 
because the speed of sound in the solid is greater than the detonation velocity so 
that elastic waves in the metal due to the shock of the explosion will travel ahead 
of the detonation front. Some indication that standing waves might be set up in 
the metal which confines the explosive has been obtained in earlier work (Bowden, 
Eirich, Mulcahy, Vines & Yoffe 1943) where the blast marks of the explosion 
showed an alternating pattern on the confining metal. This break up of the ex- 
plosive into droplets would also offer a reasonable explanation of the pitting and 
pock marks produced on brass during the detonation. 

There is, however, another mechanism by which the high rate of reaction may 
be maintained. If small bubbles of air are present or are introduced into an 
explosive, although the detonation pressure would raise the temperature of the 
liquid or solid phase less than 40 degrees, the adiabatic compression of the gas 
bubbles would produce local hot spots of several thousands of degrees. These hot 
spots would become new explosion centres. If the air was distributed through the 
explosive, the detonation wave would have at its head a shock front which would 
therefore become a wave of initiation of new explosions. The detonation of a thin 
film of diglycerol tetranitrate which occurs only when air bubbles have been
introduced supports this view. The well known phenomenon of ‘dead pressing’ in solid
explosives, and the insensitiveness of old blasting gelatine and of cast explosives
would all, on this view, be due simply to the removal of air. Figure 22, plate 12,
shows the effect of dead pressing. Here the central region burned, but the explosion 
did not reach the unimpacted area above the slit before detonation had been set 
up. However, the detonation wave did not enter the impacted area, presumably 
because the pressure of impact removed most of the air and compressed the rest. 
In high melting primary explosives it is possible that hot spots formed by crystal 
fracture and intercrystalline friction in the detonation wave front may be hot
enough to act as new explosion centres. Thus, there is evidence that the formation
of hot spots, which has been shown to be the cause of initiation of explosion by
impact and friction, is often the means by which low velocity detonation is
maintained. Once again the commonest source of hot spots is the rapid compression of
gas pockets, which are present in the explosive, but with the primary explosives 
hot spots may be formed by friction. 
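
The order of magnitude of the temperature reached in a compressed gas bubble can be illustrated with the ideal adiabatic relation T₂ = T₁ (p₂/p₁)^((γ − 1)/γ). The sketch below assumes an ideal gas with constant γ = 1·4 (air), an initial temperature of 293° K, and no heat loss to the surrounding explosive. The pressure ratios are hypothetical values chosen for illustration and are not taken from the paper, and the function name is an assumption of this sketch.

```python
def adiabatic_temperature(t1_kelvin, pressure_ratio, gamma=1.4):
    """Final temperature of an ideal gas compressed adiabatically by pressure_ratio."""
    return t1_kelvin * pressure_ratio ** ((gamma - 1.0) / gamma)


# HYPOTHETICAL compression ratios, for illustration only.
for ratio in (1e2, 1e3, 1e4):
    t2 = adiabatic_temperature(293.0, ratio)
    print(f"pressure ratio {ratio:.0e}: final temperature about {t2:.0f} K")
```

Under these idealized assumptions a compression ratio of several thousand is needed before the gas reaches temperatures of several thousand degrees; a real bubble would lose heat to the surrounding explosive and would reach a lower temperature.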

We thank the Royal Society for a grant for the development of the high speed 
camera, and the Ministry of Supply (Air) for support and for other equipment. 
Our thanks are also due to Dr James Taylor and Imperial Chemical Industries 
(Nobel Division) from whose Research Department one of us (O.A.G.) was seconded
to this laboratory. 





References

Andreev, K. K. 1946 C.R. Acad. Sci. U.R.S.S. 51, 29, 123.
Bowden, F. P. & Gurton, O. A. 1948 Nature, 161, 348.
Bowden, F. P., Mulcahy, M. F. R., Vines, R. G. & Yoffe, A. 1947 Proc. Roy. Soc. A, 188, 291, 311.
Bowden, F. P., Eirich, F., Mulcahy, M. F. R., Vines, R. G. & Yoffe, A. 1943 Res. Bull. Coun. Sci. Industr. Res. Aust. no. 173.
Jones, E. 1938 Proc. Roy. Soc. A, 120, 603.
Jones, E. & Mitchell, D. 1948 Nature, 161, 98.
Mulcahy, M. F. R. & Vines, R. G. 1947 Proc. Roy. Soc. A, 191, 210, 226.
Muraour, H. 1942 Chim. et Industr. 47, 602.
Muraour, H. 1943 Chim. et Industr. 50, 105.
Paterson, S. 1948 Research, 1, 221.
Patry, M. 1933 Thesis, Nancy.
Ratner, S. 1947 Acta Physicochim. U.R.S.S. 22, 357.
Rideal, E. K. & Robertson, A. J. B. 1948 Proc. Roy. Soc. A, 195, 135.
Vines, R. G. 1947 Nature, 160, 400.
Yoffe, A. 1948 Nature, 161, 349.


Description of Plates 11 to 14
Plate 11 

Figure 2. High speed camera photograph of an explosion of a ring of nitroglycerine set off
by impact. The point of initiation A corresponds to a compressed air bubble, and the
explosion AB spreads as a rapid accelerating burning at 180 to 650 m./sec.

Figure 3. Photograph similar to figure 2 for which the explosive was spread as a ring with two
projections. The rapid burning AB gives rise to a much faster detonation at B.

Figure 4. Explosion of a confined film of nitroglycerine by the impact of a cavity striker.
Initiation occurs inside the cavity at A, and the slow burning AB (c. 10 m./sec.) is followed
by a rapid burning in the confined film BC. Detonation CD at about 2200 m./sec. is the
final stage in the explosion. Detonation waves CB travel back from the points at which
detonation is first observed.

Figure 5. Explosion of nitroglycerine under conditions similar to those used for figure 4. 
In this photograph the rapid burning stage is not apparent. 

Figure 6. Photograph similar to figure 4, using a hemispherical ended cavity striker. The
rapid burning stage BC is somewhat obscured by optical effects associated with the
geometry of the apparatus.

Figure 7. Explosion of confined nitroglycerine initiated by an electric spark A, showing the
rapid burning stage AB followed by detonation BC.

Figure 8. Explosion of a confined film of nitroglycerine initiated by the detonation of a few 
crystals of lead azide at A. Only the detonation stage AC is observed. 

Figure 9. Explosion of a ring of methyl nitrate initiated by flat impact. Initiation occurs at
a confined gas pocket A, and the rapid burning stage AB shows an accelerating, but
vibrating flame front. BC represents the detonation stage which occurred after the
burning speed had reached 1000 m./sec.

Figure 10. Explosion of a confined aerated film of diglycerol tetranitrate by the impact of
a cavity striker. AB represents the slow burning inside the cavity (53 μsec.). Between
B and C (36 μsec.) there was no sign of explosion. The central region of the general
explosion is obscure, but the detonation stage DE can be clearly distinguished.




Figure 11. Explosion of pentaerythritol tetranitrate (P.E.T.N.) by flat impact on a layer of
crystals. The beginning of the spark trace S represents the first instant of impact. The
first appearance of the explosion light occurs at A but there is no single point of initiation.
The impact was provided by a 530 g. ball falling 155 cm.

Figure 12. Explosion of P.E.T.N. initiated by impact with a grit particle at A. The explosion
began at the grit particle and spread as a rapid burning AB.

Figure 13. Explosion of a layer of P.E.T.N. spread as a ring, and initiated by a flat impact.
Initiation occurred at the gas pocket A and spread as a rapid burning AB. The delay
between impact and explosion was similar to that observed in figure 11, for the conditions
of impact were the same. The average flame speeds were 435 and 360 m./sec.

Plate 12 

Figure 15. Explosion of P.E.T.N. spread as a ring and impacted by a flat glass striker.
S is a spark trace produced at the instant of impact and about 65 μsec. later a beam of
light passing through the central gap in the explosive is extinguished at C. The explosion
begins at A less than 10 μsec. later, and spreads as a rapid burning AB.

Figure 16. Explosion of a single crystal of P.E.T.N. initiated by impact.

Figure 17. Explosion of a ring of P.E.T.N. by the impact of a 0·48 cm. diameter striker.

Figure 18. Explosion of a ring of P.E.T.N. by the impact of a 1·27 cm. diameter striker.

Figure 19. Explosion of a ring of P.E.T.N. by the impact of a 2·64 cm. diameter striker.

Figure 20. Explosion of P.E.T.N. initiated by impact on a ring of explosive propagated into
a surrounding layer of confined but unimpacted explosive. In the impacted area only the
rapid burning stage AB is observed but the surrounding explosive detonates BC.
Detonation waves BD travel back into the burnt gas of the first stage of explosion.

Figure 21. The same as figure 20, showing distinct acceleration in the first stage, and very 
little ‘dark space’ at B. 

Figure 22. Explosion of P.E.T.N. initiated by impact on a continuous layer surrounded by
a layer of confined but unimpacted explosive. The first appearance of explosion light in
the impacted region occurs at A. Between A and B (where detonation is set up) there is
a distinct ‘dark space’.

Figure 23. Detonation in a layer of P.E.T.N. 0·1 mm. thick of density 0·5 g./cm.³ at 1100
and 1200 m./sec.

Figure 24. Detonation in a layer of P.E.T.N. 0·5 mm. thick of density 0·75 g./cm.³ at 1575
and 1680 m./sec.

Figure 25. Initiation of a thin layer of P.E.T.N. by an explosion generated shock wave. The 
detonation BC gave rise to a shock wave in a layer of air OE which on meeting another 
layer of explosive set up an immediate detonation. 

Figure 26. Explosion of cyclonite initiated by flat impact on a ring of the explosive. The
explosion developed is a burning which is much slower than the burning displayed by
P.E.T.N. S represents the instant of impact and A the point of initiation corresponding
to a compressed gas pocket.

Figure 27. Explosion of tetryl initiated by flat impact on a ring of the explosive. The explo- 
sion develops as a rapid burning, but again the rate of burning is much slower than that 
observed in P.E.T.N. A, the point of initiation is located at a compressed gas pocket. 

Figure 28. An attempt to propagate the explosion of cyclonite into a surrounding layer of 
the explosive. BC represents a short lived burning developed in the surrounding layer,
but no detonation is observed. 


Figure 29. A similar unsuccessful attempt with a layer of tetryl. 





Plate 13 

Figure 30. Detonation of cyclonite by a layer of lead azide. AB represents the detonation
of lead azide, and BC the detonation of a confined layer of cyclonite.

Figure 31. Detonation of tetryl by a layer of lead azide. AB represents the detonation of
lead azide, and BC the detonation of a confined layer of tetryl.

Figure 32. Explosion of mercury fulminate initiated by impact. The explosion began as 
a burning AB, 270 to 300 m./sec. and at the edge of the impacted region transformed into 
a detonation BC at about 1600 m./sec. The light produced by the detonation of mercury 
fulminate was always strongly actinic and in consequence the traces were never very 
sharp. In this particular photograph the detonation BC shows some deceleration. This 
behaviour was exceptional and was due to the fact that in this experiment the explosive 
layer was so thin that all the crystals were not in contact. 

Figure 33. Explosion of lead azide initiated by impact. No initial rapid burning can be 
distinguished. 

Figure 34. Explosion of lead styphnate by impact on a ring-like layer. The point of initiation
A is not located at the gas pocket. The explosion propagates as a rapid burning AB.

Figure 35. Explosion of a ring of mercury fulminate initiated by impact. No single point of 
initiation can be distinguished. 

Figure 36. Impact initiation of a continuous layer of mercury fulminate containing a grit
particle at A. From A the explosion spreads as a rapid burning AB which gave rise to
a detonation BC (1450 m./sec.).

Figure 37. Initiation of lead azide by impact in the presence of a grit particle. Initiation
occurs at the grit particle A, but only the detonation stage ABC is observed. The speed
was about 2300 m./sec.

Figure 38. Explosion of lead styphnate initiated by impact in the presence of a grit particle
A. From the point of initiation A, the explosion spreads as a rapid burning AB which
continues in the unimpacted region at roughly the same speed BC.

Figure 39. Initiation of lead styphnate by lead azide. The detonation of lead azide AB gives 
rise to an explosion BC in the lead styphnate at 700 m./sec. which is much slower than the 
expected detonation velocity. 


Plate 14 

Figure 40. Explosion of large crystals of lead azide initiated by flat impact. Note the very
short delay (< 10 μsec.) between impact S and explosion A.

Figure 41. Explosion of large crystals of mercury fulminate initiated by impact. The first
stage of explosion is a short-lived rapid burning AB (350 m./sec.) followed by detonation
BC.

Figure 43. Explosion of unconfined lead azide initiated by a hot wire at A. Only the detonation
stage is observed.

Figure 44. Explosion of a very thin layer of lead azide by a spark A. There is a short delay
between the spark and the onset of detonation.

Figure 45. Explosion of mercury fulminate initiated by a spark S. A slow burning SA
accelerates to a rapid burning AB from which a detonation BC begins at the discontinuity
B.

Figure 46. Similar to figure 45, but the onset of detonation occurs later, and not at a known
mechanical discontinuity.

Figure 47. Initiation of lead styphnate by a spark S. The first part of the explosion is not
recorded.

Figure 48. Explosion of P.E.T.N. initiated by a spark S while held under a heavy load.
AB represents rapid burning in the constrained region. BC represents detonation.




[Plate 11. Bowden and Gurton, Proc. Roy. Soc. A, volume 198. Photographic records.]

[Plate 13. Bowden and Gurton, Proc. Roy. Soc. A, volume 198. Photographic records.]












Influence of entrapped gas on initiation of explosion 
in liquids and solids 

By A. Yoffe

Laboratory for the Physics and Chemistry of Rubbing Solids,
Department of Physical Chemistry, Cambridge

(Communicated by F. P. Bowden, F.R.S. Received 17 January 1949)

[Plate 15]


Further evidence has been obtained for the view that the initiation of explosion by impact in 
liquids is due to the compression and adiabatic heating of trapped gas. It is shown that the 
sensitivity of an explosive is very dependent upon the pressure ratio. Flat impact experiments
on nitroglycerine spread as a ring on a flat anvil show that the explosion efficiency,
which is high when the initial gas pressure is 1 atm., is reduced to zero when the initial gas
pressure is c. 30 atm. A high explosion efficiency is still observed when the initial air pressure
is less than 10^^ mm., and it is suggested that under these conditions the initiation is due to the
compression of the nitroglycerine vapour itself present at a pressure of c. 10“* mm. It is
further suggested that the explosion begins in the vapour phase. 

The behaviour of some solid secondary explosives such as P.E.T.N. and cyclonite shows
many similarities to that of the liquid explosives. When spread as a ring rather than as a
uniform film of crystals, an increase in explosion efficiency is again observed. When the
impacts are carried out at an initial gas pressure of 100 atm. there is a considerable reduction
in explosion efficiency. The striker must be sufficiently hard to make the explosive flow
plastically so that gas pockets can be sealed off. Again it would appear that the adiabatic
compression of trapped gas initiates the explosion. Even when spread as a continuous film of
crystals, the air spaces which are present between the crystals may be sealed off by local 
melting or plastic flow during the impact and the adiabatic compression of these gas spaces 
may initiate the explosion. 


Introduction


Previous work by Bowden, Mulcahy, Vines & Yoffe (1947) has shown that the
sensitivity of liquid explosives to impact is greatly increased if small gas bubbles 
are trapped in the explosive during impact. 

The initiation of explosion is of a thermal nature and is due to the adiabatic
compression and heating during impact of the trapped bubbles. If $p_1$ is the initial
pressure inside the gas bubble and $T_1$ its temperature, the final temperature $T_2$
reached inside the bubble when the final pressure has risen to $p_2$ is given by

$$T_2 = T_1 \left(\frac{p_2}{p_1}\right)^{(\gamma-1)/\gamma}. \qquad (1)$$

The temperature rise $T_2 - T_1$ depends on the pressure ratio $p_2/p_1$ and on $\gamma$, the ratio
of the specific heats.
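
Equation (1) can be evaluated numerically with a minimal sketch such as the following. The function name, the initial temperature of 293 K and the pressure ratios are illustrative assumptions rather than values from the experiments, and the gas is treated as ideal with a constant γ.

```python
def adiabatic_final_temperature(t1_kelvin, p1, p2, gamma):
    """Equation (1): T2 = T1 * (p2 / p1) ** ((gamma - 1) / gamma).

    p1 and p2 may be in any pressure unit, provided both use the same one.
    """
    if t1_kelvin <= 0 or p1 <= 0 or p2 <= 0 or gamma <= 1:
        raise ValueError("need T1, p1, p2 > 0 and gamma > 1")
    return t1_kelvin * (p2 / p1) ** ((gamma - 1.0) / gamma)


if __name__ == "__main__":
    # Assumed values for illustration only: room temperature and round ratios.
    for ratio in (10, 100, 1000):
        t2 = adiabatic_final_temperature(293.0, 1.0, float(ratio), 1.4)
        print(f"p2/p1 = {ratio:5d}: T2 = {t2:6.0f} K")
```

For a given pressure ratio a gas with a smaller γ reaches a lower final temperature, which is the basis of point (ii) in the list of evidence.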

The main evidence which was advanced to support the view that initiation is 
due to the adiabatic heating of trapped gas was 

(i) the decrease in sensitivity when the small gas bubbles are eliminated,

(ii) the decrease in explosion efficiency when gases are used which have a value
for γ lower than that for air,

(iii) the very short time from impact to explosion and the location of the point
of initiation at a compressed gas bubble.

This paper describes further investigations on the mechanism of initiation. The 
effects of the initial pressure and of the nature of the included gas have been 
studied more closely. In particular, if the incidence of explosion is related to $T_2$
of equation (1) then an increase in $p_1$ should lower the explosion efficiency. This
point was investigated with an apparatus in which impact experiments could be
carried out under pressures up to 100 atm. Another apparatus was constructed for
carrying out impacts at pressures down to 10”® mm. and which could also be filled
with different gases.

A simple method of including the gas in the explosive was used: the liquid (or
solid) is spread as a small ring on a flat anvil (see figure 1) and when this is struck
with a flat hammer the small amount of gas in the centre is trapped and compressed.
The initial volume of the gas is usually c. 5 x 10”® ml. and regular explosions
may be obtained with an impact energy of 300 to 500 g.cm.



Figure 1. Nitroglycerine spread as a ring. (magn. x 2.)

The investigations, hitherto confined to liquids, have been extended to solids. 
It has been found that the behaviour of such solid substances as P.E.T.N. and 
cyclonite is similar to that of nitroglycerine and other liquids. The paper is divided 
into two sections. The first deals with the initiation of explosion by impact in 
liquids, and the second describes initiation experiments with solid explosives. 

I. Liquids 

Impact experiments at high initial gas pressures 
Further proof that the initiation by gentle impact of explosion in nitroglycerine
is due to the compression and adiabatic heating of trapped gas was obtained by
carrying out experiments in a closed vessel in which the initial gas pressure $p_1$
could be varied between 1 and 100 atm. The final temperature will depend on
the initial gas pressure $p_1$. If explosion requires a definite high temperature
to be reached, then initiation will be the more difficult the greater the initial gas
pressure. The arrangement used was as follows. Nitroglycerine was spread as
a ring on a flat steel anvil inside a chamber which was then filled with air or nitrogen
from a gas cylinder. When the gas pressure had reached a required value, the explosive
was hit with a flat steel hammer. A sketch of the apparatus is given in
figure 2.



Figure 2. High pressure apparatus showing brass tube A; steel rollers D, E; steel ball K;
electromagnet LM; steel head B; graphite washers W; gas inlet F, G; gauge J and outlet
H. Cylindrical weight O shown on right.

Some results obtained at different initial gas pressures are summarized in table 1.
When the steel ball is allowed to fall in the cylinder at high pressure a small correction
has to be made to the energy of impact to allow for the viscous resistance of
the air. The correction is small and allowance has been made in the table for this
effect.

The explosion efficiency falls as the initial gas pressure is increased and drops to
zero when the initial pressure is 20 to 35 atm. The results provide strong evidence
in support of the view that initiation is due to adiabatic compression of trapped gas.
The difference between the results for air and nitrogen shows that the chemical
nature of the gas is important. (See also Bowden et al. 1947; Mulcahy 1948).
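
The trend can be illustrated numerically under stated assumptions. The sketch below applies the adiabatic relation of equation (1) to the initial pressures used in table 1 while holding the final pressure fixed. The final pressure of 1000 atm., the initial temperature of 293 K and the function name are hypothetical choices made only for illustration; the paper does not state the final pressure reached.

```python
GAMMA_AIR = 1.4      # ratio of specific heats for air, as quoted in the text
T1_KELVIN = 293.0    # assumed room temperature (not stated in the paper)
P2_ATM = 1000.0      # hypothetical final pressure, for illustration only


def final_temperature(p1_atm, p2_atm=P2_ATM, t1=T1_KELVIN, gamma=GAMMA_AIR):
    """Adiabatic final temperature T2 = T1 * (p2 / p1) ** ((gamma - 1) / gamma)."""
    if p1_atm <= 0 or p2_atm <= 0:
        raise ValueError("pressures must be positive")
    return t1 * (p2_atm / p1_atm) ** ((gamma - 1.0) / gamma)


# Initial pressures (atm.) at which impacts were made in table 1.
initial_pressures = [1, 10, 20, 25, 30, 35]
temperatures = [final_temperature(p) for p in initial_pressures]

if __name__ == "__main__":
    for p1, t2 in zip(initial_pressures, temperatures):
        print(f"p1 = {p1:3d} atm.: T2 = {t2:6.0f} K")
```

With the final pressure held fixed, each increase in the initial pressure lowers the computed final temperature, in line with the observed fall in explosion efficiency.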


Table 1. Explosion efficiencies obtained with a ring of nitroglycerine
at different initial pressures of air and nitrogen

mass of    height     kinetic energy    initial gas       explosion efficiency
striker    of fall    of impact         pressure $p_1$    nitrogen    air
(g.)       (cm.)      (g.cm.)           (atm.)            (%)         (%)

112        30         3.4 × 10³         1                 83          90
           36         -                 10                30          -
           36         -                 20                0           -
           36         -                 25                -           20
           36         3.4 × 10³         30                -           0

112        63.5       7.1 × 10³         1                 100         100
           98         -                 20                10          -
           98         -                 25                0           -
           98         -                 30                -           36
           98         7.1 × 10³         35                -           0


Impact experiments at low initial air pressures 

The effect of reducing the initial air pressure below atmospheric has also been tried. 
Assuming the final pressure $p_2$ to be constant for a given impact energy, the final
temperature $T_2$ increases as the initial pressure is reduced. Under these conditions,
however, the mass of gas trapped in the bubble becomes smaller and the
quantity of heat developed is correspondingly lowered.

The apparatus used is shown in figure 3 and consists of a collapsible bellows 
which was so designed that changes in gas pressure did not cause any relative 
movement of striker and anvil. The blow was delivered externally through a hardened
surface. The nitroglycerine was spread as a ring on the anvil. With this apparatus
the energy used in compressing the bellows itself is small. The pumping unit 
consisted of a Speedivac pump capable of pressures better than 10“^ mm. in series 
with a three stage mercury diffusion pump, and the pressures were recorded by 
a Pirani gauge having a range from 1 mm. to better than 10~^ mm. and a McLeod 
gauge calibrated to lO"”® mm. 

Some results obtained when the initial air pressure was less than 10”^ mm. are 
given in table 2, and the explosion efficiency for one of the strikers has been plotted 
as a function of the height of fall in figure 4. In these experiments the total pressure 
did not drop much below mm. which corresponds to the vapour pressure of 
nitroglycerine at room temperature. The energies of impact are such that no 
explosions are obtained when the nitroglycerine is spread as a continuous film. 



Figure 3. Low pressure apparatus (section through centre).


Table 2. Explosion efficiency obtained with a ring of
nitroglycerine at a low initial air pressure

Initial pressure $p_1$: total pressure c. 10~^ mm.; air pressure c. 10"® mm.

mass of    height     energy of    no. of explosions /    explosion
striker    of fall    impact       no. of impacts         efficiency
(g.)       (cm.)      (g.cm.)                             (%)

210        30         6300         10/10                  100
95         30         2860         23/26                  88
95         10         950          8/14                   57
95         5          480          1/12                   8
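
The entries of table 2 can be cross-checked with a short sketch: the energy of impact is taken as mass times height of fall, and the explosion efficiency as the number of explosions divided by the number of impacts. The function names and the 2 % tolerance on the quoted energies are assumptions introduced here for illustration.

```python
# Rows of table 2:
# (mass g., height cm., quoted energy g.cm., explosions, impacts, quoted %)
TABLE_2 = [
    (210, 30, 6300, 10, 10, 100),
    (95, 30, 2860, 23, 26, 88),
    (95, 10, 950, 8, 14, 57),
    (95, 5, 480, 1, 12, 8),
]


def impact_energy(mass_g, height_cm):
    """Energy of impact in g.cm., taken as mass times height of fall."""
    return mass_g * height_cm


def efficiency_percent(explosions, impacts):
    """Explosion efficiency as a percentage of impacts giving explosions."""
    if impacts <= 0:
        raise ValueError("number of impacts must be positive")
    return 100.0 * explosions / impacts


def check_rows(rows, energy_tolerance=0.02):
    """Return (energy, percent, energy_ok, percent_ok) for each row."""
    results = []
    for mass, height, quoted_energy, explosions, impacts, quoted_pct in rows:
        energy = impact_energy(mass, height)
        pct = efficiency_percent(explosions, impacts)
        energy_ok = abs(energy - quoted_energy) <= energy_tolerance * quoted_energy
        pct_ok = round(pct) == quoted_pct
        results.append((energy, pct, energy_ok, pct_ok))
    return results


if __name__ == "__main__":
    for row in check_rows(TABLE_2):
        print(row)
```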


The explosion efficiency for low energy impacts is appreciable even when the
initial air pressure is less than 10"® mm. The vapour pressure of the nitroglycerine
itself is 10"® mm.





Impact experiments in low pressure atmospheres of different gases 

When gases having a value for γ lower than air are included in the gas space,
there is a fall in explosion efficiency. Values have been obtained for the explosion
efficiency over a fairly wide range of initial gas pressures. The nitroglycerine was
spread as a ring and pure samples of the gases introduced into the apparatus
described in the previous section.

The results obtained are given in figures 5 and 6. In figure 5 there is a comparison
of the explosion efficiency when air (γ = 1.4), C₅H₁₂ (γ = 1.08) and C₂H₄ (γ = 1.26) are
used for an impact energy of 6.3 × 10³ g.cm. It is seen that despite its smaller
nominal value of γ, C₅H₁₂ vapour is more effective than C₂H₄ in enhancing the
sensitivity to impact of nitroglycerine. Both are, however, less efficient than air,
where the efficiency remains at 100 % even down to pressures of 10“® mm. When
the impact energy is 2.9 × 10³ g.cm. the effect of C₅H₁₂ and C₂H₄ is similar (see
figure 6). In addition, low pressures of CCl₄ (γ = 1.13) and CH₃ONO₂ were tried.
The latter was chosen because of its similarity to nitroglycerine. The explosion
efficiency remains unaltered when the evacuated space is filled with 50 mm. of
methyl nitrate vapour. This result is of some interest from the practical point
of view.



Figure 4. Explosion efficiency against height of fall (cm.) curve obtained at low pressures of air
and argon. Mass of striker 95 g. +, argon, 20 mm.; ○, air, 20 mm.; ●, air, 10"® mm.

Discussion 

The experiments described above demonstrate the sensitizing effect of bubbles
of n-pentane, carbon tetrachloride and methyl nitrate. These are gases whose
critical temperatures (470, 556 and c. 510° K) are above room temperature and
which condense if compressed slowly. Initiation of explosion is facilitated even by
high initial pressures, e.g. 250 mm. of n-pentane, that is under conditions where
liquefaction could occur very readily during slow compression. The relative
efficiencies (see figures 5 and 6) of these gases bear no relation to their several
critical temperatures. It would appear that condensation does not occur during
the very rapid (10~^sec.) compression.



The adiabatic compression of saturated vapour need not always result in condensation.
The condition for no liquefaction to occur is (Roberts 1943)

(2)

This relation is satisfied by carbon tetrachloride but not by n-pentane or methyl
nitrate. However, if one takes the value γ = 1.08 for n-pentane, the estimated final
temperature is c. 150° C which is much too low a temperature to explain the
initiation of explosion (see Bowden & Gurton 1949, who quote 480° C as the ignition
temperature).

Figure 5. Explosion efficiency for different initial pressures of air, pentane and ethylene.
Mass of striker 210 g. Height of fall 30 cm.





The value of γ appropriate to the sudden compression of a gas and defined by
equation (1), is not the value of γ at room temperature and pressure found from
measurements of the speed of sound, etc. Even for the permanent gases, γ varies
with pressure and temperature (see International critical tables, first edition) and
during a compression γ depends on the rate of pressure rise (Lewis & von Elbe
1939). This variation has been ascribed to the failure of the kinetic energy to share
itself out among all the modes of motion.

If $n$ is the number of degrees of freedom involved,

$$\gamma = \frac{n+2}{n}, \qquad (3)$$

and any reduction in n causes an increase in γ towards the value γ = 1.67 corresponding
to a monatomic gas. Experiments on the dispersion of sound in gases
confirm this view but lead to numerically different results (Alexander & Lambert
1942; Richards 1939). It is possible that complex molecules such as pentane and
methyl nitrate behave like monatomic gases during compression. However, the
compression time before explosion occurs is fairly long (c. sec.) compared with
the period of relaxation during which equilibrium is established between translational
and vibrational energy of the molecules. It should be remembered that
γ as defined by equation (3) applies only to an ideal gas. This relation will hold for
small compressions such as those occurring in a sound wave. When the pressure
ratio is high deviations from the ideal nature of the gas must be taken into account.
It can be shown that over a wide range of pressure, the deviation from ideal behaviour
has a marked effect on the relation between temperature and pressure, and
the actual value for the temperature rise is higher than that calculated from
equation (1). This provides an explanation for the sensitizing action of the condensable
gases and of the self-sensitized explosion of nitroglycerine spread as a ring
in vacuo (10"^ mm. Hg). Here it is the vapour of the nitroglycerine itself which
undergoes the adiabatic heating and which initiates the explosion.
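
The ideal-gas relation between γ and the number of degrees of freedom can be explored with a minimal sketch, taking equation (3) in the form γ = (n + 2)/n. The function names are illustrative, and the "effective n" printed for each gas is simply the inverse of that relation applied to the nominal γ values quoted in this paper, not a measured quantity.

```python
def gamma_from_degrees_of_freedom(n):
    """Equation (3) for an ideal gas: gamma = (n + 2) / n."""
    if n <= 0:
        raise ValueError("n must be positive")
    return (n + 2.0) / n


def degrees_of_freedom_from_gamma(gamma):
    """Inverse of equation (3): n = 2 / (gamma - 1)."""
    if gamma <= 1.0:
        raise ValueError("gamma must exceed 1")
    return 2.0 / (gamma - 1.0)


if __name__ == "__main__":
    # Nominal gamma values as quoted in the text and in table 6.
    for name, gamma in (("air", 1.4), ("ethylene", 1.26),
                        ("carbon tetrachloride", 1.13), ("n-pentane", 1.08)):
        n = degrees_of_freedom_from_gamma(gamma)
        print(f"{name:22s} gamma = {gamma:4.2f}  effective n = {n:5.1f}")
```

On this relation n = 3 gives the monatomic value γ = 1.67, so freezing out internal modes during a rapid compression would move a complex vapour towards that limit.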


Decomposition initiated in the vapour phase

The detailed mechanism of initiation cannot be inferred from these results alone, 
and the exact origin of decomposition is still uncertain. It may begin by the simultaneous
(within 10”^® sec.) activation by hot molecules from the gas phase of two
adjacent molecules in the interface (cf. Garner 1938). The exponential factor
associated with this binary event is and at the temperatures concerned
(c. 500° C) is extremely small, and it is unlikely that this is the correct mechanism.

It is again suggested that the explosion begins as a burning in the gas phase
of the vapour from the liquid explosive and that this inflammation spreads to the
bulk, and the evidence for this view is summarized below:

1. The explosive vapour in the gas bubble is heated during compression to the
same temperature as the gas. The liquid on the other hand must be heated by
thermal conduction and it seems reasonable that the vapour will inflame first.

2. The concentration of nitroglycerine vapour in air at room temperature is
extremely small (one part in 10®). The possibility of the ignition of greatly diluted
explosive vapour has not been closely examined. Preliminary experiments by
Chamberlain, Gray & Walsh (1947) demonstrated the ease with which methyl
nitrate vapour, diluted ten times with nitrogen, ignites, and confirm the likelihood
of such inflammation.

3. When the initial air pressure is low (10-^ to 10-® mm. Hg) the initiation is
due to the compression and burning of the vapour of nitroglycerine heated to the
ignition temperature.

4. Belajev (1938) has recorded the great difficulty of igniting, from the liquid
phase, even such unstable liquids as nitrogen trichloride or nitroglycerine under
high pressure, although estimated temperatures of c. 2000° C were produced.

5. The possibility of igniting gas mixtures by sudden compression is well known 
(e.g. in the diesel engine and the experiments of White & Price 1919, and Buckler 
& Norrish 1938). 

6. The chemical nature of the included gas is important. Thus oxygen is more 
effective than nitrogen in sensitizing nitroglycerine to gentle impact (Mulcahy 
1948). The results shown in figures 5 and 6 indicate that nitroglycerine and methyl
nitrate vapour are more efficient than inert vapours such as n-pentane and carbon
tetrachloride. The exothermic decomposition of methyl nitrate vapour greatly
facilitates the growth of the explosion in the heated gas.

When liquid explosives are detonated by light impact, the following sequence of
events is therefore postulated. Sudden compression and heating of the trapped gas
occurs and exothermic decomposition of the explosive vapour begins. Even in
vacuo the vapour of the explosive itself suffers a considerable temperature rise.
Furthermore, the physical and chemical heating thus induced lead to evaporation
of explosive from the walls of the bubble to give a richer mixture than existed before
impact. Inflammation of the explosive vapour near the walls becomes sufficiently
violent to ignite the liquid itself, and the explosion grows as a rapid burning through
the bulk (cf. Bowden et al. 1947; Bowden & Gurton 1949).


II. Solids

When a solid such as pentaerythritol tetranitrate (P.E.T.N.) is spread on the 
anvil as a ring and not as a uniform film of crystals its sensitivity to shock is 
increased. 

The explosive (25 mg.) was placed between two hardened steel rollers, 1.3 cm.
in diameter, and the blow delivered by a steel ball released from an electromagnet.
The results obtained when the ball weighed 1860 g. are given in figure 7 where the
explosion efficiency is plotted as a function of the height of fall. The energy necessary
to give an explosion efficiency of 50 % is 2.8 × 10⁴ g.cm. for a ring and
7.1 × 10⁴ g.cm. for a continuous film.* This increase in sensitivity parallels that
of liquid explosives although the effect is not so marked.

* In these experiments it was necessary to use dry P.E.T.N. otherwise low values were
obtained for the explosion efficiency. This effect is difficult to explain since the presence of
pentane or carbon tetrachloride had no desensitizing action (see later).





It could be argued that, in the case of a solid, the increase in efficiency with a 
ring of explosive is due to the greater pressure developed because of the smaller 
effective area of the strikers. However, though experiments with strikers of dif- 
ferent areas have shown that the explosion efficiency does increase somewhat as 
the area of the striker is reduced, the effect is small compared with that observed 
with this particular distribution. 



Figure 7. Explosion efficiency against height of fall curves for P.E.T.N. spread as
a continuous film, a, and as a ring, b. Mass of striker 1860 g.


Impact experiments in nitrogen at high pressure and
with strikers of different hardness

The experiments with nitroglycerine at a high initial gas pressure confirmed the
view that the initiation of explosion in liquids is due to the adiabatic compression
of trapped gas. Similar experiments were therefore carried out with P.E.T.N.


Table 3. Explosion efficiency obtained with a ring of P.E.T.N.
at a high initial pressure of nitrogen

mass of striker 2 kg.
weight of sample 20 to 25 mg.
hardness of rollers D and E 600 to 800 kg./mm.²

height     initial gas    kinetic energy    explosion efficiency
of fall    pressure       of impact
(cm.)      (atm.)         (g.cm.)                          %

23         1              4.6 × 10⁴         7/12           58
25         1              5 × 10⁴           10/10          100
36.5       100            5 × 10⁴           2/15           13




The apparatus used is that shown in figure 2 except that the hardened steel weight
O weighing 2 kg. was used instead of the spherical ball K.

The results with the P.E.T.N. spread as a ring are given in table 3. Measurements
of the time of fall of the weight O showed that the viscous resistance of the air
produced no appreciable reduction in velocity.

There is a reduction in the explosion efficiency from 100 to 13 % when the initial
gas pressure is increased from 1 to 100 atm.

The effect of hardness of the striker 

In carrying out impact experiments with solid secondary explosives, heavy 
strikers fall several feet and very high pressures may be developed in the solid, 
the limiting pressure being the dynamic flow pressure of the striker. When hardened
steel strikers are used, the pressure in the P.E.T.N. film can reach 80,000 atm. 
before the steel deforms plastically. It is clear that the pressure ratio can still be 
very high even when the initial gas pressure is 100 atm. and the temperature rise in 
any trapped gas space can be considerable. 

So as to reduce the maximum pressure attainable, strikers of metal softer than 
steel were also used. The results obtained in these experiments are summarized in 
table 4. 

Table 4. Explosion efficiencies

The sensitivity to shock of P.E.T.N. spread as a ring on a hard steel anvil under different
initial gas pressures depends on the hardness of the striker. Mass of striker 2 kg.

height                    hardness of         initial gas
of fall                   striker, Vickers    pressure, nitrogen    explosion
(cm.)      striker        (kg./mm.²)          (atm.)                efficiency

46         mild steel     200                 1                     3/3
56         mild steel     200                 1                     3/3
82         copper         72                  1                     0/5
82         brass          120                 1                     0/3
82         mild steel     200                 1                     6/6
82         mild steel     200                 100                   2/10*

* These two explosions were of a very localized character.

No explosions are obtained with strikers of copper or brass. During the impact
both the copper and brass strikers are deformed by the crystals of P.E.T.N. and
a deep imprint of the explosive ring is left on the face of the striker (see figure 8,
plate 15). The ring of P.E.T.N. appeared not to flow at all, and its dimensions were
roughly the same before and after impact. When mild steel surfaces are used,
explosions are readily obtained. Examination of the surface of the steel striker after
explosion shows that only slight plastic deformation has occurred. Thus in order to
initiate explosion it is necessary to make the P.E.T.N. flow, and for this to occur the
hardness of the striker must be c. 200 kg./mm.². This means that the P.E.T.N.
will be subjected to pressures of the order of 20,000 atm.
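
The pressures quoted follow from a unit conversion if the hardness number is read as a limiting flow pressure. A minimal sketch, assuming the hardness in kg./mm.² is a kilogram-force per square millimetre and that the flow pressure is numerically equal to the hardness number (both assumptions, as is the function name):

```python
STANDARD_GRAVITY = 9.80665    # m/s^2, so 1 kgf = 9.80665 N
PASCALS_PER_ATM = 101325.0    # standard atmosphere in Pa
MM2_PER_M2 = 1.0e6            # square millimetres in a square metre


def kg_per_mm2_to_atm(hardness_kg_per_mm2):
    """Convert a pressure in kgf/mm^2 to standard atmospheres."""
    if hardness_kg_per_mm2 < 0:
        raise ValueError("hardness must be non-negative")
    pascals = hardness_kg_per_mm2 * STANDARD_GRAVITY * MM2_PER_M2
    return pascals / PASCALS_PER_ATM


if __name__ == "__main__":
    # Hardness values taken from tables 3 and 4.
    for label, hardness in (("copper", 72), ("brass", 120),
                            ("mild steel", 200), ("hardened steel", 800)):
        print(f"{label:15s} {hardness:4d} kg./mm.^2 -> "
              f"{kg_per_mm2_to_atm(hardness):8.0f} atm.")
```

On this reading 200 kg./mm.² corresponds to a little under 20,000 atm. and 800 kg./mm.² to a little under 80,000 atm., in agreement with the orders of magnitude given in the text.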

Mild steel strikers were also used at an initial gas pressure of 100 atm. The table
shows that the explosion efficiency is lower, and the explosions obtained were very
incomplete, the only evidence of decomposition being a small oxidation stain on the
steel surface. On the other hand the explosions at 1 atm. are generally complete
and the efficiency when the same weight falls about half the height is about 100 %.

The difference between efficiencies at 1 and at 100 atm. pressure is striking. 
Even with nearly twice the height of fall the efficiency at 100 atm. is low (2 explosions in 10 impacts) and the
explosions obtained are weak, despite the greater energy and momentum at the 
point of impact. 


Impact experiments at low initial gas pressures 

A series of experiments was carried out in a vacuum apparatus to see whether 
there was any change in explosion efficiency when the initial gas pressure was 
lowered. The apparatus used was similar to that shown in figure 3 but was more 
robustly made in order to withstand the high energy impacts. It could be evacuated
to air pressures near 2 x 10"® mm. and a total pressure when the P.E.T.N. was
present of c. 10“^ mm. The total pressure was measured by a Pirani gauge, which is
a continuously recording instrument and which may detect a small amount of
decomposition as soon as it occurs. The P.E.T.N. was spread as a ring on a flat
steel roller and struck by another flat steel roller. The results obtained are given
in table 5.


Table 5. Explosion efficiency of P.E.T.N. in vacuo when
spread as a continuous film and as a ring

Mass of striker 1860 g.
Initial pressure of gas: 1. McLeod gauge: permanent gases 2 to 5 x 10“-® mm.
                         2. Pirani gauge: total pressure < 10“^ mm.

height                 explosion efficiency
of fall
(cm.)        continuous film         ring

             6/9     ~66%            2/2      -
45           4/12    ~33%            14/18    73%

The table shows that even at this low initial gas pressure the explosion efficiency
is higher when the P.E.T.N. is spread as a ring. The explosion efficiency at the low
gas pressure is lower than in air at 1 atm., the difference being more pronounced
with the continuous film.

Table 6. Impact experiments on a ring of P.E.T.N. in different gases

Mass of striker 1860 g.
Height of fall 45 cm.

                                     initial gas
                                     pressure
gas                      γ           (mm. Hg)

air                      1.4         760
ethylene                 1.26        760
carbon tetrachloride     1.13        11
ether                    1.08        300
n-pentane                1.08        300

explosion efficiency: 8/8, 10/10, 2/2, 5/5




Impact experiments in different gases 

Impact experiments on P.E.T.N. spread as a ring were carried out in gases having
different values for γ. The gases used were air, ethylene, ether, n-pentane and
carbon tetrachloride (see table 6).

The detonation of cyclonite by impact 

Experiments similar to those described with P.E.T.N. were carried out with 
cyclotrimethylene trinitramine (cyclonite). The results follow the pattern described
for P.E.T.N. though the difference in explosion efficiency of cyclonite when spread
as a ring and as a continuous film is not so marked. Explosions could still be
obtained when the initial air pressure was < 2 x 10"® mm. and the pressure of cyclonite
vapour < 10~^ mm.

The propagating properties of cyclonite under impact conditions are poor. When
spread as a continuous film, the explosions were nearly all partial. The extent of the
explosion was much greater with the ring.

Discussion

The behaviour of P.E.T.N. under impact shows many similarities to that of 
nitroglycerine and other liquid explosives. It is suggested that the initiation of
explosion is of a thermal character and is due to the adiabatic compression of gas 
spaces trapped in the solid during the impact. The evidence for this conclusion is 
given below. 

Several mechanisms exist by which the mechanical energy of the blow may be 
converted to heat. The energy of the impact may heat all the explosive uniformly. 
However, even assuming that all the energy is converted to heat, the temperature
rise for P.E.T.N. is only 100° C when the impact energy is sufficient to initiate the
explosion. The ignition temperature of P.E.T.N. when the time of heating is
c. 10“^ sec. has been estimated at 400 to 500° C (see Bowden & Gurton 1949), so
that the above temperature rise would be too small. Bernal (1938) proposed that
the thermal energy could be concentrated at localized points in the explosive.
Bowden and co-workers (1936, 1947) have shown that local hot spots could be
produced during the rubbing of surfaces, the temperature rise being limited by the
melting point of the solid. With materials such as P.E.T.N. and cyclonite the hot
spot temperature due to frictional heating would not exceed about 140 and 200° C
corresponding to the melting points, but again these figures are below the ignition
temperatures.

Several workers (Rideal & Robertson 1948; Eirich (unpublished)) have postulated
a mechanism for the initiation of explosion on the basis of the formation of a molten
layer of the solid explosive. There is evidence that local melting of the solid does
take place during impact, due either to frictional heating or to the application of
a non-uniform pressure to the solid (cf. Johnston & Adams 1913). Mayes and
Eirich (unpublished) observed melting of such materials as P.E.T.N. and sulphur
during impact. Some of their experiments have been repeated, and the results
may be seen in figures 9, 10 and 11, plate 15. Figure 9 shows the result of an
impact on P.E.T.N., and a fused (or flowed) layer may be seen on the anvil at the
edge of the continuous film. The fused layer is most noticeable with low melting
solids such as sulphur, figure 10. With waxy materials such as Tempilstik particles
of molten substance are ejected from the impacting surfaces (figure 11).

Rideal and Robertson consider that the liquid explosive so formed is caused
during impact to flow at a high speed through the small spaces between the explosive 
crystals and viscous heating raises the temperature of the liquid so much that rapid 
thermal decomposition occurs. This mechanism is not unlike that proposed for the 
initiation of thin continuous films of nitroglycerine in the absence of gas spaces 
(Bowden et al. 1947). However, it does not seem applicable to impact on solids 
under the usual conditions, since it is unlikely that that extreme velocity of flow 
is attained which the calculations of Cherry (1945) and Eirich & Tabor (1948) have 
shown to be necessary for the temperature rise due to viscous heating to become 
appreciable. It is more probable that the formation of a molten or plastic layer 
serves to seal off small air spaces, which — there is ample evidence — readily produce 
hot spots under impact. In other words, the melted and flowed explosive behaves 
just like a liquid explosive and it is reasonable to expect the mechanism of initiation 
to be the same as that already proved to be operative in the initiation of liquids. 


The initiation of explosion by the sudden compression of gas spaces

When P.E.T.N. is spread as a ring and not as a continuous film the sensitiveness
to impact is increased. During impact plastic flow or surface melting of the solid
takes place and the relatively large gas space in the centre is trapped and compressed.
The photographic investigations of Bowden & Gurton (1949) have shown
that initiation begins at this trapped gas bubble. The formation of a hot spot near
a gas pocket may be demonstrated by a simple experiment with red mercuric
iodide. When this is spread as a ring and subjected to impact between flat surfaces,
specks of the yellow modification appear close to the centre of the ring. These rapidly
revert to the red form. The transition occurs at 126° C and does not occur on the
slow application of a similar pressure.

Cyclonite and tetryl behave similarly to P.E.T.N. and sudden heating during
adiabatic compression of the trapped gas appears to be the initiation mechanism
for solid explosives of this type.*

A similar mechanism no doubt holds in the absence of a deliberately included gas
space. Even when the crystals are spread as a ‘continuous film’ small gas spaces
are trapped due to the plastic flow or surface melting of the crystals during impact.

* [...] in general, detonate [...]; [the formation] of a molten layer is therefore
excluded. Some experiments [...] suggest that the compression of trapped gas plays
little [...] in this case that the initiation is brought about by [...] against each
other or against the impacting surfaces (cf. Bowden & Gurton 1949).

The results with P.E.T.N. spread as a ring showed that at moderately low
pressures gases such as carbon tetrachloride and n-pentane, which have low values
of γ, have the same sensitizing effect on the explosion efficiency as the permanent
gases. This result is different from that obtained with nitroglycerine where these
gases were less efficient than air. The reason for this difference is not difficult to
find. It has been shown that for the solid to flow under impact the pressure
developed is much larger than that to be expected from impact on a liquid, and 
amounts to thousands of atmospheres. In P.E.T.N. it is in the region of 20,000 atm. 
and consequently the temperature rise due to adiabatic compression of gas spaces 
trapped in the solid will be very high even for gases of high specific heat. In the 
absence of any air when the initial pressure is low (c. 10~® mm.) the space is filled 
with vapour of the explosive (c. 10“^ mm.) and once again it is the compression of 
the vapour which initiates the explosion. 

The strongest evidence for the dependence of initiation on the sudden com- 
pression of trapped gas spaces is provided by experiments carried out at high gas 
pressures. It has already been shown that the maximum temperature attained in 
an adiabatic compression depends, not on the final pressure, but on the ratio of the 
final pressure to the initial pressure. Since the final pressure is in some way deter- 
mined by the energy of the impact, it is clear that an increase in the initial gas 
pressure should reduce the maximum temperature attained in an included gas 
pocket. If initiation depends on the production of a local high temperature, then 
an initial high pressure would be expected to reduce the chances of an explosion 
if the impact conditions are unchanged. In the experiments with P.E.T.N. spread
as a ring there was a marked drop in the explosion efficiency when the initial gas 
pressure was increased from 1 to 100 atm. of nitrogen. 
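
The dependence on the pressure ratio can be illustrated numerically. The sketch below assumes an ideal gas with a constant ratio of specific heats, so that the final temperature is T1 (P2/P1)^((γ-1)/γ). The value γ = 1.4 for nitrogen, the starting temperature of 300° K and the function name are assumptions made for illustration; the final pressure of 20,000 atm. is the figure quoted above for P.E.T.N. Real gases depart widely from ideal behaviour at such pressures, so only the trend should be read from the numbers.

```python
def adiabatic_temperature(t_initial, p_initial, p_final, gamma):
    """Ideal-gas temperature after a reversible adiabatic compression.

    Assumes a constant ratio of specific heats `gamma`; temperatures in deg K,
    pressures in any common unit (only their ratio matters).
    """
    return t_initial * (p_final / p_initial) ** ((gamma - 1.0) / gamma)


GAMMA_N2 = 1.4      # assumed value for nitrogen
T_START = 300.0     # assumed starting temperature, deg K
P_FINAL = 20000.0   # atm, the impact pressure quoted for P.E.T.N.

for p_start in (1.0, 100.0):
    t_end = adiabatic_temperature(T_START, p_start, P_FINAL, GAMMA_N2)
    print(f"initial pressure {p_start:6.1f} atm -> final temperature {t_end:7.0f} K")
```

With the same final pressure, raising the initial pressure from 1 to 100 atm. lowers the computed temperature by several thousand degrees, which is the sense of the experimental result.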

It would seem therefore that there is little difference in the mechanism of
initiation by impact in liquids, or solids such as P.E.T.N. and cyclonite. Under
conditions where it is possible to include gas spaces in the explosive, initiation of
explosion is due to the compression and heating of these trapped gas pockets.

I should like to thank Dr F. P. Bowden, F.R.S. for his constant interest and
advice, the Royal Society for the Mackinnon Research Studentship, and the
Ministry of Supply (Air) for grants for equipment. My thanks are also due to
Mr P. Gray and Dr O. A. Gurton for helpful discussions.


References 

Alexander, E. A. & Lambert, J. D. 1942 Proc. Roy. Soc. A, 179, 499.
Belajev, A. P. 1938 C.R. Acad. Sci. U.R.S.S. 18, 267.
Bernal, J. D. 1938 Trans. Faraday Soc. 34, 1008.
Bowden, F. P. & Gurton, O. A. 1949 Proc. Roy. Soc. A, 198, 337.
Bowden, F. P., Mulcahy, M. F. R., Vines, R. G. & Yoffe, A. 1947 Proc. Roy. Soc. A, 188, 291, 311.
Bowden, F. P. & Ridler, K. E. W. 1936 Proc. Roy. Soc. A, 154, 640.
Bowden, F. P., Stone, M. A. & Tudor, G. K. 1947 Proc. Roy. Soc. A, 188, 329.
Buckler, E. J. & Norrish, R. G. W. 1938 Proc. Roy. Soc. A, 167, 292.
Chamberlain, G. H., Gray, P. & Walsh, A. D. 1947 Conference on mechanism of inflammation and combustion processes in the gaseous phase, Paris, 1948.
Cherry, T. 1945 Rep. Adv. Coun. Sci. Ind. Aust. A, 121, no. 9.
Eirich, F. & Tabor, D. 1948 Proc. Camb. Phil. Soc. 44, 566.
Garner, W. E. 1938 Trans. Faraday Soc. 34, 985, 1008.
Jeffreys, H. 1935 Phil. Mag. 19, 840.
Johnston, J. & Adams, L. H. 1913 Amer. J. Sci. 35, 205.
Lewis, B. & von Elbe, G. 1939 J. Chem. Phys. 7, 197.
Mulcahy, M. F. R. 1948 Phil. Mag. 39, 547.
Richards, W. T. 1939 Rev. Mod. Phys. 11, 36.
Rideal, E. K. & Robertson, A. J. B. 1948 Proc. Roy. Soc. A, 195, 135.
Roberts, J. K. 1943 Heat and thermodynamics. Oxford University Press.
White, A. G. & Price, F. W. 1919 J. Chem. Soc. p. 1462.


Description of Plate 15

Figure 8. (Magn. × 4.6.) a, plastic deformation of a copper striker under the region of the ring
of P.E.T.N. after impact; b, as a but using a brass striker; c, mild steel striker showing
little plastic deformation during impact and explosion.

Figure 9. Impact on P.E.T.N. crystals showing fused layer at the edge of the compressed
powder. Striker 1860 g. falling 45 cm.

Figure 10. Melting of sulphur during impact.

Figure 11. Flow of Tempilstik (m.p. 316° C) during impact.


The effect of diffusion of the main reactants on 
flame speeds in gases 


By J. Corner, Armament Research Establishment, Ministry of Supply

(Communicated by Sir John Lennard-Jones, F.R.S. Received 15 January 1949. Revised 7 April 1949)


The general equations of a single-reaction flame are written down, allowing for the diffusion
of the main reactants and products. It is shown that the pressure-dependence of the flame 
speed is not altered by the consideration of diffusion. A method of successive approximation 
is applied to the equations with diffusion, giving an easily applied solution for the speed of 
a flame maintained by a single reaction whose rate depends only on the temperature and the 
concentrations of the reactants. 


1. Introduction 

Boys & Corner (1949) have discussed the speed of propagation of reaction zones
(flames) when diffusion of the reactants can be neglected. In the present paper I shall
discuss the effects produced by diffusion. The notation is, as far as possible, the same
as in the earlier paper.

I begin (§ 2) by a sketch of the problem considered; in § 3 the equations of a moving
reaction zone are written down; methods of dealing with these equations are reviewed
in § 4, and an approximate solution, together with some general results, is given
in § 5.



2. Notation; problem considered

Let Ox be a fixed direction in space, normal to a plane front which is in steady
motion in the direction of x decreasing. The unreacted gas is at rest at large negative
values of x. One wishes to predict the speed for which such a steady motion is possible.
Unsteady states or detonation, in which a shock wave is intimately associated with
the reaction zone, are excluded, so that the equations to be written down do not
explain how such a steady and stable flame could be generated. However, there are
many cases in which a normal flame speed can be recognized, and to which the present
work may apply. Practical application of the results has been made to a case of this
kind (the burning of cordite).

‘Flame speeds’ mentioned in this paper are relative to the unburnt gas, being the
‘fundamental flame velocity’ of Coward & Hartwell (1932) or the ‘burning velocity’
of Lewis & von Elbe (1938a). For a discussion of definitions of flame speed reference
may be made to Coward & Payman (1937).

As the flame passes through a layer of gas, the latter is heated, reacts and evolves
heat. Since the pressure remains practically constant, the hot products of the
reaction have a velocity (relative to the unburnt gas) in the opposite direction to
the flame motion. Relative to the flame, the unburnt gas advances from the direction
of negative x and passes through the flame zone where there is a rapid but
continuous rise of temperature and a continuous rise of the degree of reaction. The
burnt gases, at the temperature of complete reaction, move along the x axis, with
a velocity which is nearly always greater than the velocity of the unburnt gases on
the other side of the flame. A complete theory of flame propagation should give the
temperature and reaction distributions in the flame zone, and the velocity of the
stable flame. This can be observed experimentally, and provides a test of the accuracy
of the general picture of the reaction zone. The structure of the flame is not so easily
accessible to experiment, except at low pressures.

Let the flame velocity be $U_0$. To produce a stationary space distribution of temperature
and composition the axes are given a velocity $U_0$ in the same direction as
the flame. The velocity of the gas in the direction of x increasing is then U, a function
of x alone. At large distances in the unburnt gas, U is $U_0$. The choice of origin of x
is arbitrary, and does not affect our equations.

The gas is assumed to undergo a single exothermic chemical reaction which
maintains the flame. Let $\epsilon$ be the extent to which the reaction has proceeded when
the gas has reached a plane x, so that of the gases passing this place a fraction $\epsilon$
(by mass) consists of the products of reaction.

As it stands, this assumption of a single reaction seems to be a serious restriction
on the application of our equations. For there are few burnable gas mixtures in
which several reactions do not go on together and in which reverse reactions can be
neglected. A type of reaction to which our equations would apply would be a unimolecular
decomposition with negligible reverse reactions. This system is particularly
simple because of the small number of equations and functions involved.
More general cases have more equations to determine a greater number of unknowns,
but the equations are set up exactly as in the simple case. There are also many more
parameters such as diffusion coefficients of various substances and activation
energies of various reactions, but there is no essentially new type as compared with
the simple case considered here.*

In practical cases, it can often be assumed that certain reactions are fast com- 
pared with others, thus allowing the use of equilibrium constants instead of reaction 
rates, with a considerable simplification of some of the equations. However, the 
allowable simplifications vary so much from one case to another that it seems 
unlikely that any one theory can cover more than a fraction of the known flames. 
It is probable that each flame will have to be considered separately. 

While the flame equations can be solved numerically whatever the reactions
assumed, the labour is very great. Analytical solutions are desirable but seem to be 
unlikely except for simple models such as that of the present paper. 

At a point x in the reaction zone, the temperature is T° K, and the specific volume
is V cm.³/g. One sq.cm. of the flame consumes unburnt gas at the rate M g./sec.
Since the motion is steady,

$$U = MV, \tag{1}$$

and M is the quantity to be found from the equations. Let suffixes 0, m refer to the
initial and final states respectively, so that at $x = -\infty$, $T = T_0$, gas velocity $= U_0$,
specific volume $= V_0$, and $\epsilon = 0$; at $x = +\infty$, the reaction has been completed, and
$T = T_m$ (temperature of completely reacted gas), gas velocity $= U_m$, specific volume
$= V_m$ and $\epsilon = 1$. Heat transmitted through the flame by radiation is neglected, as is
cooling of the products, except for heat flow by conduction through the flame
towards the cold reactant. This heat flow maintains the flame.


Usually $U_m$ is larger than $U_0$, so that, on transforming back to axes at rest with
respect to the initial gas, the products are moving in the opposite direction to the
flame. The alternative case could occur only if the flame were very cool and the
number of molecules were sufficiently decreased in the flame reaction.

Let the diffusion coefficient of the reactant be D cm.²/sec., the thermal conductivity
of the mixture be $\lambda$ cal./cm. sec. deg. C., and let the pressure be P atm. In general,
D and $\lambda$ will depend on $\epsilon$ and the temperature.

Let Q cal./g. be the heat evolved by the reaction at temperature T, and let $c_p'$ and
$c_p''$ cal./g. be the constant-pressure specific heats of reactant and product at
temperature T. We have

$$dQ/dT = c_p' - c_p''. \tag{1a}$$


We need also the homogeneous reaction velocity. Let $\phi(\epsilon, V, T)$ be the rate of
change of $\epsilon$ when the reaction proceeds homogeneously at temperature T, with
composition and specific volume given by $\epsilon$ and V cm.³/g. This reaction rate $\phi$ can
be determined in principle by the methods of chemical kinetics, without experiments
on flames.

The variation of any quantity $\theta$, following the motion of an element of the fluid, is

$$D\theta/Dt = \partial\theta/\partial t + U\,\partial\theta/\partial x = U\,d\theta/dx. \tag{2}$$


* The work of Lewis & von Elbe (1934) on the theory of flames in ozone-oxygen mixtures
shows the complications introduced by having even as few as three substances present, and the
drastic simplifications necessary. The theory of composition variations in a reacting medium
has, however, been formulated in a very general way by Damköhler (1936).



Therefore the rate at which an element of gas changes its composition by reaction is

$$[D\epsilon/Dt]_{\text{chem.}} = \phi(\epsilon, V, T). \tag{3}$$

In the present model the diffusion of active particles (radicals or atoms) is not 
included. Only the diffusion of the main reactants and products is considered. The
presence of active particles might be taken into account by generalizing the equations 
to apply to a multi-reaction system in which production, diffusion and decay of 
radicals, and the reactions they cause, are all included. The fact that the radicals 
are present only in small quantities simplifies the equations to some extent; for 
example, their contribution to the pressure can be neglected. 

We have assumed that the reaction rate at any point depends on the temperature 
T at that point just as if the mixture were in a large vessel all at temperature T. This
means that we are assuming that there is negligible temperature variation along 
a distance of the order of a mean free path in the gas. The condition can be put in 
the form: let P be the pressure in atm.; then the effective breadth of the flame must 
. be much greater than 10'~^/P cm. 

We take into account diffusion caused by concentration gradient, but neglect the
relatively small ‘thermal diffusion’ due to the temperature not being uniform. The
difference of composition set up by thermal diffusion between regions at 1000 and
2000° K is usually less than 7 %.


3. Equations of the reaction zone


(a) Composition equation

Consider the region lying between x and x+dx, and of unit cross-section. The mass
of initial substance entering at x is $M(1-\epsilon)$ g./sec., and so the net amount leaving
the region is $-M(d\epsilon/dx)\,dx$ g./sec.

By equation (3) the loss of initial substance by chemical reaction in the region is

$$M\phi\,dx/U \text{ g./sec.}$$

The concentration of reactant is $(1-\epsilon)/V$ g./cm.³. Therefore the diffusional flow
entering across the plane x is

$$(D/V)(d\epsilon/dx) \text{ g./sec.},$$

and the net loss from the region by diffusion is

$$\frac{d}{dx}\left(\frac{D}{V}\frac{d\epsilon}{dx}\right)dx \text{ g./sec.}$$

Since the system is in a steady state, the three losses sum to zero, that is,

$$M\frac{d\epsilon}{dx} - \frac{d}{dx}\left(\frac{D}{V}\frac{d\epsilon}{dx}\right) = \frac{M\phi}{U}. \tag{4}$$

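(b) Heat equation

A second equation is provided by the condition that there is no loss of heat by the
region between x and x+dx.

The heat entering at x by conduction is $-\lambda\,dT/dx$ cal./sec., and the net loss of heat
by conduction is

$$-\frac{d}{dx}\left(\lambda\frac{dT}{dx}\right)dx \text{ cal./sec.}$$

The heat evolved by chemical reaction in the layer is $MQ\phi\,dx/U$ cal./sec. The net
loss of heat by mass flow is

$$M\frac{d}{dx}\left[\{c_p'(1-\epsilon)+c_p''\epsilon\}T\right]dx \text{ cal./sec.}$$

As there is no net loss of heat,

$$\frac{d}{dx}\left(\lambda\frac{dT}{dx}\right) - M\frac{d}{dx}\left[\{c_p'(1-\epsilon)+c_p''\epsilon\}T\right] + \frac{MQ\phi}{U} = 0. \tag{5}$$

Elimination of $\phi$ from (4) and (5) gives

$$\frac{d}{dx}\left(\lambda\frac{dT}{dx}\right) + Q\left[M\frac{d\epsilon}{dx} - \frac{d}{dx}\left(\frac{D}{V}\frac{d\epsilon}{dx}\right)\right] - M\frac{d}{dx}\left[\{c_p'(1-\epsilon)+c_p''\epsilon\}T\right] = 0. \tag{6}$$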


(c) Momentum equation 

Conservation of momentum gives the hydrodynamical equation of motion:
$P + MU$ independent of x. The pressure at any point is connected with the composition,
concentration and temperature at that point. The relation may be written

$$P = f(\epsilon, V, T) \tag{7}$$

and by substituting this in the momentum equation one obtains a single relation
between $\epsilon$, V and T. But at all practical flame speeds P is effectively constant through
the reaction zone, and with ample accuracy (7) alone gives the connexion between
T, V and $\epsilon$ (taking P as the pressure in the undisturbed gas).


(d) Boundary conditions 

The boundary conditions on these equations must suffice to determine $\epsilon$, V and
T as functions of x, and a solution must be possible only for one value of M. All the
other quantities introduced are known from the nature of the mixture being burnt
and its pressure in the undisturbed state. There are two differential equations (4)
and (5), and one ordinary equation (7). The latter can be used to eliminate V in terms
of $\epsilon$, T and of course P, which is one of the given parameters of the problem. There
remain two second-order differential equations for $\epsilon$ and T as functions of x.

For any value of M, the solution is determined if it must have known values of
$\epsilon$, $d\epsilon/dx$, T, $dT/dx$ at a given point. The physical boundary conditions are that in the
undisturbed gas there is no reaction and the temperature is $T_0$; that is, at $x = -\infty$,
$T = T_0$, $\epsilon = 0$, $d\epsilon/dx = 0$, and therefore $dT/dx = 0$.

To form an acceptable picture of the reaction zone, there must be a point where
the reaction has reached completion ($\epsilon = 1$), and at the same time the temperature
has reached $T_m$, corresponding to complete reaction. These two conditions determine
the point at which this happens and the value of M for which the condition is satisfied.
That there is only one such flame speed (in steady motion) is suggested by experiment
as well as made plausible by numerical solution of simple forms of the equations.
If, on the other hand, pressure variations through the flame had been taken into
account, with the possibility of shock waves, for example, then in some cases two
widely different values of M would give solutions which satisfy all the boundary
conditions. The smaller M would be the ordinary ‘flame speed’, the larger M the
‘detonation velocity’.

With most simple forms of reaction kinetics, the end-point of the reaction zone,
where $\epsilon = 1$, is at infinity, and the boundary condition is then: T tends to $T_m$ and
$\epsilon$ to 1 as x tends to infinity. It is possible that in other cases the end-point may be
at a finite distance, in which case the region of burnt gas would have $\epsilon = 1$ and
$T = T_m$ for all x greater than the end-point.


4. Flame theories

The subject up to 1935-6 has been reviewed by Jost (1935, 1936) and Lewis &
von Elbe (1938b). In this section only a sketch is attempted.

There are three equations of the problem, a reaction equation (4), a heat equation
(5) and the equation of the isobars of the gas mixture (7). Most of the early theories
used the heat equation (5), and avoided the reaction equation (4) by some plausible
assumption, such as the introduction of an ‘ignition temperature’ or ‘mean lifetime
in the flame’. The problem was thus reduced from the solution of two simultaneous
differential equations to the solution of a single equation. The earliest work on these
lines was carried out by Mallard (1875) and Mallard & le Chatelier (1883), followed
by Crussard (1914), Nusselt (1915) and Daniell (1930).

Jouguet (1913a) was the first to study the full set of flame equations (neglecting
diffusion) as a system to determine the flame speed and the distribution of temperature
and reaction through the flame zone. Jouguet explained how the flame
speed arises from the need to satisfy all the boundary conditions. In a later paper,
Jouguet (1913b) illustrated his general method by solving a special case, chosen for
its mathematical simplicity, and which nevertheless retained the main features of
a real flame. Some of the simplifications were removed later (Jouguet & Crussard
1919; Jouguet 1924).

Impressed by the role of active particles in promoting reaction, Lewis & von Elbe 
(1934) attempted the relatively simple multiple-reaction case of O2-O3 mixtures. 
Drastic simplifications had to be made in the equation of conservation of energy, 
and were criticized later by Jost & Müffling (1937).

More recently the equations of the flame have been studied by Zeldowitsch &
Frank-Kamenetsky (1938) with the aims of including diffusion of reactants and
eliminating the assumption of an ignition temperature. Their equations differed in
detail from those given in § 3. An approximate solution was obtained whose physical
basis was that most of the reaction was assumed to take place near the maximum
temperature $T_m$, on account of the rapid increase of reaction rate with temperature.

A more recent theory is that of Boys & Corner (1949), intended to apply to reactions
just outside condensed phases. The reaction rate was taken to have a temperature-dependence
$e^{-A/RT}$, and three cases were discussed: a first-order rate from a unimolecular
mechanism; a second-order rate from a bimolecular reaction; a second-order
rate as the low-pressure result of a unimolecular breakdown. Diffusion was
neglected. The equations were solved by a method of successive approximation,
whose accuracy was compared with a numerical solution. The first approximation,
of the same nature as that of Zeldowitsch & Frank-Kamenetsky, was wrong by
a factor as large as three; the second approximation was correct to within 10 to 15 %
in the examples studied.


5. Solution of the flame equations with diffusion 


(a) Combination of parameters 

We shall examine first the results of combining the parameters of the equations,
a method used by Boys & Corner (1949). We assume that $c_p' = c_p'' = c$, and that c, $\lambda$
and D can be given mean values independent of T.*

The equations are

$$M\frac{d\epsilon}{dx} - \frac{d}{dx}\left(\frac{D}{V}\frac{d\epsilon}{dx}\right) = \frac{\phi}{V}, \tag{8}$$

$$-\lambda\,d^2T/dx^2 + Mc\,dT/dx = Q\phi/V. \tag{9}$$
We assume that the system is a mixture of perfect gases, the reactant having average
molecular weight W, the products an average weight w. Let $n = W/w - 1$. Then

$$PVW/RT = 1 + n\epsilon. \tag{10}$$

The gas constant R depends on the units chosen for P and V; if these are atmospheres
and cubic centimetres, then R = 82.06.
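
Equation (10) fixes the specific volume, and hence through (1) the gas velocity, at any stage of the reaction. The short sketch below evaluates it at the cold and hot ends of a flame. The numerical values (molecular weights 75 and 15, temperatures 300 and 2500° K, pressure 1 atm.) and the function name are illustrative assumptions only.

```python
R_ATM_CM3 = 82.06  # gas constant in cm^3 atm / (mole deg K), as quoted in the text


def specific_volume(temp_k, eps, pressure_atm, w_reactant, w_product):
    """Specific volume V (cm^3/g) from equation (10): P V W / (R T) = 1 + n*eps."""
    n = w_reactant / w_product - 1.0
    return R_ATM_CM3 * temp_k * (1.0 + n * eps) / (pressure_atm * w_reactant)


# Illustrative (assumed) values: unburnt gas at 300 K, burnt gas at 2500 K, 1 atm.
v_cold = specific_volume(300.0, 0.0, 1.0, 75.0, 15.0)
v_hot = specific_volume(2500.0, 1.0, 1.0, 75.0, 15.0)

# Since U = M V with M constant through the flame, U_m / U_0 = V_m / V_0.
print(f"V_0 = {v_cold:.1f} cm^3/g, V_m = {v_hot:.1f} cm^3/g, U_m/U_0 = {v_hot / v_cold:.2f}")
```

For these values the burnt gas leaves the flame about forty times faster than the unburnt gas enters it, partly from the temperature rise and partly from the increase in the number of molecules.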

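For a first-order reaction (case I of Boys & Corner’s paper)

$$\phi = B_1(1-\epsilon)e^{-A/RT}. \tag{11}$$

Let $y = x(PWB_1/\lambda)^{1/2}$; substituting this and (11) in (8) and (9),

$$M\left(\frac{PWB_1}{\lambda}\right)^{1/2}\frac{d\epsilon}{dy} - \frac{PWB_1}{\lambda}\frac{d}{dy}\left(\frac{D}{V}\frac{d\epsilon}{dy}\right) = \frac{PWB_1(1-\epsilon)e^{-A/RT}}{RT(1+n\epsilon)}, \tag{12}$$

$$Mc\left(\frac{PWB_1}{\lambda}\right)^{1/2}\frac{dT}{dy} - PWB_1\frac{d^2T}{dy^2} = \frac{QPWB_1(1-\epsilon)e^{-A/RT}}{RT(1+n\epsilon)}, \tag{13}$$

which can be written as

$$(DPW/\lambda)\,f_1(\epsilon, T, n, d\epsilon/dy) + M(\lambda PWB_1)^{-1/2}\,d\epsilon/dy = F(\epsilon, T, n, A), \tag{14}$$

$$Mc(\lambda PWB_1)^{-1/2}\,dT/dy - d^2T/dy^2 = QF(\epsilon, T, n, A), \tag{15}$$

where F and $f_1$ need not be written explicitly. The important point is that they depend
only on the variables listed. The boundary conditions are:

(i) $T = T_0$; $\epsilon = 0$; $d\epsilon/dy = dT/dy = 0$, at $y = -\infty$,
and (ii) $T = T_m$ where $\epsilon = 1$.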

The boundary conditions and equations (14) and (15) contain the independent
variable y, the dependent variables $\epsilon$ and T, and the parameters $T_0$, $T_m$, n, $DPW/\lambda$,
A, c, Q and $M(\lambda PWB_1)^{-1/2}$.

Since $T_0$, $T_m$, c, Q are related by a thermochemical equation, one of these four is
redundant; we shall omit Q. There are no other quantities in the problem. The

* Of these, only D varies rapidly with T. Unfortunately, D behaves like $T^{3/2}$ or $T^2$.



equations and conditions can be satisfied simultaneously only if there is some
relation between the parameters. We may write this relation as

$$M = (\lambda PWB_1)^{1/2}\,g(T_0, T_m, DPW/\lambda, n, A, c), \tag{16}$$

where g is a function only of the quantities listed. The corresponding result without
diffusion, given by Boys & Corner, was that

$$M = (\lambda PWB_1)^{1/2}\,h(T_0, T_m, n, A, c).$$

$DP/\lambda$ is independent of P except at very high pressures; hence the dependence of
M on pressure is not altered by the consideration of diffusion.

{y}, the change of y between points at which T takes given values $T_1$, $T_2$, is a
function of these values and also of $M(\lambda PWB_1)^{-1/2}$, $T_0$, $T_m$, n, A, c and $DPW/\lambda$.
Hence {x}, the corresponding change in x, is

$$\{x\} = (\lambda/PWB_1)^{1/2}\,f(T_0, T_m, DPW/\lambda, n, A, c, T_1, T_2), \tag{17}$$

and in particular the effective flame thickness varies with pressure in the way
predicted by the theory without diffusion.

The method can be applied to the other types of reaction discussed by Boys &
Corner, namely,

Case II: a second-order reaction rate;

Case III: $\phi = B_3(1-\epsilon)(1+n\epsilon)e^{-A/RT}/V$, a second-order rate which would be
produced as the low-pressure form of a unimolecular reaction. For these,

$$M = PW(\lambda B)^{1/2}\,k(T_0, T_m, DPW/\lambda, n, A, c) \tag{18}$$

and

$$\{x\} = (\lambda/BP^2W^2)^{1/2}\,l(T_0, T_m, DPW/\lambda, n, A, c). \tag{19}$$

The functions k and l are different in cases II and III.
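
The pressure laws contained in (16) to (19) can be put in a form convenient for quick estimates. The sketch below assumes, as stated in the text, that DP/λ does not vary with pressure, so that the unnamed functions stay fixed and only the prefactors change: M varies as the square root of P and the thickness as its inverse for a first-order rate, and M varies as P and the thickness as 1/P for a second-order rate. The function name and the reference values are hypothetical.

```python
# Pressure exponents implied by equations (16) to (19), assuming D*P/lambda
# is independent of pressure:
#   first-order rate : M ~ P**0.5, thickness ~ P**-0.5
#   second-order rate: M ~ P,      thickness ~ P**-1
EXPONENTS = {1: (0.5, -0.5), 2: (1.0, -1.0)}


def scale_with_pressure(m_ref, x_ref, p_ref, p_new, order):
    """Return (M, thickness) at p_new from reference values measured at p_ref."""
    if order not in EXPONENTS:
        raise ValueError("order must be 1 or 2")
    m_exp, x_exp = EXPONENTS[order]
    ratio = p_new / p_ref
    return m_ref * ratio ** m_exp, x_ref * ratio ** x_exp


for order in (1, 2):
    m, x = scale_with_pressure(1.0, 1.0, 1.0, 4.0, order)
    print(f"order {order}: M multiplied by {m:.2f}, thickness multiplied by {x:.2f}")
```

Since the linear flame speed is $MV_0$ and $V_0$ varies as 1/P, the same exponents imply a speed falling as the inverse square root of P for the first-order case and independent of pressure for the second-order cases.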

(b) Dependence on $\lambda$ and D

It will be noticed that all the previous results have the form

$$M \text{ (or } \{x\}) = \lambda^{1/2} \times \text{function of } (D/\lambda).$$

This can be proved, by the methods of the preceding section, under more general
conditions, which are that the conductivity $\lambda$ and the diffusion coefficient D are
independent of position in the flame.

Theories neglecting diffusion have given the result that the flame speed is proportional
to $\lambda^{1/2}$, and this has been tested by experiment. The comparison has been
usually not very encouraging (cf. Coward & Payman 1937; Lewis & von Elbe 1938b),
since the flame speed does not appear to increase so fast as $\lambda^{1/2}$. This conclusion is not
rigorous, because the flame speeds are measured for a series of mixtures of gases
with very different conductivities (for example, ... and O₂), and the change in $\lambda$ is
guessed from the change in the proportions of the two gases. The conductivity of
a mixture is apt to vary in a manner which one would not expect from the conductivities
of its components.

That the flame speed varies less rapidly than $\lambda^{1/2}$ has been taken to mean that the
reaction rate does not depend solely on the concentrations and the temperature, but
is influenced also by the presence of small quantities of active particles (atoms or
radicals). However, failure of the law $M \propto \lambda^{1/2}$ may sometimes be due to $D/\lambda$ not being
the same for all the mixtures tested.

Damköhler (1940) has published an experimental and theoretical paper on the
effect of turbulence on the bunsen flame, at Reynolds numbers up to 17,000. The
effects noted are (a) alteration of shape, caused by turbulence with scale of irregularities
greater than the thickness of the flame zone; (b) increase of flame speed,
due to increase of heat conductivity by small-scale turbulence. The flame theory
used by Damköhler was of the simple Mallard type, and the rate of burning was
proportional to $\lambda^{1/2}$, since diffusion of the reactant was neglected. The experimental
behaviour was in reasonable agreement with the semi-quantitative theory.


(c) Boundary conditions

Let

$$G = \epsilon - (D/MV)\,d\epsilon/dx, \tag{20}$$

so that (4) and (5) become

$$dG/dx = \phi/MV \tag{21}$$

and

$$\frac{d}{dx}\left(\lambda\frac{dT}{dx}\right) - M\frac{d}{dx}\left[\{c_p'(1-\epsilon)+c_p''\epsilon\}T\right] + MQ\frac{dG}{dx} = 0. \tag{22}$$

The condition at the hot end of the flame is $\epsilon = 1$ and $T = T_m$ simultaneously.

Although this condition is sufficient to determine a solution, it may be useful to
mention the behaviour of certain other quantities at and near the hot end of the
flame. Provided the reaction rate $\phi$ is not infinite anywhere, we deduce from (8)
and (9) the continuity of $d\epsilon/dx$ and $dT/dx$. These are zero in the fully reacted gas,
and hence also at the hot end-point of the flame. From (20) it follows that G tends
to unity continuously as the hot end of the flame is approached.


(d) Approximate solutions

Boys & Corner (1949) have shown how the equations, neglecting diffusion, can
be solved by a method of successive approximation. The same method will now be
applied to the equations with diffusion. The method will first be applied to a first-order
reaction, for which the equations are most easily handled. In this case

$$\phi = B_1(1-\epsilon)e^{-A/RT}. \tag{23}$$

It is assumed that the specific heats of reactant and products are c cal./g. and that
c, $\lambda$ and $D/\lambda$ can be assumed independent of temperature. In real gases, c increases
slowly with temperature, and $\lambda$ is proportional to $T^{1/2}$ very closely; $D/\lambda$ is proportional
to T, roughly. It will appear that the flame extends to an infinite distance on the
hot side, so that $dT/dx$ tends to zero as G and $\epsilon$ tend to 1. Hence, integrating (9)
with $\phi/V = M\,dG/dx$ from (21),

$$dT/dx = M\{c(T-T_m) + Q(1-G)\}/\lambda. \tag{24}$$

Also, from (10),

$$V = RT(1+n\epsilon)/PW. \tag{25}$$

From (20), (24) and (25),

$$G = \epsilon - DPW\{c(T-T_m) + Q(1-G)\}(d\epsilon/dT)/\lambda RT(1+n\epsilon). \tag{26}$$

This can be turned into an explicit equation for G as a function of $\epsilon$, T and $d\epsilon/dT$.
It is more convenient, however, to use it to find $d\epsilon/dT$ from $\epsilon$, G and T, giving

$$d\epsilon/dT = \lambda RT(1+n\epsilon)(\epsilon - G)/DPW\{c(T-T_m) + Q(1-G)\}. \tag{27}$$

Equations (24) to (27) apply to any form of reaction velocity. Using (24) and the
reaction rate (23) in (21),

$$\frac{dG}{dT} = \frac{\lambda B_1(1-\epsilon)PWe^{-A/RT}}{M^2RT(1+n\epsilon)\{c(T-T_m) + Q(1-G)\}}. \tag{28}$$

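We now find a first approximation, valid near the ‘burnt’ end, by keeping only the
terms dominant in this region. Write

$$\frac{M^2R}{\lambda B_1PW} = \chi, \quad \frac{\lambda B_1PWe^{-A/RT_m}}{M^2RT_m(1+n)c} = \beta, \quad \frac{\lambda RT_m(1+n)}{cDPW} = \gamma, \quad \frac{c(T_m-T)}{Q} = \xi, \quad 1-\epsilon = \eta, \quad 1-G = \zeta. \tag{29}$$

$\xi$, $\eta$, $\zeta$ are all positive by definition, and tend to zero as the hot boundary of the
flame is approached. The approximate equations are, from (28),

$$\frac{d\zeta}{d\xi} = \frac{\lambda B_1PWe^{-A/RT_m}\,\eta}{cM^2RT_m(1+n)(\zeta-\xi)} = \frac{\beta\eta}{\zeta-\xi}, \tag{30}$$

and from (27)

$$\frac{d\eta}{d\xi} = \frac{\lambda RT_m(1+n)(\zeta-\eta)}{cDPW(\zeta-\xi)} = \frac{\gamma(\zeta-\eta)}{\zeta-\xi}. \tag{31}$$

The solution of these equations, passing through $\xi = \eta = \zeta = 0$, is

$$\eta = \theta\zeta, \quad \zeta = f\xi,$$

where

$$f = 1 + \beta\theta, \quad \theta = \frac{(1+4\beta/\gamma)^{1/2} - 1}{2\beta/\gamma}. \tag{32}$$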

For a second approximation (32) is used to eliminate e and T in the exact equation 
(28), and the resulting separable equation between G and T can be int^rated. This 
is the same as the procedure adopted for the equations without diffusion. When 
D = 0, fijy = 0, and therefore 0=1 and ^ = ^, as, of course, must happen in this 
case by definition. The equation between G and T is 

dGfdT = XB:,PWe''^^^^e{l-G)lMmT{l^n^ne(l^Q)}{Q{l-G)--Q{l-G)lf} 

= 1 ){ 1+^— ^(1 — <?)}• 

Hence (1 +») ((?- 1)+^(G- 1)* = ^ 

(33) 

- j(PQ\a\^ 

When $G = 0$, $T$ is so small that $e^{-A/RT}$ is negligible. Hence the equation which determines the flame speed is

(34) 

M is found from which enters (34) through and 6; the latter is a function of 
which is determined by Equation (34) could be turned into an explicit expression 
for $\chi$, but this would be cumbersome. It is easy to find $\chi$ from (34) by a few trials.



J. Corner


When $D = 0$, (34) reduces to the result

A numerical example will show the order of magnitude of the effect of diffusion. A typical case would be: a unimolecular decomposition of a substance of molecular weight 75, into 5 molecules of average molecular weight 15; initial temperature 300 °K; final temperature 2500 °K; specific heat = 0.4 cal/g; heat conductivity $2 \times 10^{-4}$ cal/cm sec deg C; both these are assumed constant through the flame. The activation energy $A$ is assumed to be 30 kcal/mole.

Table 1 shows the variation of $\chi$ and $\chi^{1/2}$ with the value of $DP$. $\chi^{1/2}$ is proportional to the flame consumption $M$. To give an idea of the magnitude of the diffusion rates in the table, it may be noted that for N₂O-CO₂ diffusion (extrapolated to 2000 °K), $DP$ is about 5 cm²/sec. Table 1 shows that for such a diffusion coefficient, the speed calculated by neglecting diffusion would be more than twice the proper value.

Table 1. Effect of diffusion on a first-order flame

    DP (atm × cm²/sec)      χ                 χ^(1/2)
    0                       21.8 × 10⁻⁸       4.67 × 10⁻⁴
    1.25                    10.6 × 10⁻⁸       3.26 × 10⁻⁴
    2.5                     6.37 × 10⁻⁸       2.52 × 10⁻⁴
    5                       3.40 × 10⁻⁸       1.84 × 10⁻⁴
    10                      1.74 × 10⁻⁸       1.32 × 10⁻⁴
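The statement that neglecting diffusion more than doubles the calculated speed at $DP$ = 5 can be checked directly from the tabulated $\chi$, since the flame consumption $M$ is proportional to $\chi^{1/2}$. The following is a minimal sketch; the variable and function names are illustrative, and the $\chi$ values are those read from table 1.

```python
import math

# (DP in atm x cm^2/sec, chi) pairs as read from table 1
TABLE_1 = [
    (0.0, 21.8e-8),
    (1.25, 10.6e-8),
    (2.5, 6.37e-8),
    (5.0, 3.40e-8),
    (10.0, 1.74e-8),
]


def relative_consumption(chi, chi_no_diffusion):
    """Ratio M(with diffusion) / M(D = 0), using M proportional to sqrt(chi)."""
    return math.sqrt(chi / chi_no_diffusion)


chi_zero = TABLE_1[0][1]
ratios = {dp: relative_consumption(chi, chi_zero) for dp, chi in TABLE_1}

# Factor by which the D = 0 calculation overestimates the speed at DP = 5
overestimate_at_dp5 = 1.0 / ratios[5.0]
```

On these figures the overestimate at $DP$ = 5 is a factor of about 2.5, and the square roots of the $\chi$ column reproduce the $\chi^{1/2}$ column to the quoted three figures.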


A numerical integration of a particular first-order flame is discussed in § 5(e). It is shown that in this case the approximate solution gives a flame speed about 5 % too big.

The solution is almost the same for the type of reaction denoted by 'Case III' in the paper by Boys & Corner (1949):

^ = J?s(l -e) (1 -i-«e) 

This means that reaction takes place in sufficiently violent collisions of reactant with any type of molecule; this could be the low-pressure form of a unimolecular decomposition. Let

$$\frac{e^{-A/RT_m}}{\chi T_m (1 + n) c} = \beta, \qquad \frac{\lambda RT_m (1 + n)}{cDPW} = \gamma. \tag{37}$$

^ -8 ^ 

A bimolecular reaction (case II) can be solved by the same methods, but not without a considerable amount of numerical computation. Three tables (2, 3 and 4) are given here, which enable flame speeds to be calculated in a few minutes. These tables are believed to cover most of the cases that will be encountered in practice. The method of construction is described in the appendix.

It will now be explained how to use these tables. The reaction rate is

^ = ^ 2(1 -efo-^RTjy^ 


Write 


(38) 


$$\frac{\lambda RT_m (1 + n)}{cDPW} = \gamma. \tag{39}$$




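The definition of $\beta$ is somewhat different from that in case III. The equation which determines the flame speed is

$$\frac{\gamma E_0}{\beta} - \frac{2n}{1 + n}\left(\frac{\gamma}{\beta}\right)^2 \int_0^{E_0} N\,dE + \left(\frac{n}{1 + n}\right)^2 \left(\frac{\gamma}{\beta}\right)^3 \int_0^{E_0} N^2\,dE = \left(\frac{cT_m}{Q}\right)\left(\frac{RT_m}{A}\right). \tag{40}$$

In tables 2, 3 and 4 are listed $\gamma E_0/\beta$, $(\gamma/\beta)^2 \int_0^{E_0} N\,dE$ and $(\gamma/\beta)^3 \int_0^{E_0} N^2\,dE$ for certain values of $\gamma$ and $\beta\gamma/(\gamma + 15)$. This choice of variable makes the results for different $\gamma$ more easily comparable.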

To avoid interpolation of double entry tables, the flame speed should be found in the following way. From the given data, calculate $\gamma$ and pick the nearest of the tabulated $\gamma$. With this $\gamma$ and an assumed $\beta/\gamma$ enter tables 2, 3 and 4, and calculate the left-hand side of (40). Repeat until $(cT_m/Q)(RT_m/A)$ has been bracketed, and find $\beta/\gamma$ by interpolation. This gives $\beta$ corresponding to the $\gamma$ used. The true diffusion coefficient varies by a factor of as much as two in going through the effective region of the reaction zone, and therefore the $\gamma$ used in entering the tables will often correspond to a value of $D$ which is probably as good as that in the original data. If, however, it is required to find the flame speed corresponding exactly to a given non-tabulated $\gamma$, repeat with a different (tabulated) $\gamma$, and interpolate to find $\beta/\gamma$ corresponding to the true $\gamma$. This interpolation is easily carried out, since the tables are given at equal intervals of $\log\gamma$.
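The bracketing procedure can be sketched in a few lines. The example below is illustrative only: it takes the $\gamma$ = 4 column of tables 2, 3 and 4 (first five rows), the data of the second-order example in the text ($T_0$ = 300 °K, $T_m$ = 2500 °K, $c$ = 0.4 cal/g, $A$ = 30 kcal/mole, $n$ = 4), and assumes $Q = c(T_m - T_0)$ and $R$ = 1.987 cal/mole deg, neither of which is stated explicitly here. The choice $\gamma$ = 4 is arbitrary.

```python
R_GAS = 1.987  # cal/(mole deg K); assumed value of the gas constant

# gamma = 4 column of tables 2, 3 and 4 (first five rows only)
X_ROWS = [1.0, 1.5, 2.0, 3.0, 4.0]           # beta*gamma/(gamma + 15)
T2 = [0.404, 0.351, 0.316, 0.272, 0.244]     # gamma*E0/beta
T3 = [0.125, 0.100, 0.085, 0.067, 0.056]     # (gamma/beta)^2 * integral of N dE
T4 = [0.055, 0.041, 0.033, 0.023, 0.018]     # (gamma/beta)^3 * integral of N^2 dE


def lhs_40(n, t2, t3, t4):
    """Left-hand side of (40) from one row of tables 2, 3 and 4."""
    k = n / (1.0 + n)
    return t2 - 2.0 * k * t3 + k * k * t4


def solve_row_variable(n, target):
    """Bracket the target between adjacent rows and interpolate linearly."""
    values = [lhs_40(n, a, b, c) for a, b, c in zip(T2, T3, T4)]
    for i in range(len(values) - 1):
        upper, lower = values[i], values[i + 1]  # LHS falls down the column
        if lower <= target <= upper:
            frac = (upper - target) / (upper - lower)
            return X_ROWS[i] + frac * (X_ROWS[i + 1] - X_ROWS[i])
    raise ValueError("target not bracketed by the tabulated rows")


T0, TM, C, A, N_MOL, GAMMA = 300.0, 2500.0, 0.4, 30000.0, 4.0, 4.0
Q = C * (TM - T0)                       # assumption: heat release per gram
target = (C * TM / Q) * (R_GAS * TM / A)
x = solve_row_variable(N_MOL, target)   # beta*gamma/(gamma + 15)
beta = x * (GAMMA + 15.0) / GAMMA
```

Under these assumptions the right-hand side of (40) is about 0.188, which is bracketed by the rows 2 and 3, and linear interpolation gives $\beta\gamma/(\gamma + 15)$ close to 2.6.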


Table 2. $\gamma E_0/\beta$

    βγ/(γ+15)                          γ
                 0.5     1       2       4       8       16      ∞
    1            0.390   0.376   0.380   0.404   0.447   0.510   0.596
    1.5          0.354   0.337   0.336   0.351   0.384   0.431   0.517
    2            0.330   0.311   0.306   0.316   0.342   0.381   0.461
    3            0.298   0.277   0.268   0.272   0.290   0.318   0.386
    4            0.276   0.255   0.243   0.244   0.257   0.279   0.335
    5            0.260   0.238   0.226   0.224   0.233   0.251   0.299
    6            0.248   0.226   0.212   0.209   0.216   0.230   0.271
    7            0.238   0.216   0.202   0.197   0.202   0.213   0.248
    8            0.229   0.207   0.193   0.187   0.190   0.200   0.229
    9            0.222   0.200   0.186   0.179   0.181   0.189   0.214
    10           0.216   0.193   0.179   0.172   0.172   0.179   0.201
    11           0.210   0.188   0.173   0.165   0.165   0.171   0.190
    12           0.205   0.183   0.168   0.160   0.159   0.164   0.180
    13           0.201   0.179   0.163   0.155   0.153   0.157   0.171
    14           0.197   0.175   0.159   0.151   0.149   0.151   0.163
    15           0.193   0.171   0.155   0.147   0.144   0.146   0.156
    16           0.189   0.168   0.152   0.143   0.140   0.142   0.150


Table 5 shows the effect of diffusion on a flame with the following characteristics: second-order reaction (case II); $T_0$ = 300 °K; $T_m$ = 2500 °K; $c$ = 0.4 cal/g; $A$ = 30 kcal/mole; $W$ = 75; $n$ = 4; heat conductivity $2 \times 10^{-4}$ cal/cm sec deg C. The effect of diffusion is of the same order of magnitude as for a flame with a first-order reaction.



Table 3. $(\gamma/\beta)^2 \int_0^{E_0} N\,dE$

(Values in brackets were found by interpolation.)

    βγ/(γ+15)                          γ
                 0.5     1       2       4       8       16       ∞
    1            0.052   0.071   0.095   0.125   0.155   (0.180)  0.255
    1.5          0.042   0.057   0.077   0.100   0.125   (0.147)  0.212
    2            0.036   0.048   0.065   0.085   0.107   (0.127)  0.183
    3            0.029   0.038   0.051   0.067   0.084   0.099    0.146
    4            0.024   0.032   0.042   0.056   0.071   0.084    0.123
    5            0.021   0.028   0.037   0.049   0.061   0.073    0.106
    6            0.019   0.025   0.033   0.044   0.055   0.066    0.093
    7            0.017   0.023   0.030   0.039   0.050   0.059    0.083
    8            0.016   0.021   0.028   0.036   0.045   0.054    0.076
    9            0.015   0.020   0.026   0.034   0.042   0.060    0.070
    10           0.014   0.019   0.025   0.032   0.039   0.047    0.064
    11           0.013   0.018   0.023   0.030   0.037   0.044    0.059
    12           0.013   0.017   0.022   0.028   0.035   0.041    0.055
    13           0.012   0.016   0.021   0.027   0.033   0.039    0.052
    14           0.012   0.015   0.020   0.025   0.031   0.037    0.049
    15           0.011   0.015   0.019   0.024   0.030   0.035    0.046
    16           0.011   0.014   0.018   0.023   0.029   0.034    0.044


Table 4. $(\gamma/\beta)^3 \int_0^{E_0} N^2\,dE$

(Values in brackets were found by interpolation.)

    βγ/(γ+15)                          γ
                 0.5     1       2       4       8       16       ∞
    1            0.009   0.018   0.033   0.055   0.081   (0.109)  0.159
    1.5          0.006   0.013   0.024   0.041   0.062   (0.085)  0.130
    2            0.005   0.010   0.019   0.033   0.050   (0.070)  0.111
    3            0.004   0.007   0.013   0.023   0.037   0.051    0.087
    4            0.003   0.005   0.011   0.018   0.029   0.041    0.072
    5            0.002   0.005   0.009   0.015   0.025   0.035    0.061
    6            0.002   0.004   0.007   0.013   0.021   0.030    0.054
    7            0.002   0.003   0.006   0.011   0.018   0.027    0.048
    8            0.001   0.003   0.006   0.010   0.016   0.024    0.043
    9            0.001   0.003   0.005   0.009   0.015   0.022    0.039
    10           0.001   0.002   0.005   0.008   0.014   0.020    0.036
    11           0.001   0.002   0.004   0.008   0.013   0.018    0.033
    12           0.001   0.002   0.004   0.007   0.012   0.017    0.031
    13           0.001   0.002   0.004   0.007   0.011   0.016    0.029
    14           0.001   0.002   0.003   0.006   0.010   0.015    0.027
    15           0.001   0.002   0.003   0.006   0.009   0.014    0.026
    16           0.001   0.002   0.003   0.005   0.009   0.013    0.025


Table 5. Effect of diffusion on a second-order flame


DP 

(atm. X cm.2/sec.) 10’“^;!^i 

    0          3.16
    0.85       2.28
    1.71       1.77
    3.42       1.12
    6.84       0.63
    13.68      0.33




The behaviour of $\epsilon$ near the cold side of the flame is not immediately obvious from this method of approximation. $G$ of course decreases continually from 1 to 0 on passing through the flame from hot to cold side. At the cool end there is a large region in which there is little chemical reaction, and so $G = 0$. In this zone, $\epsilon$ and $T$ are connected by the relation, derived from (27),

$$\frac{d\epsilon}{dT} = \frac{\lambda RT(1 + n\epsilon)\epsilon}{DPWc(T - T_0)},$$

of which the solutions are

$$\frac{\epsilon}{1 + n\epsilon} = H \exp\left(\frac{qT}{T_0}\right)(T - T_0)^q, \tag{41}$$

where $H$ is an arbitrary constant, and $q = \lambda RT_0/DPWc$.
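That the form (41) satisfies the cold-side relation can be confirmed by a finite-difference check. The sketch below is illustrative: it writes $q/T_0$ for the group $\lambda R/DPWc$, and the numerical values of $H$, $q$ and $n$ are hypothetical, chosen only so that $\epsilon$ stays between 0 and 1.

```python
import math


def eps_cold(T, H, q, n, T0):
    """epsilon from (41): eps/(1 + n*eps) = H*exp(q*T/T0)*(T - T0)**q."""
    u = H * math.exp(q * T / T0) * (T - T0) ** q
    return u / (1.0 - n * u)


def deps_dT(T, eps, q, n, T0):
    """Right-hand side of the cold-side relation (G = 0), with q/T0 standing
    for lambda*R/(D*P*W*c)."""
    return (q / T0) * T * (1.0 + n * eps) * eps / (T - T0)


H, q, n, T0 = 1.0e-3, 0.5, 4.0, 300.0   # hypothetical illustrative values
T, h = 600.0, 1.0e-3

# central-difference derivative of (41) compared with the differential relation
numeric = (eps_cold(T + h, H, q, n, T0) - eps_cold(T - h, H, q, n, T0)) / (2.0 * h)
exact = deps_dT(T, eps_cold(T, H, q, n, T0), q, n, T0)
relative_error = abs(numeric - exact) / exact
```

The arbitrary constant $H$ is what prevents the integration from being started at the cold end: it is fixed only by matching to the region where $G$ is not zero.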

(e) Detailed structure of a special case 

To test the method of approximation, a typical first-order flame has been solved exactly by numerical integration of the equation. The case taken was that already given as an example (table 1), with $DP$ = 2.5 (cm²/sec) × atm. The approximate method shows that $\chi = 6.37 \times 10^{-8}$. Numerical integration with a few trial values of $\chi$ showed that this flame would really have $\chi = 5.76 \times 10^{-8}$. In this case, therefore, the approximate method gives a flame speed about 5 % too large. This error is of the same order of magnitude as in the case of negligible diffusion (Boys & Corner 1949).

The numerical integration was started at the hot boundary of the flame, with the first approximation as a guide to the behaviour of the solution in this region. The integration was carried out in steps of 60 °K near the highest temperature and then in steps of 100 °C down to 1000 °K. Below this there was practically no chemical reaction, and the value of $G$ at the lower boundary ($T$ = 300 °K) was easily estimated. For the true $\chi$, $G$ is zero here, and hence the proper $\chi$ was found by an interpolation of the $G$'s for various $\chi$.

For the case of negligible diffusion, it had been found convenient to start the integration at the cold end. This was not possible in the present case, because near the cold end $\epsilon$ is determined by equation (41), in which there is an unknown constant; the particular solution needed is settled by the behaviour at higher temperatures where $G$ is not zero. Therefore it is simpler to start at the hot boundary, even though the convergence of the numerical integration process is rather tricky here.

The detailed structure of the flame is shown in figures 1 and 2, in which $\epsilon$ and $G$ are plotted against $T$ and $x(B_1 P)^{1/2}$ respectively. $G$ rises smoothly from 0 to 1, and below 1000 °K is effectively zero. On the other hand, $\epsilon$ does not approach zero until close to 300 °K, and therefore $\epsilon$ is needed in a temperature region where $D$ is much smaller than at high temperatures. The $\epsilon$ curve of figures 1 and 2 has the following parts: above 1000 °K, it has $DP$ = 2.5; below 1000 °K, curve I has $DP$ = 2.5 and curve II has $DP$ = 0.2. The latter is a reasonable average for 300 to 1000 °K, if we assume that $DP$ = 2.5 refers to a temperature in the neighbourhood of 2000 °K. It can be seen that even with the smaller diffusion coefficient there is an appreciable diffusion of products ahead of the flame. The case of a diffusion coefficient which depends on temperature will be sufficiently clear from the two curves given.



Figure 1. Structure of a typical first-order flame. Data in text. (Abscissa: temperature, °K, from 300 to 2500; ordinate from 0 to 1; curves I and II.)

Figure 2. Structure of a typical first-order flame. Data in text.


Appendix

Approximate solution of flame with second-order reaction

Write

$$\ldots, \qquad 1 - \epsilon = \eta, \qquad 1 - G = \zeta. \tag{43}$$

The only difference from the notation of case III is in the definition of $\beta$. As usual

= ( 44 ) 

The exact equation for the reaction is 






and near the hot boundary this reduces to

(46)

From (44) and (46),

(47)

Write $\zeta - \eta = \gamma\Theta/\beta$, $\eta = \gamma N/\beta$. $\Theta$ is positive for sufficiently small $N$, since $d\epsilon/dT$ is positive for $T$ sufficiently close to $T_m$. (47) becomes

$$1 + \frac{d\Theta}{dN} = \frac{N^2}{\Theta}. \tag{48}$$

The solution required is that with $\Theta = 0$ for $N = 0$, and $\Theta$ positive for small $N$. This solution behaves as $\Theta = N^2$ for small $N$. Put

$$\Theta = N^2 f(N). \tag{49}$$

Then

$$1 + 2Nf + N^2\frac{df}{dN} = \frac{1}{f}. \tag{50}$$


As $N$ tends to zero, $f$ tends to unity. There is no useful solution in the form of a power series in $N$ near $N = 0$. Numerical integration is also impracticable, at any rate in the range of $N$ between 0 and 0.5, because $N^2\,df/dN$ is obtained as the difference of two nearly equal quantities. This fact assures the rapid convergence of a process suggested by Dr A. F. Devonshire: $N^2\,df/dN$ is neglected, and (50) solved for $f$ as a function of $N$; this first approximation is used in the $N^2\,df/dN$ term, and the quadratic for $f_2$ solved. This process can be taken to the third approximation quite easily. Table 6 shows that the convergence is rapid even for $N$ as large as 10. For the present purpose it is sufficient to use the third approximation.
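The successive-approximation process can be sketched as follows. Each step treats $K = N^2\,df/dN$ as known from the previous approximation, so that (50) becomes the quadratic $2Nf^2 + (1 + K)f - 1 = 0$. In this sketch $K_2$ is written in closed form, while $K_3$ is obtained by a central difference of $f_2$; the numerical differentiation is a convenience of the example, not the method of the paper, and the function names are illustrative.

```python
import math


def solve_quadratic(N, K):
    """Positive root of 2*N*f**2 + (1 + K)*f - 1 = 0, i.e. (50) with
    N**2 * df/dN replaced by a known value K."""
    b = 1.0 + K
    return (math.sqrt(b * b + 8.0 * N) - b) / (4.0 * N)


def f1(N):
    """First approximation: N**2 * df/dN neglected."""
    return solve_quadratic(N, 0.0)


def K2(N):
    """N**2 * d(f1)/dN in closed form."""
    return 0.25 * (1.0 - (1.0 + 4.0 * N) / math.sqrt(1.0 + 8.0 * N))


def f2(N):
    """Second approximation."""
    return solve_quadratic(N, K2(N))


def f3(N, h=1.0e-5):
    """Third approximation, with K3 = N**2 * d(f2)/dN by central difference."""
    K3 = N * N * (f2(N + h) - f2(N - h)) / (2.0 * h)
    return solve_quadratic(N, K3)


first = [round(f1(N), 4) for N in (0.25, 0.5, 2.0, 10.0)]
```

The first and second approximations computed this way agree with the $f_1$ and $f_2$ columns of table 6 to the fourth decimal place, and the third approximation agrees closely at $N$ = 0.5 and $N$ = 2.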


Table 6. Successive approximations to $f$

    N       f₁        10⁴(f₂ − f₁)    f₂        10⁴(f₃ − f₂)    f₃
    0.25    0.7321    166             0.7487    -10             0.7477
    0.5     0.6181    242             0.6423    -14             0.6409
    2       0.3904    293             0.4197    4               0.4201
    10      0.2000    208             0.2208    16              0.2224

The formulae are

$$f_r = \frac{\{(1 + K_r)^2 + 8N\}^{1/2} - (1 + K_r)}{4N},$$

where $K_1 = 0$,

$$K_2 = \frac{1}{4}\left[1 - \frac{1 + 4N}{(1 + 8N)^{1/2}}\right],$$

and

$$K_3 = \frac{1 + K_2}{4} - \frac{\{(1 + K_2)^2 + 8N\}^{1/2}}{4} - \frac{N^2(1 + K_2)}{(1 + 8N)^{3/2}\{(1 + K_2)^2 + 8N\}^{1/2}} + \frac{N}{\{(1 + K_2)^2 + 8N\}^{1/2}} + \frac{N^2}{(1 + 8N)^{3/2}}.$$


The following asymptotic formula (suggested by Mr E. P. Hicks) was used for $N$ greater than 8:

$$\Theta = 0.8164966N^{3/2} - 0.4N + 0.146969N^{1/2} - 0.048 + 0.022045N^{-1/2} - 0.014109N^{-1} + \ldots. \tag{51}$$
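The asymptotic series can be evaluated and checked against (48) in a few lines. This is a minimal sketch: the exponents 3/2, 1, 1/2, 0, -1/2 and -1 are those implied by balancing powers of $N$ in (48) (the leading coefficient is then $(2/3)^{1/2}$), and the function names are illustrative.

```python
import math

# (coefficient, power of N) pairs of the asymptotic series (51)
COEFFS = [
    (0.8164966, 1.5),
    (-0.4, 1.0),
    (0.146969, 0.5),
    (-0.048, 0.0),
    (0.022045, -0.5),
    (-0.014109, -1.0),
]


def theta_asymptotic(N):
    """Theta(N) from the truncated series (51); intended for large N."""
    return sum(a * N ** p for a, p in COEFFS)


def residual_48(N, h=1.0e-4):
    """Residual of (48), 1 + dTheta/dN - N**2/Theta, using a central difference."""
    slope = (theta_asymptotic(N + h) - theta_asymptotic(N - h)) / (2.0 * h)
    return 1.0 + slope - N * N / theta_asymptotic(N)


f_at_10 = theta_asymptotic(10.0) / 10.0 ** 2   # f = Theta / N**2
```

At $N$ = 10 the series gives $f$ close to 0.2224, in agreement with the third approximation in table 6.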

Write $\xi = \gamma E/\beta$. Substituting this and $\zeta = \gamma[N + N^2 f(N)]/\beta$ in (46),

$$N + N^2 f - E = \gamma N^2 f\,\frac{dE}{dN}. \tag{52}$$





The solution is needed which has $E = 0$ at $N = 0$. Solutions for certain special values of $\gamma$ are: $\gamma = 1$, $E = N$; $\gamma = 0$, $E = N + N^2 f$; $\gamma = \infty$, $E = 0$. In the general case, write

$$E = N - \psi(N). \tag{53}$$

Then

$$\frac{d\psi}{dN} + \frac{\psi}{\gamma N^2 f} = \frac{\gamma - 1}{\gamma}. \tag{54}$$

Let $\int^N dN/N^2 f(N) = \phi(N)$. The required solution of (54) is

$$\psi = \frac{\gamma - 1}{\gamma}\,e^{-\phi/\gamma}\int_0^N e^{\phi(x)/\gamma}\,dx. \tag{55}$$

For small and large $\gamma$ the computations are more easily carried out with slightly different forms of solution.

Using the table of $f(N)$, the function $E$ was tabulated for various values of $\gamma$ and $N$. The second approximation is now constructed in the usual way. The equation connecting $G$ and $T$ is

$$\frac{dG}{dT} = \frac{(1 - \epsilon)^2 e^{-A/RT}}{\chi T^2 (1 + n\epsilon)^2\{c(T - T_m) + Q(1 - G)\}} = \frac{\eta^2 e^{-A/RT}}{\chi T^2 Q(\zeta - \xi)(1 + n - n\eta)^2}.$$


Hence 




The left-hand side $= Re^{-A/RT_m}/QA\chi$, since $e^{-A/RT_0}$ is small compared to $e^{-A/RT_m}$.

The right-hand side of (56) is

$$\int_0^{E_0}(1 + n - n\gamma N/\beta)^2\,dE,$$

where $E = E_0$ when $\zeta = 1$, that is, when

$$N + N^2 f(N) = \beta/\gamma. \tag{57}$$

Let $N_0$ be the solution of this equation. Hence

y{l+n)^E^-2n{l+n)^ NdE+^ NHE = R&-mT„,IQAx, 
which reduces to 


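$$\frac{\gamma E_0}{\beta} - \frac{2n}{1 + n}\left(\frac{\gamma}{\beta}\right)^2 \int_0^{E_0} N\,dE + \left(\frac{n}{1 + n}\right)^2 \left(\frac{\gamma}{\beta}\right)^3 \int_0^{E_0} N^2\,dE = \left(\frac{cT_m}{Q}\right)\left(\frac{RT_m}{A}\right). \tag{58}$$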


This equation determines $\beta$ and so $M$.

The upper limit $E_0$ is found by solving (57) for $N_0$ as a function of $\beta/\gamma$, and hence finding $E_0$ as a function of $\gamma$ and $\beta/\gamma$. Integrals such as $\int_0^{E_0} N\,dE$ are found by integration by parts; for example, $\int_0^{E_0} N\,dE = N_0(E_0 - N_0/2) + \int_0^{N_0}\psi\,dN$. For these purposes, $\int\psi\,dN$ and $\int N\psi\,dN$ were computed.

In § 5(d) have been tabulated $\gamma E_0/\beta$, $(\gamma/\beta)^2\int_0^{E_0} N\,dE$, and $(\gamma/\beta)^3\int_0^{E_0} N^2\,dE$ (tables 2, 3 and 4 respectively). They are functions of $\beta/\gamma$ and $\gamma$ only.




The auxiliary functions $f(N)$ and $E(N, \gamma)$ do not appear to have any other application, so they are not given here.

For zero diffusion, tables have been given previously (Boys & Corner 1949). The functions $\gamma E_0/\beta$, $(\gamma/\beta)^2\int_0^{E_0} N\,dE$ and $(\gamma/\beta)^3\int_0^{E_0} N^2\,dE$ for the limit of zero $D$ (infinite $\gamma$) were tabulated in that paper as $g_1(1/\beta)$, $g_2(1/\beta)$ and $g_3(1/\beta)$. They have been incorporated in the present tables.

Certain results for large $\gamma$ and small $\beta\gamma/(\gamma + 15)$ were found by interpolation, to avoid a lengthy re-computation of certain functions at smaller intervals. These results are given in brackets.

I am indebted to Dr S. F. Boys for many discussions, to Professor Sir John Lennard- 
Jones, F.R.S., for his continued encouragement of this research, and to the Chief 
Scientist, Ministry of Supply, for permission to publish this paper. 


Boys, S. F. & Corner, J. 1949 Proc. Roy. Soc. A, 197, 90.

Coward, H. F. & Hartwell, F. J. 1932 J. Chem. Soc. pp. 1996, 2676.

Coward, H. F. & Payman, W. 1937 Chem. Rev. 21, 269.

Crussard, L. 1914 C.R. Acad. Sci., Paris, 158, 125, 340.

Damköhler, G. 1936 Z. Elektrochem. 42, 846.

Damköhler, G. 1940 Z. Elektrochem. 46, 601.

Daniell, P. J. 1930 Proc. Roy. Soc. A, 126, 393.

Jost, W. 1935 Z. Elektrochem. 41, 183, 232.

Jost, W. 1936 Z. Elektrochem. 42, 461.

Jost, W. & Müffling, L. v. 1937 Z. phys. Chem. A, 181, 208.

Jouguet, E. 1913a C.R. Acad. Sci., Paris, 156, 872.

Jouguet, E. 1913b C.R. Acad. Sci., Paris, 156, 1058.

Jouguet, E. 1924 C.R. Acad. Sci., Paris, 179, 454.

Jouguet, E. & Crussard, L. 1919 C.R. Acad. Sci., Paris, 168, 820.

Lewis, B. & von Elbe, G. 1934 J. Chem. Phys. 2, 537.

Lewis, B. & von Elbe, G. 1938a Combustion, flames and explosions of gases, p. 160. Cambridge University Press.

Lewis, B. & von Elbe, G. 1938b Combustion, flames and explosions of gases, Qb,'SJi. Cambridge University Press.

Mallard, E. 1875 Ann. Min., Paris, 7, 355.

Mallard, E. & le Chatelier, H. 1883 Ann. Min., Paris, 4, 274.

Nusselt, W. 1915 Ver. dtsch. Ing. 59, 872.

Zeldowitsch, J. & Frank-Kamenetsky, D. A. 1938 C.R. Acad. Sci. U.R.S.S. 19, 693; Acta Physicochim. 9, 341.





Paramagnetic resonance in the copper Tutton salts

By B. Bleaney, R. P. Penrose* and Betty I. Plumpton

Clarendon Laboratory, University of Oxford

(Communicated by F. Simon, F.R.S. Received 27 January 1949)

Part I. Seven of the double sulphates and selenates ('Tutton salts') of copper have been examined by the method of paramagnetic resonance. The deviation of the effective gyromagnetic ratio from the free-spin value of 2 is shown to be generally consistent with the theory of Polder (1942), who assumes a crystalline electric field of tetragonal symmetry to act on each ion. In some salts an appreciable departure from tetragonal symmetry is observed.

Part II. The variation of the width of the absorption lines with the direction of the applied magnetic field is studied and compared with that calculated from magnetic dipole interaction. The line shapes show that exchange interaction is present in varying degree, though much less powerful than in copper sulphate. The 'mean square moment' of the lines can be explained by the magnetic dipole interaction, together with a contribution from a hyperfine structure which is not resolved except in a highly diluted salt.

PART I 

1. Introduction

In the early applications of the method of paramagnetic resonance it is natural to choose substances whose spectra offer the greatest chance of a simple theoretical explanation. This requires salts whose crystallographic structure has been accurately determined; in addition, they must be fairly 'dilute' in the magnetic sense, in order to give narrow absorption lines, and to minimize the effects of exchange forces. In the iron group this suggests the use of the double sulphates: the alums, for the trivalent ions, and the monoclinic Tutton salts for the divalent ions. The chemical formulae are of the type M′M‴(SO₄)₂,12H₂O and M′₂M″(SO₄)₂,6H₂O respectively, where M‴ is the trivalent ion, M″ the divalent ion and M′ a monovalent diamagnetic ion. There exists, therefore, for each paramagnetic ion a series of salts formed by using different monovalent ions, principally K, NH₄, Rb, Cs and Tl; further variety may be obtained by using selenates instead of sulphates.

The simplest paramagnetic ion is Cu⁺⁺, in which the orbital momentum is effectively quenched and the spin $S = \frac{1}{2}$ corresponds to a single electron. One of its salts, CuSO₄,5H₂O, has been investigated by Bagguley & Griffiths (1948), who find that exchange forces play a dominant role in determining the behaviour of the energy levels. It is therefore particularly interesting to examine the more dilute Tutton salts, in which the crystalline electric field should be remarkably similar to that in the simple sulphate. This paper reports the results of measurements on seven copper Tutton salts, five sulphates and two selenates.

* Dr Penrose died suddenly at Leiden on 28 April 1949 while this paper was in the press.

2. Crystallography and susceptibility

The Tutton salts form a monoclinic series, in which the three crystallographic axes ($a$, $b$, $c$) are very closely in the ratio (3, 4, 2). The $b$-axis is normal to the plane containing $a$ and $c$; the $a$-axis makes an angle of 105° with the $c$-axis, measured positively in the anti-clockwise direction from the latter (see figure 1). No X-ray measurements on the copper salts exist, but following Polder (1942) we shall assume that the structure is similar to that of Mg(NH₄)₂(SO₄)₂,6H₂O, investigated by Hofmann (1931). The unit cell contains two molecules, one derived from the other by a translation from the point (0, 0, 0) to (½, ½, 0) followed by a reflexion in the $ac$-plane. The divalent metallic ions lie in the points mentioned, surrounded by an
octahedron of water molecules, of which four lie very nearly in a square each at 
a distance of l-^A from the Mg++, and the other two at 2'15A. The crystalline 
electric field should therefore have approximately tetragonal symmetry about an axis along the line joining the more distant waters. The two tetragonal axes in unit cell are equally inclined to the $ac$-plane, making an angle ($\alpha$) of about 25° with it in the Mg⁺⁺ salt. This angle is probably somewhat different in the Cu⁺⁺ salts owing to the increased size of the ion.

Figure 1. Crystallographic ($a$, $b$, $c$) and magnetic ($K_1$, $K_2$, $K_3$) axes.


On these assumptions. Polder shows that the orbital levels of the Cu"*^ ion should 
split into three singlet levels Fi, and F^; and one level, doubly degenerate 
except for a small splitting due to the spin-orbit coupling. F^ li^ lowest, some 
10,000 cm.”’^ below the next level. 

Introduction of the spin of f makes doubly d^enerate, with a magnetic moment 
. I). Owing to the residual spin-orbit coupling, the value of ^ is not exactly 2, 
and is anisotropic, with the values 


parallel to the tetragonal axis: 
perpendicular to the tetragonal axis: 


?i = 2i 




r. A \ ’ 

9'i = 2| 

1 

1 


( 1 ) 




where A is the spin-orbit coupling coefficient == — 852 Since lies above F^, 

flTj will be greater than and the corresponding susceptibilities per g. ion along 
and normal to the tetragonal axis will be 



0-376 j 

f a ^ 

\\ 2-1 1 

Xi — 

T ‘ 

r 

H 

1 

* v — 

0-376 

(l- ' 

f 0-53 

Ax “ 

T ' 

r f,-f^j 



where F^ and F^ are in 

Combining these results for the two ions in unit cell, one finds that of the principal axes of the ellipsoid of susceptibility, two lie in the $ac$-plane, one ($K_1$) being the bisector of the tetragonal axes, the other ($K_2$) being perpendicular to both these axes, while the third ($K_3$) lies along the crystallographic $b$-axis, which forms the other bisector of the tetragonal axes (figure 1). The corresponding susceptibilities are

$$\chi_1 = \chi_\parallel\cos^2\alpha + \chi_\perp\sin^2\alpha, \qquad \chi_2 = \chi_\perp, \qquad \chi_3 = \chi_\parallel\sin^2\alpha + \chi_\perp\cos^2\alpha, \tag{3}$$

where $\alpha$ is the angle between either tetragonal axis and the $ac$-plane. From the susceptibility measurements of Hupse (1942) on CuK₂(SO₄)₂,6H₂O over the range 290 to 1.5 °K, Polder deduces the following values:

$$g_\parallel = 2.44, \qquad g_\perp = 2.05, \qquad \alpha = 40°,$$

which are in reasonable agreement with his calculations on the crystalline electric field. Measurements over a more restricted temperature range by Krishnan, Chakravorty & Banerjee (1933) and Krishnan & Mookherji (1938) on other copper Tutton salts suggest that their behaviour is similar to that of the potassium salt.

3. The paramagnetic resonance experiments

In measurements of susceptibility only the net effect of the two ions taken together can be observed, and Polder points out that this leads to an ambiguity in the values of $g_\parallel$, $g_\perp$ and $\alpha$. A priori, it would have been equally valid to assume that the $K_1$ axis was perpendicular to the tetragonal axes, and that $K_2$ was a bisector. This would give $g_\parallel = 1.98$, $g_\perp = 2.27$ and $\alpha = 28°$, and although the value of $\alpha$ agrees more closely with that deduced from the X-ray measurements of the magnesium salt, the agreement with the tetragonal field theory for the $g$-values would have been completely upset. Paramagnetic resonance experiments have the advantage that each ion in unit cell gives its own absorption line whose position is determined by the angle at which the applied magnetic field is inclined to the tetragonal axis. If this angle is $\theta$, then the effective $g$-value is given by

$$g^2 = g_\parallel^2\cos^2\theta + g_\perp^2\sin^2\theta \tag{4}$$

(this formula was kindly derived for us by K. W. H. Stevens). At a frequency $\nu$ (cm⁻¹) the absorption line will occur at a field $H_0$, where $hc\nu = g\beta H_0$. If $H_0$ is measured in kilogauss, this reduces to

$$g = 21.4\,\nu/H_0. \tag{5}$$

Thus the absorption lines for the different ions will occur (in general) in different fields; hence it is possible to measure $g$ for each ion separately, and to locate its tetragonal axis, along which $g$ should be a maximum. The values of $g_\parallel$ and $g_\perp$ can then be measured directly. In addition, by making measurements in the plane normal to a tetragonal axis it should be possible to determine how far the assumption of tetragonal symmetry is valid.
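The numerical factor in (5) follows from the resonance condition with the frequency in wave-numbers and the field in kilogauss. The sketch below recomputes it from present-day values of the constants (these values are assumptions of the example, not figures from the paper) and evaluates the resonance field at the two wave-lengths used, for a hypothetical $g$ of 2.2.

```python
PLANCK_H = 6.62607015e-34          # J s
LIGHT_C_CM = 2.99792458e10         # cm/s
BOHR_MAGNETON = 9.2740100783e-24   # J/T
KILOGAUSS_PER_TESLA = 10.0

# g = K * nu / H0, with nu in cm^-1 and H0 in kilogauss
K = PLANCK_H * LIGHT_C_CM / BOHR_MAGNETON * KILOGAUSS_PER_TESLA


def g_value(wavenumber_per_cm, field_kilogauss):
    """Effective g from the line position, as in (5)."""
    return K * wavenumber_per_cm / field_kilogauss


def resonance_field_kilogauss(g, wavelength_cm):
    """Field at which the line appears for a given g and free-space wave-length."""
    return K / (g * wavelength_cm)


field_3cm = resonance_field_kilogauss(2.2, 3.2)    # hypothetical g = 2.2
field_1cm = resonance_field_kilogauss(2.2, 1.3)
```

For $g$ near 2.2 the lines fall at roughly 3 kG at 3.2 cm and 7.5 kG at 1.3 cm.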

In this investigation the following measurements were carried out:

A. The variation of $g$ in the $ac$-plane.

B. The value of $g_3$ along the $b$-axis.

The two tetragonal axes are equally inclined to the $ac$-plane, and to the $b$-axis; the absorption lines for the two ions will therefore coincide.* The maximum and minimum values of $g$ ($g_1$ and $g_2$) in A determine the positions of $K_1$ and $K_2$.

These two sets of measurements (A and B) are equivalent to the susceptibility measurements of Hupse and of Krishnan et al., but they have the advantage that there is no correction for diamagnetism or temperature-independent paramagnetism. These measurements provide no further check on Polder's theory, nor do they identify the plane containing the tetragonal axes. The necessary evidence is furnished by the following experiments, which illustrate the particular advantages of the paramagnetic resonance method.

C. The variation of $g$ in the planes ($K_1K_3$) and ($K_2K_3$).

D. The variation of $g$ in a plane normal to a tetragonal axis.

If Polder's theory is correct, the tetragonal axes lie in the ($K_1K_3$) plane and two resolved lines should appear in this plane. On the other hand, only one line should appear in the ($K_2K_3$) plane, since it is equally inclined to both tetragonal axes.

From the values $g_1$, $g_2$, $g_3$ for the principal susceptibility axes, the values of $g_\parallel$, $g_\perp$ and $\alpha$ can be calculated. These can be compared with those directly measured.
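The calculation of $g_\parallel$, $g_\perp$ and $\alpha$ from the principal values, and the ambiguity it carries, can be illustrated numerically. The sketch below assumes that relation (4) applies along each principal axis (so that $g^2$ combines as $\cos^2$ and $\sin^2$ of the angle to the tetragonal axis), starts from Polder's values for the potassium salt, and then inverts the principal values under the alternative assignment of the axes. Function names are illustrative.

```python
import math


def principal_g(g_par, g_perp, alpha_deg):
    """g along K1 (bisector in the ac-plane), K2 (normal to both tetragonal
    axes) and K3 (b-axis), with alpha the angle of each axis to the ac-plane."""
    a = math.radians(alpha_deg)
    g1 = math.sqrt(g_par**2 * math.cos(a)**2 + g_perp**2 * math.sin(a)**2)
    g2 = g_perp
    g3 = math.sqrt(g_par**2 * math.sin(a)**2 + g_perp**2 * math.cos(a)**2)
    return g1, g2, g3


def invert(g_normal, g_bisector, g_b):
    """Recover (g_par, g_perp, alpha_deg), taking g_normal as the ac-plane
    axis normal to both tetragonal axes and g_bisector as their bisector."""
    g_perp = g_normal
    g_par_sq = g_bisector**2 + g_b**2 - g_perp**2
    cos_2a = (g_bisector**2 - g_b**2) / (g_par_sq - g_perp**2)
    alpha = 0.5 * math.degrees(math.acos(cos_2a))
    return math.sqrt(g_par_sq), g_perp, alpha


g1, g2, g3 = principal_g(2.44, 2.05, 40.0)   # Polder's values
polder = invert(g2, g1, g3)                  # K2 normal to the axes
alternative = invert(g1, g2, g3)             # K1 normal to the axes instead
```

The alternative assignment returns values close to $g_\parallel$ = 1.98, $g_\perp$ = 2.27 and $\alpha$ = 28°, which is why susceptibility data alone cannot decide between the two.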

Measurements A and B were made most conveniently at a wave-length of 3.2 cm, as also were the initial measurements of C. These were sufficient to show that two lines were obtained in the ($K_1K_3$) plane and only one in ($K_2K_3$). The two lines were not well resolved, and to make accurate measurements, a wave-length of 1.3 cm was used for C and D.

These four sets of measurements provide a complete check of Polder's theory, and the results are presented in § 5 of this paper. In addition to the positions of the resonance lines, which determine the values of $g$, there is another parameter whose study is of great interest. This is the line width and line shape, which the preliminary experiments showed to vary markedly with orientation of the crystal and from salt to salt. The discussion of this question is given together with the results of the measurements, in part II.

* Since the one ion in unit cell is derived from the other by a reflexion in the $ac$-plane (plus a translation), the two absorption lines in this plane will coincide whatever the symmetry of the crystalline field.

4. Method and apparatus

In view of the low susceptibility of the copper Tutton salts, and the fact that only small single crystals could readily be grown, it was decided to make the measurements principally at 90 °K. At this temperature the dielectric losses are insignificant, and it is not necessary to grind the crystals down to reduce the amount protruding into the r.f. electric field, as is generally the case at room temperature.

The apparatus used is simple, and merits only a brief description. That used at 
3 cm. wave-length has already been outlined (Bleaney & P enrose 1 94 ^) > the apparatus 
for 1.3 cm is essentially the same. A fixed-tuned cavity, one-half a wave-length long and resonant in the mode, is excited through hole-couplings from rectangular wave-guides constructed of thin-walled german silver tubing. It is cooled by means of liquid oxygen contained in a small dewar with a narrow tail. By this means a gap
of 24mm. between pole-faces tapered to 1 Jin. could be used in the magnet, giving 
fields up to 13 kG. 

The experimental method is as follows at both wave-lengths. Power is fed to the cavity from a reflex klystron oscillator whose frequency is adjusted to the cavity resonance. A small fraction of the power entering the cavity is fed out through a second coupling to a silicon-tungsten rectifier crystal, whose d.c. current is read by a sensitive galvanometer. This current is closely proportional to the r.f. power incident on the rectifier, and hence to the square of the magnification factor $Q$ of the cavity. When the latter is lowered because of absorption of power by the paramagnetic crystal in the cavity, the rectified current falls from its initial value $\delta_1$ to some value $\delta_2$ and the paramagnetic absorption coefficient $\gamma$ (in arbitrary units) can be calculated from the formula

To reduce $\gamma$ to absolute units is difficult, requiring a measurement of $Q$ and the filling factor of the crystal in the resonator; fortunately, the main interest lies in the shape rather than the absolute intensity of the absorption curves, and this reduction was not carried out.
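The conversion from galvanometer readings to a relative absorption can be sketched under two assumptions: that the rectified current is proportional to $Q^2$, as stated above, and that the paramagnetic loss adds to the reciprocal of $Q$ (so that $1/Q = 1/Q_0 + k \times$ absorption). The resulting expression, $(\delta_1/\delta_2)^{1/2} - 1$, is a reconstruction on those assumptions and is not quoted from the paper; the function name is illustrative.

```python
import math


def relative_absorption(delta_initial, delta):
    """Absorption in arbitrary units from the rectified currents.

    Assumes current proportional to Q**2 and additive losses in 1/Q, so the
    absorption is proportional to Q0/Q - 1 = sqrt(delta_initial/delta) - 1.
    """
    if delta_initial <= 0.0 or delta <= 0.0:
        raise ValueError("galvanometer readings must be positive")
    return math.sqrt(delta_initial / delta) - 1.0


# Hypothetical readings: the current falls to a quarter of its initial value
example = relative_absorption(100.0, 25.0)
```

A fall of the current to a quarter of its initial value thus corresponds to a halving of $Q$ and one unit of absorption on this scale.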

In an experiment, a variable d.c. magnetic field is applied normal to the r.f. magnetic field, and the galvanometer reading $\delta$ is observed as a function of $H$. At each reading the frequency of the klystron is adjusted to be in exact resonance with the cavity. This is necessary in order to correct for slight frequency fluctuations in the oscillator, and for the detuning of the cavity owing to anomalous dispersion in the paramagnetic salt. The background of fluctuation corresponded to a few per cent of the maximum of an absorption line, the shape of which could therefore be determined well out into the wings.

The magnetic field was calibrated initially by means of a search coil and fluxmeter, and then over the limited ranges involved in the measurement of $g$ by means of a ballistic galvanometer, which was standardized using the reversal of a known current in an accurate mutual inductance. The error in $H$ is thought to be less than 1 %.



411 


Paramagnetic resonance in the copper TuUon salts 
5. Results — ^vabiation oe g 

The first measurement (A) was of the variation of $g$ in the ac-plane, at intervals
of 20°. From equation (4) it follows that, if the magnetic field is applied at an angle
$\psi$ to the c-axis, and the $K_1$ axis makes an angle $\psi_1$ with this axis, then

$$g^2 = g_1^2\cos^2(\psi-\psi_1) + g_2^2\sin^2(\psi-\psi_1), \tag{4a}$$

where $g_1$ and $g_2$ are the values of $g$ along $K_1$ and $K_2$ respectively. In all cases it was
found that the points could be fitted within experimental error by a curve of the
type (4a). A typical curve is shown in figure 2 for copper ammonium selenate. It
should be noted that in these experiments $\psi$ is measured in an anti-clockwise sense
from the c-axis, as in figure 1, the a-axis lying at an obtuse angle $\beta = 105°$. This
corresponds to the notation used by Krishnan et al. (1933) and later Indian workers,
and is of opposite sign to that of earlier measurements of Jackson (1923, 1926),
Rabi (1927) and Bartlett (1932).*
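
The angular law (4a) is easily tabulated. The following is a minimal sketch, not part of the original measurements: the function name is illustrative, and the constants are the table 1 entries for copper ammonium selenate ($\psi_1 = 72°$, $g_1 = 2.24$, $g_2 = 2.065$), taken here on the assumption that they describe the curve of figure 2.

```python
import math


def g_in_ac_plane(psi_deg, psi1_deg, g1, g2):
    """g-value for a field at angle psi to the c-axis, following equation (4a)."""
    d = math.radians(psi_deg - psi1_deg)
    return math.sqrt((g1 * math.cos(d)) ** 2 + (g2 * math.sin(d)) ** 2)


# Table 1 entries for copper ammonium selenate (assumed to apply to figure 2).
PSI1, G1, G2 = 72.0, 2.24, 2.065

for psi in range(0, 181, 20):  # 20 degree steps, as in measurement (A)
    print(f"psi = {psi:3d} deg   g = {g_in_ac_plane(psi, PSI1, G1, G2):.3f}")
```

The maximum $g_1$ falls at $\psi = \psi_1$ and the minimum $g_2$ at right angles to it, which is how $\psi_1$ is read off from the measured curve.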



Figure 2. Variation of $g$ in the ac-plane for Cu(NH4)2(SeO4)2·6H2O. Abscissa: $\psi$ = angle with c-axis (deg.).

The results of the measurements (A) and (B) on the seven salts examined are
shown in table 1. Column 2 shows the values of $\psi_1$ deduced from the positions of the
maximum and minimum values of $g$ in the ac-plane; the accuracy is probably not
greater than ±3°. For comparison the measurements of Krishnan et al. (1933) for
the first two salts, and of Krishnan & Mookherji (1938) for the others are quoted in
column 3.

* The comparison of these measurements reveals some considerable confusion. Bartlett
(1932) points out that in Rabi's (1927) paper the values of $\theta$ (our $\psi_1$) are interchanged for the
potassium and ammonium salts of both nickel and cobalt. From a later correction (Bartlett
1933) it appears that this is true also of the copper salts; at the same time Bartlett accepts
Krishnan's correction that his own values of $\theta$ should be reversed in sign. This makes them in
good numerical agreement with Krishnan's own results, with the same sign, though they
appear to have been measured in the opposite sense. In fact, the values obtained by both
Rabi and Bartlett suggest that they were really measured in the same sense as Krishnan's;
they would then agree well with the results reported in this paper.

Table 1

                  $\psi_1$ (deg.)                                                  $\alpha$ (deg.)
salt              this paper  Krishnan   $g_1$    $g_2 = g_\perp$   $g_3$    $g_\parallel$    this paper  Krishnan
sulphates:
  potassium       105         102        2.26     2.05      2.225    2.43     42          42
  ammonium         65          77        2.27     2.06      2.20     2.40     39          42
  rubidium        105         109        2.295    2.07      2.23     2.45     40          40
  thallium        112         108        2.26     2.06      2.20     2.39     39½         41
  caesium         114         108        2.28     2.06      2.22     2.43     40
selenates:
  ammonium         72          67        2.24     2.065     2.17     2.34     37½         39
  potassium        73         126        2.265    2.045     2.17     2.38     37          41


A rough check was obtained on the values of $\psi_1$ by suspending the crystal by a
torsionless fibre in a magnetic field, so that it was free to rotate about the b-axis.
The $K_1$ axis then sets parallel to the field, and its position can be observed directly.
The values obtained thus at room temperature agreed closely with those given
in column 1, except for the ammonium sulphate, for which the value was 77°. This
latter value agrees with Krishnan et al. (1933); since for this salt a variation of $\psi_1$
with temperature has been reported (Bartlett 1932), a crude attempt was made to
observe $\psi_1$ for this salt at 90° K. The value obtained was 69°, and a definite rotation
could be observed as the crystal warmed up. The reason for the very wide discrepancy
in the values of $\psi_1$ for the potassium selenate is unknown.

The next three columns give the values of $g_1$ and $g_2$ for the ac-plane, together with
the value of $g_3$ obtained by measurement with the magnetic field directed along the
b-axis. From these one can obtain the values of $g_\parallel$ and $\alpha$, assuming Polder's theory
to be correct. From equation (4) it can easily be shown that

$$\cos 2\alpha = \frac{g_1^2 - g_3^2}{g_\parallel^2 - g_\perp^2}, \tag{5}$$

with, of course, $g_\perp = g_2$ and $g_\parallel^2 + g_\perp^2 = g_1^2 + g_3^2$.

It will be seen that the values of $\psi_1$ tend to cluster around either 110 or 70°, and
the values of $\alpha$ are slightly higher for the former group than for the latter. There is
greater variation in the case of $g_\parallel$ and $g_\perp$, which are, of course, sensitive to the
magnitude of the crystalline electric field. The estimated accuracy of the values of
$g$ refers to the relative values of $g$, which
are probably rather better than the absolute values, which depend on the accuracy
of the calibration of the ballistic galvanometer used to measure the magnetic field.

Since in deducing the values of $g_\parallel$, $g_\perp$ and $\alpha$ the same assumptions have been made
as by Krishnan, it is convenient to compare the results with those of
other methods now, before reporting the other experimental results. Only for copper





potassium sulphate has the susceptibility been measured over a sufficient temperature
range to obtain accurate values of the Curie constants (Hupse 1942). From these
one finds

$$g_\parallel = 2.44, \quad g_\perp = 2.05 \quad \text{and} \quad \alpha = 40°,$$

which are in very good agreement with those reported in table 1.
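
The reduction from the measured $g_1$, $g_2$, $g_3$ to $g_\parallel$, $g_\perp$ and $\alpha$ can be sketched as follows. This is an illustration under stated assumptions rather than the authors' own working: it takes the two tetragonal axes to lie in the $(K_1K_3)$ plane at $\pm\alpha$ to $K_1$, so that $g_\perp = g_2$ and $g_\parallel^2 = g_1^2 + g_3^2 - g_2^2$; the function name is hypothetical, and the input is the copper potassium sulphate row of table 1.

```python
import math


def tetragonal_parameters(g1, g2, g3):
    """Return (g_parallel, g_perp, alpha in degrees), assuming tetragonal symmetry.

    Assumption: both tetragonal axes lie in the (K1, K3) plane at +/- alpha to K1.
    """
    g_perp = g2
    g_par_sq = g1 ** 2 + g3 ** 2 - g2 ** 2
    cos_2alpha = (g1 ** 2 - g3 ** 2) / (g_par_sq - g_perp ** 2)
    alpha_deg = 0.5 * math.degrees(math.acos(cos_2alpha))
    return math.sqrt(g_par_sq), g_perp, alpha_deg


# Copper potassium sulphate, table 1: g1 = 2.26, g2 = 2.05, g3 = 2.225.
g_par, g_perp, alpha = tetragonal_parameters(2.26, 2.05, 2.225)
print(f"g_parallel = {g_par:.2f}, g_perp = {g_perp:.2f}, alpha = {alpha:.0f} deg")
```

With these inputs the sketch gives values close to the tabulated $g_\parallel = 2.43$ and $\alpha = 42°$, and to the figures deduced from Hupse's susceptibilities.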

It is possible to obtain values of $\alpha$ for all the salts from the anisotropy measurements
at room temperature of Krishnan et al. (1933, 1938), if the assumption is
made that the diamagnetic susceptibility is isotropic. The theory of Polder shows
that the temperature-independent paramagnetism has tetragonal symmetry about
the same axis as the temperature-dependent portion. Consequently, one can calculate
$\alpha$ from the total anisotropy at any temperature, since from equation (3) it
follows that

$$\cos 2\alpha = \frac{\chi_1 - \chi_3}{2(\chi_1 - \chi_2) - (\chi_1 - \chi_3)}. \tag{6}$$

The values of $\alpha$ obtained by this means from Krishnan's measurements are given in
the last column of table 1 (see also Mookherji 1945). The agreement with our values
is close, though not exact. It is, however, difficult to estimate the error that may arise
from the assumption of diamagnetic isotropy in reducing Krishnan's values; and
small changes in $\alpha$ with temperature may occur.

Experiments C and D. The variation of $g$ in the planes $(K_1K_3)$ and $(K_2K_3)$ was not
studied systematically, but by making experiments at an angle of about 45° to
$K_3$ in the two planes at a wave-length of 3 cm., it was easy to verify that the two
tetragonal axes lie in the plane $(K_1K_3)$. The absorption curve shows two peaks in this
plane, but only one in $(K_2K_3)$. To obtain better resolution of the two peaks,
observations were made at a wave-length of 1.3 cm. The separation between the two lines
increases linearly with frequency, whereas the width remains substantially constant.
It is thus possible to obtain a direct measurement of $g_\parallel$ and also of $g_\perp$ in the plane
of the tetragonal axes instead of along $K_2$. A typical curve, for copper ammonium
sulphate, is shown in figure 3. The magnetic field is directed along one tetragonal axis,
giving the low-field peak, and at an angle of just under 80° to the other axis. By a
small rotation of the crystal the value of $g_\perp$ can be determined. The results of
measurements of this type are given in table 2.
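
The gain in resolution at the shorter wave-length follows from the resonance condition $h\nu = g\beta H$, and may be illustrated by a short calculation. This is a sketch under assumptions: the constants are modern CGS values, the function name is illustrative, and the two $g$-values are the table 2 entries for copper ammonium sulphate in the plane of the tetragonal axes (the second ion is treated as if its axis were exactly perpendicular to the field, whereas in figure 3 it is at just under 80°).

```python
# Physical constants in CGS units (modern values, not those of the paper).
PLANCK_H = 6.62607015e-27         # erg s
BOHR_MAGNETON = 9.2740100783e-21  # erg per gauss
LIGHT_SPEED = 2.99792458e10       # cm per s


def resonance_field(g, wavelength_cm):
    """Field in gauss at which h*nu = g*beta*H for a given free-space wave-length."""
    frequency = LIGHT_SPEED / wavelength_cm
    return PLANCK_H * frequency / (g * BOHR_MAGNETON)


# Table 2 entries for copper ammonium sulphate (plane of the tetragonal axes).
G_PARALLEL, G_PERP = 2.45, 2.12

separations = {}
for wavelength in (3.2, 1.3):
    low = resonance_field(G_PARALLEL, wavelength)
    high = resonance_field(G_PERP, wavelength)
    separations[wavelength] = high - low
    print(f"{wavelength} cm: {low:.0f} G and {high:.0f} G, separation {high - low:.0f} G")
```

The separation scales as the frequency, so that at 1.3 cm. it is larger by the factor 3.2/1.3 than at 3.2 cm., while the line width is substantially unchanged.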

For convenience, the values of $g_\parallel$ and $g_\perp$ ($= g_2$) from table 1 are repeated. It will
be seen that the differences between the two sets of values are at first sight hardly
outside the experimental error of ±1% except in the first two salts. In these there
is a considerable discrepancy between the values of $g_\perp$ in the $(K_1K_3)$ plane and $g_2$
($= g_\perp$) in the $(K_1K_2)$ plane, both of which are measured directly. That this is not due
to errors in the magnetic field calibration or the measurement of wave-length was
checked by measurements of $g_2$ at a wave-length of 1.3 cm., which gave values in
excellent agreement with those obtained at 3.2 cm. It is apparent that the differences
must be due to a departure from tetragonal symmetry; this is probably true of all
the salts, but is more marked in the potassium and ammonium sulphate.

Since the two values of $g_\perp$ obtained may not represent the extreme values when
there is an appreciable departure from tetragonal symmetry, an investigation was
made of the variation in a plane normal to one tetragonal axis for the potassium
sulphate. As two absorption lines of varying separation would be observed in this
plane except along $K_2$, it was essential to use a short wave-length, 1.3 cm., for this
purpose. Measurements were made in four positions, along $K_2$, and at angles of
45, 90 and 135° to it; the results are shown in table 3.


Figure 3. Absorption curve in the $(K_1K_3)$ plane at 1.3 cm. wave-length for copper ammonium
sulphate. Left-hand curve, tetragonal axis parallel to H; right-hand curve, tetragonal
axis at just under 80° to H.
Table 2

                  in plane of tetragonal axes     from table 1
salt              $g_\parallel$       $g_\perp$               $g_\parallel$       $g_\perp$
sulphates:
  potassium       2.36      2.12              2.43      2.05
  ammonium        2.45      2.12              2.39      2.06
  rubidium        2.45      2.11              2.45      2.07
  thallium        2.40      2.08              2.39      2.06
  caesium         2.43      2.08              2.43      2.06
selenates:
  ammonium*       2.39      2.075             2.34      2.065
  potassium       2.38      2.07              2.38      2.045


Table 3. Variation of g in plane normal to a tetragonal
axis for copper potassium sulphate

angle to $K_2$     observed values         average
   0°           2.05, 2.05              2.05
  45°           2.12, 2.13              2.125
  90°           2.10, 2.12, 2.125       2.115
 135°           2.065, 2.055            2.06

The results show that the crystalline electric field must have an appreciable
rhombic component, since the latter determines the departure of the $g$-value from
the free-spin value of 2, and there is a variation of at least a factor 2 in this departure.
Consequently, it is not surprising that the values of $g$ deduced on the assumption of
tetragonal symmetry do not agree exactly with those measured directly. It follows
also that the values of $\alpha$ may be slightly in error, though this does not affect the
comparison with Krishnan's values, which rest also on the assumption of tetragonal
symmetry. The slight misalinement of the magnetic field which would be caused by
an error in $\alpha$ should not result in any appreciable error in the direct measurement
of $g_\parallel$ and $g_\perp$, since they are extreme values. An error in alinement of 7° would cause
a change in $g$ of only 0.01.
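
The insensitivity of an extreme value to a small misalinement can be checked directly. The sketch below is illustrative only: it assumes the tetragonal form $g^2 = g_\parallel^2\cos^2\theta + g_\perp^2\sin^2\theta$, where $\theta$ is the angle between the field and the tetragonal axis, and uses the table 1 values for copper potassium sulphate; the function name is hypothetical.

```python
import math


def g_at_angle(theta_deg, g_par, g_perp):
    """g-value at angle theta to the tetragonal axis (tetragonal symmetry assumed)."""
    t = math.radians(theta_deg)
    return math.sqrt((g_par * math.cos(t)) ** 2 + (g_perp * math.sin(t)) ** 2)


G_PAR, G_PERP = 2.43, 2.05  # copper potassium sulphate, table 1

# Change in g when the field is 7 degrees away from each extreme direction.
shift_parallel = G_PAR - g_at_angle(7.0, G_PAR, G_PERP)
shift_perpendicular = g_at_angle(83.0, G_PAR, G_PERP) - G_PERP
print(f"shift near g_parallel: {shift_parallel:.4f}")
print(f"shift near g_perp:     {shift_perpendicular:.4f}")
```

Both shifts come out below 0.01, because the first-order variation of $g$ vanishes at a maximum or minimum.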

Although these measurements have shown that there is an appreciable departure
from tetragonal symmetry in some of the salts, it is, nevertheless, convenient generally
to ignore it because of the increased complication it introduces. For most purposes
the error introduced will be small, since the value of $g_\parallel$ is markedly greater
than the value of $g_\perp$ for all the salts. This arises partly because of the factor 4 which
occurs in the expression for $g_\parallel$ and not in that for $g_\perp$ (equation (1)), and partly because
the splitting which occurs in the denominator is smaller in the former expression.
From the results reported in this paper it is possible to calculate these splittings
for each salt. They will be substantially the same as those obtained by Polder from
Hupse's measurements, and in view of the lack of detailed knowledge of the crystalline
field the variation for different salts would have only academic interest.

In view of the departure from tetragonal symmetry, it is at first sight surprising
that the variation of the $g$-values in the ac-plane can be adequately represented by
equation (4a). The values of $g_1$, $g_2$ and $\psi_1$ are, however, adjusted to give the best
fit, and the agreement would not be so good if the proper values deduced from $g_\parallel$
and $g_\perp$ were used. The expected discrepancy is revealed by measurements in the
$(K_2K_3)$ plane, where the constants are already fixed. When $g$ is measured in this
plane in directions at equal angles to the $K_3$ (b) axis, but on either side of it, the values
obtained are unequal, though the angle with the tetragonal axes is the same in either
case. This can only be explained by a rhombic component.

6. Conclusion

In the first approximation the variation of the $g$-values confirms Polder's theory,
based on the assumption of two ions in unit cell each subjected to a crystalline electric
field of tetragonal symmetry. The separate study of each ion made possible by the
paramagnetic resonance method shows, however, that in some of the salts there is
an appreciable departure from tetragonal symmetry. Thus in copper potassium
sulphate the experimental results given in tables 2 and 3 lead to a rhombic anisotropy
with the following principal values:

$$g_x = 2.14, \quad g_y = 2.04, \quad g_z = 2.36.$$

The axis of $g_z$ (the 'tetragonal' axis) is at 41° to $K_1$ (in the plane of $K_1$ and $K_3$),
while $g_x$ and $g_y$ make angles of approximately 70 and 160° to $K_2$ in the plane normal
to $g_z$. These values lead to

$$g_1 = 2.26, \quad g_2 = 2.053, \quad g_3 = 2.23,$$

in close agreement with the observed values (table 1) for the axes of principal
magnetic susceptibility of the salt. From Hupse's measurements of the susceptibility
the following values are obtained:

$$g_1 = 2.27, \quad g_2 = 2.05, \quad g_3 = 2.21.$$

The agreement is within the experimental error.
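
The passage from the rhombic principal values 2.14, 2.04 and 2.36 to the values along the susceptibility axes can be retraced numerically. The sketch below rests on a reading of the geometry that should be treated as an assumption: the largest value belongs to an axis at 41° to $K_1$ in the $(K_1K_3)$ plane, $K_2$ lies in the plane normal to that axis at 70° to the axis of 2.14 and 160° to the axis of 2.04, and $g^2$ along any direction is the sum of the squared principal values weighted by the squared direction cosines. All names are illustrative.

```python
import math

G_PRINCIPAL = (2.14, 2.04, 2.36)  # rhombic principal values; the last is the 'tetragonal' axis
ALPHA = math.radians(41.0)        # angle between the 'tetragonal' axis and K1
PHI = math.radians(70.0)          # angle between K2 and the axis of the value 2.14


def g_along(direction_cosines):
    """g along a unit vector given by its direction cosines on the principal axes."""
    return math.sqrt(sum((l * g) ** 2 for l, g in zip(direction_cosines, G_PRINCIPAL)))


# K2 is normal to the 'tetragonal' axis; p is the in-plane direction normal to K2.
k2 = (math.cos(PHI), -math.sin(PHI), 0.0)
p = (math.sin(PHI), math.cos(PHI), 0.0)
k1 = (math.sin(ALPHA) * p[0], math.sin(ALPHA) * p[1], math.cos(ALPHA))
k3 = (-math.cos(ALPHA) * p[0], -math.cos(ALPHA) * p[1], math.sin(ALPHA))

g1, g2, g3 = g_along(k1), g_along(k2), g_along(k3)
print(f"g1 = {g1:.3f}, g2 = {g2:.3f}, g3 = {g3:.3f}")
```

Under these assumptions the three values agree with those quoted in the text to within about 0.005.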

A reference should be made here to the earlier theory of Jordahl (1934). The latter
ignores the presence of two ions in unit cell and requires therefore a rhombic field
to explain the different susceptibilities along the three axes $K_1$, $K_2$ and $K_3$. The
constants of this rhombic field are chosen to give a small splitting (c. 300 cm.$^{-1}$) of
the orbital doublet, in order to explain the fall of the Curie constants as the
temperature is lowered, observed by Bartlett (1933). The latter has been confirmed by
Janes (1935) over a wider temperature range (300 to 82° K). This variation with
temperature does not require a small orbital splitting, however, since it can be
explained by the temperature-independent paramagnetism which makes an important
contribution to the susceptibility at the higher temperatures. The magnitude of the
temperature-independent susceptibility for powders of copper ammonium and
potassium sulphates deduced from Janes's results is 0.1 × 10$^{-3}$ (per g.-ion), while
the theoretical values of Polder (equation (2) of this paper) give 0.1 × 10$^{-3}$.

PART II. LINE WIDTH AND SHAPE
7. Introduction

[...] paramagnetic resonance method (Bagguley, Bleaney,
Griffiths, Penrose & Plumpton 1948) it was pointed out that in general the width
[...] the orientation of the crystal in the field. The first of these factors enters
[...] interaction, whose characteristic relaxation time is generally
strongly temperature-dependent. In the case of two of the copper Tutton salts
[...] 3 × 10 [...] sec. at 90° K, which would contribute to the line width only 1 gauss or so.
At lower temperatures the rapid increase in the relaxation time would make the
[...]. A comparison of line width at 90 and 20° K was
[...] of experiments, and no change of line width outside the
experimental [...] therefore to
[...] temperatures to the effects
of spin-spin interaction between the various copper ions. The simplest type of
interaction is associated with the local magnetic field of the neighbouring
[...] external magnetic
field. The latter was found to be the case for fields between 1 and 8 kG
[...]. The dipole-dipole interaction
is [...] dependent on the direction of the external magnetic field, however, as
in the nuclear resonance experiment (Purcell, Bloembergen & Pound 1946) [...]




For this purpose, measurement in the ac-plane suggests itself as most convenient
and interesting. The nearest copper ions lie in this plane, and the two ions in unit
cell give identical lines. Measurement of the line width was therefore combined with
that of the $g$-value (experiments A), and was made at 3 cm. wave-length. The simplest
quantity to measure is the half-width at half intensity ($\Delta H_{1/2}$), and the variation of
this quantity with the direction of the external magnetic field in the ac-plane is
shown in figure 4 in the form of a polar diagram, where the radius vector is
proportional in length to ($\Delta H_{1/2}$). Though there is a striking variation in magnitude from
salt to salt, the general form of the polar diagram is the same, having a main
maximum along (or very close to) the c-axis, and a subsidiary maximum practically at
right angles. The two minima are unequal, the deeper minimum lying sometimes on
one side of the c-axis and sometimes on the other. Its position seems to be related
to that of the principal axes of susceptibility ($K_1$, $K_2$), in that $K_2$ always lies between
the deeper minimum and the c-axis. Interpretation of these results requires a detailed
consideration of the interaction between the different copper ions.


8. Magnetic dipole interaction

The positions of the neighbouring copper ions are as follows, the distances given
being those for the copper potassium sulphate. For the other salts the distances are
slightly greater corresponding to the increase in the ionic volume, but in the same
proportion, since the changes in the relative lengths and directions of the
crystallographic axes are very small.


Table 4. Positions of nearest neighbours

A. 2 ions at (0, 0, ±1); 6.1 Å
B. 4 ions at (±½, ±½, 0); 7.6 Å
C. 4 ions at (±½, ±½, ±1); 8.9 Å
D. 2 ions at (±1, 0, 0); 9.1 Å
E. 2 ions at (±1, 0, ±1); 9.4 Å
F. 4 ions at (±½, ±½, ∓1); 10.5 Å
G. 2 ions at (0, ±1, 0); 12.2 Å

All other ions lie at distances of at least 12.2 Å.


From the simplest point of view the magnetic dipole interaction can be considered
in the following way. The magnetic field of a neighbouring dipole $\mu$ gives rise to a
component $\mu(1 - 3\cos^2\theta)/r^3$, where $\theta$ is the angle between the direction of the
external field and the radius vector joining the two dipoles, which modifies the
external field and so produces a spread in the Larmor precession frequency of the
ions. Since the field falls off with the cube of the distance, the outstanding
contribution will be from the two ions A, which lie along the c-axis. The factor
$(1 - 3\cos^2\theta)$ will therefore give a maximum along the c-axis, a subsidiary maximum
at right angles half as great, and zeroes at angles of ±55° to the c-axis. This
corresponds roughly to the observed polar diagrams, except that they are less anisotropic,
and the zeroes are of course filled in by the fields of the more distant neighbours.
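
The angular behaviour of the factor $(1 - 3\cos^2\theta)$ described above can be verified in a few lines. This is a simple numerical illustration; the function name is not from the paper.

```python
import math


def dipolar_factor(theta_deg):
    """Angular factor 1 - 3*cos^2(theta) of the dipolar field component."""
    return 1.0 - 3.0 * math.cos(math.radians(theta_deg)) ** 2


along_c = abs(dipolar_factor(0.0))        # main maximum, along the c-axis
at_right_angles = abs(dipolar_factor(90.0))  # subsidiary maximum
zero_angle = math.degrees(math.acos(1.0 / math.sqrt(3.0)))  # where the factor vanishes

print(f"|factor| along c-axis:   {along_c:.1f}")
print(f"|factor| at right angles: {at_right_angles:.1f}")
print(f"zero at {zero_angle:.1f} deg to the c-axis")
```

The subsidiary maximum is half the main one, and the zero lies at the angle whose cosine is $1/\sqrt{3}$, which is the ±55° of the text.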

A closer approximation is obtained by taking the formula of van Vleck (1948) for
identical spins:

$$(\Delta H^2)_{\mathrm{av.}} = \tfrac{3}{4}\, g^2\beta^2 S(S+1) \sum_j \frac{(1 - 3\cos^2\theta_{ij})^2}{r_{ij}^6}, \tag{7}$$


where $\theta_{ij}$ = angle between magnetic field and line joining spins $i$ and $j$, and $(\Delta H^2)_{\mathrm{av.}}$
is the 'mean square moment'. In applying this to our case, we are ignoring the fact
that, though the two ions in unit cell have the same $g$-factor in the ac-plane, their
axes of precession are not quite parallel, being determined by the combined effect
of the tetragonal field and the magnetic field. However, the major contribution
comes from the ions at (0, 0, ±1), which belong to the same sort as that at (0, 0, 0)
so that no serious error* is introduced by using the factor ¾ which appears in (7) for
identical spins, in place of ⅓ for non-identical spins. As a further simplification, we
shall assume $g = 2.16$ for all directions in the ac-plane; this ignores a variation of
±5% which is determined by the position of the axes of susceptibility and is
therefore different for each salt.
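
The order of magnitude given by the second-moment formula can be illustrated for the two nearest neighbours alone. The sketch below is an assumption-laden check, not the authors' full lattice sum: it takes $S = \tfrac{1}{2}$, $g = 2.16$, the field along the c-axis, only the two ions A at 6.1 Å, and a modern CGS value of the Bohr magneton; the function name is illustrative.

```python
import math

BOHR_MAGNETON = 9.2740100783e-21  # erg per gauss (modern CGS value)


def rms_width(neighbours, g=2.16, spin=0.5):
    """Root mean square width in gauss for identical spins.

    neighbours: iterable of (distance in cm, angle in degrees between the
    field and the line joining the two spins).
    """
    lattice_sum = sum(
        (1.0 - 3.0 * math.cos(math.radians(theta)) ** 2) ** 2 / r ** 6
        for r, theta in neighbours
    )
    second_moment = 0.75 * (g * BOHR_MAGNETON) ** 2 * spin * (spin + 1.0) * lattice_sum
    return math.sqrt(second_moment)


# Two ions A at (0, 0, +1) and (0, 0, -1), 6.1 angstrom away, field along the c-axis.
nearest = [(6.1e-8, 0.0), (6.1e-8, 180.0)]
width_nearest = rms_width(nearest)
print(f"rms width from the two nearest neighbours: {width_nearest:.0f} G")
```

The result, a little under 190 G, falls slightly short of the theoretical figure of 196 G quoted in table 5 for copper potassium sulphate along the c-axis, the remainder coming from the more distant ions.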

The contributions to the mean square moment from the ions listed in table 4 have
been calculated at intervals of 15° in the ac-plane. An estimate of the contributions
from more distant ions shows that they do not increase the mean square moment
by more than a few per cent. To compare the calculated values of $\sqrt{(\Delta H^2)_{\mathrm{av.}}}$ with the
measured $\Delta H_{1/2}$ some assumption must be made concerning the line shape. An
approximation frequently used is that the shape is that of a gaussian distribution,
$\gamma = \gamma_0 \exp[-\Delta H^2/2(\Delta H^2)_{\mathrm{av.}}]$, for which $\Delta H_{1/2} = 1.18\sqrt{(\Delta H^2)_{\mathrm{av.}}}$. The curve h in
figure 4 labelled 'theoretical' is obtained on this basis for copper potassium sulphate;
it should be contracted slightly for the other salts in inverse ratio to their g.-ionic
volumes (listed in table 5). From the shape of the theoretical curve it will be seen
that it is determined mainly by the two nearest neighbours at (0, 0, ±1).
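
The factor 1.18 connecting the half-width at half intensity with the root mean square width of a gaussian line is $\sqrt{2\ln 2}$. A minimal check follows; the names are illustrative, and the width of 196 G is the theoretical value for copper potassium sulphate from table 5.

```python
import math

HALF_WIDTH_FACTOR = math.sqrt(2.0 * math.log(2.0))  # about 1.18


def gaussian_line(delta_h, rms):
    """Relative absorption of a gaussian line at a distance delta_h from its centre."""
    return math.exp(-delta_h ** 2 / (2.0 * rms ** 2))


rms = 196.0                          # gauss, theoretical value from table 5
half_width = HALF_WIDTH_FACTOR * rms
print(f"factor = {HALF_WIDTH_FACTOR:.3f}, half-width = {half_width:.0f} G")
print(f"relative intensity at the half-width: {gaussian_line(half_width, rms):.3f}")
```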

In some cases (for example, copper ammonium selenate) the line shape approximates
very closely to that of a gaussian distribution. In other salts this is by no
means the case, and two extreme shapes are shown in figure 5, the pointed line
being obtained in copper ammonium sulphate. It is obvious that the small value of
$\Delta H_{1/2}$ for the ammonium sulphate may be due to its pointed shape, and the large value
for the caesium sulphate to its flattened shape. A numerical computation of $\sqrt{(\Delta H^2)_{\mathrm{av.}}}$ was
therefore carried out for the absorption lines obtained in each salt along the c-axis;
the results are shown in table 5 (column 4), together with the theoretical values
(column 3). The g.-ionic volumes are given in the second column, and the salts are
arranged in order of increasing ionic volume. The experimental values† of $\Delta H_{1/2}$ are
shown in the last column.
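
The statement that the theoretical width contracts in inverse ratio to the g.-ionic volume can be compared with the theoretical column of table 5. The sketch below simply rescales the value for copper potassium sulphate; it assumes, as the text does, that all distances change in the same proportion, so that the lattice sum over $r^{-6}$ varies as the inverse square of the volume. The variable names are illustrative.

```python
# (g.-ionic volume in ml., theoretical rms width in gauss), rows of table 5 in order.
TABLE_5 = [
    (197, 196),  # potassium sulphate (reference salt)
    (206, 187),  # ammonium sulphate
    (212, 181),  # rubidium sulphate
    (214, 179),  # thallium sulphate
    (219, 175),  # caesium sulphate
    (224, 171),  # ammonium selenate
    (226, 170),  # potassium selenate
]

ref_volume, ref_width = TABLE_5[0]
scaled = [ref_width * ref_volume / volume for volume, _ in TABLE_5]

for (volume, tabulated), estimate in zip(TABLE_5, scaled):
    print(f"volume {volume} ml.: scaled {estimate:.1f} G, tabulated {tabulated} G")
```

The scaled values follow the tabulated ones to within a gauss or two.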

* Professor M. H. L. Pryce has pointed out to us that the anisotropic $g$-values will produce
a small modification of the factor ¾, since the contribution to the line width due to transitions
caused by the rotating component of the field of the neighbouring ion involves a different
$g$-value from that associated with the steady component. The error introduced is of the same
order as the variation in the $g$-values.

† The probable errors in $\Delta H_{1/2}$ are ±5 G; those given for $\sqrt{(\Delta H^2)_{\mathrm{av.}}}$ are larger because they
depend more on the shapes of the lines in the wings, where the intensity is low. In this respect
the experimental values probably err on the low side rather than the reverse, since the
tendency is to cut the tail off rather than to prolong it. The absorption is assumed to be zero
when the magnetic field is far from resonance (±1 kG), where the $Q$ of the cavity becomes
constant. Thus any tail of small intensity which extends so far is ignored, with corresponding
reduction in the experimental values of $\sqrt{(\Delta H^2)_{\mathrm{av.}}}$.

The results presented in table 5 indicate that the values of $\sqrt{(\Delta H^2)_{\mathrm{av.}}}$ along the
c-axis do not differ markedly from those calculated from equation (7), though the
values of $\Delta H_{1/2}$ vary greatly. It is therefore interesting to examine whether a plot of
the former quantity for the ac-plane would show a similar agreement. To obtain
greater accuracy, a special set of measurements was carried out on copper ammonium
selenate, for which the line shape is close to gaussian. The results are shown in figure 6,
where the experimental points are drawn with a probable error of ±6 G, and the



Figure 5. Line shape (along c-axis) for a, copper ammonium sulphate
and b, copper caesium sulphate. Abscissa: magnetic field (kG).


Table 5. Line width along the c-axis

                  g.-ionic     $\sqrt{(\Delta H^2)_{\mathrm{av.}}}$     $\sqrt{(\Delta H^2)_{\mathrm{av.}}}$     $\Delta H_{1/2}$
                  volume       theor.          exp.            exp.
salt              (ml.)        (gauss)         (gauss)         (gauss)
sulphates:
  potassium       197          196             214 ± 10        153
  ammonium        206          187             226 ± 10        116
  rubidium        212          181             236 ± 10        330
  thallium        214          179             178 ± 10        172
  caesium         219          175             198 ± 10        316
selenates:
  ammonium        224          171             180 ± 10        210
  potassium       226          170             178 ± 10        141


[...] of equation (7). Comparison with the curve of $\Delta H_{1/2}$ (figure 4)
[...] of the latter are [...]
to a change in line shape with direction. The
discrepancies with the theoretical values are very considerable. In particular, the
sharp minimum at ±55° to the c-axis is filled out, the experimental value being twice
as great as the theoretical; and generally the width is greater than calculated, except
near the $K_2$ axis. It is evident that other types of interaction between the magnetic
ions of the same order as that due to the magnetic dipoles must be present in this
salt.


Figure 6. Polar diagram of $\sqrt{(\Delta H^2)_{\mathrm{av.}}}$ in the ac-plane for copper
ammonium selenate, showing the experimental points and the theoretical curve.

9. Isotropic exchange interaction

In addition to the discrepancies revealed in figure 6, the magnetic dipole interaction
offers no explanation of the widely divergent values of $\Delta H_{1/2}$ and the different
line shapes, which seem to bear no relation to the ionic volumes. In no case do the
lengths of the crystallographic axes depart by more than 3% from the simple
(3, 4, 2) ratio, and the differences cannot be attributed to change in the shape of the
unit cell. It is necessary therefore to consider another type of interaction due to
exchange forces. Although these are primarily connected with the orbital motion
of the electrons, by virtue of the exclusion principle they manifest themselves as
a coupling between the spin vectors proportional to the cosine of the angle between
them. In the case of identical ions in a large magnetic field, this angle is independent
of the direction of the magnetic field and the exchange forces are therefore isotropic.
Gorter & van Vleck (1947) have pointed out that they give rise to a narrowing of the
line in the centre and a broadening in the wings; that is, the line is more peaked and
the value of $\Delta H_{1/2}$ is less than when magnetic dipole interaction alone is present. The
exact line shape cannot be evaluated simply, but van Vleck (1948) has calculated the
'mean square moment' $(\Delta H^2)_{\mathrm{av.}}$ and the 'mean fourth moment' $(\Delta H^4)_{\mathrm{av.}}$. It turns
out that the former is unaltered by exchange interaction; the discrepancy between
the experimental values of $\sqrt{(\Delta H^2)_{\mathrm{av.}}}$ and those calculated from magnetic interaction
shown in figure 6 for copper ammonium selenate is therefore not explained by these
isotropic exchange forces.

In view of the large changes in line shape between different salts it is interesting
to consider what shape would be expected from magnetic interaction alone. The
only parameter of line shape available for comparison with theory is the ratio ($\rho$)
of the root mean fourth width to the root mean square width,
$\rho = (\Delta H^4)_{\mathrm{av.}}^{1/4} / (\Delta H^2)_{\mathrm{av.}}^{1/2}$.
For a gaussian distribution this ratio is $3^{1/4} = 1.32$. Van Vleck shows that the curve
should be somewhat blunter, the ratio being 1.25 for a cubic lattice of spins $S = \tfrac{1}{2}$.
For the Tutton salts, as a first approximation one may assume that only the two
nearest neighbours A at (0, 0, ±1) make a significant contribution to the line width.
Equation (24) of van Vleck (1948) then yields 1.18 for the ratio of the root mean
fourth and second moments. It is interesting that the experimental value of this
ratio for the curve for copper caesium sulphate shown in figure 5 is 1.1. This curve
approximates in shape therefore to that which would be expected if magnetic dipole
interaction alone were present,* though the actual value of the root mean square
moment (table 5) is somewhat too large. For copper ammonium selenate the
experimental value of $\rho$ for the twelve absorption lines in the ac-plane used to obtain the
root mean square width shown in figure 6 varies between 1.32 and 1.4. Thus in this
case the line shape never departs greatly from the gaussian, and the polar diagram
of the root mean square width (figure 6) has the same outline as the half-width
(figure 4f). It is true of all the salts that no great change in line shape is discernible
as the applied magnetic field is rotated in the ac-plane, and it is probable therefore
that polar diagrams of $\sqrt{(\Delta H^2)_{\mathrm{av.}}}$ would resemble those of $\Delta H_{1/2}$ for all the cases shown
in figure 4, with, of course, a scaling factor dependent on the line shape.
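
The shape parameter formed from the root mean fourth and root mean square widths can be evaluated numerically for any symmetrical line. The sketch below is illustrative and not taken from the paper: it integrates by a simple sum over a uniform grid, checks the gaussian value $3^{1/4}$, and also treats an idealized rectangular line, which is introduced here purely as a limiting example of a flattened shape. All names are hypothetical.

```python
import math


def moment_ratio(shape, half_range, steps=20000):
    """Ratio of the root mean fourth width to the root mean square width.

    shape: function giving the relative absorption at a distance x from the
    line centre; the line is assumed symmetrical and negligible beyond half_range.
    """
    dx = 2.0 * half_range / steps
    m0 = m2 = m4 = 0.0
    for k in range(steps + 1):
        x = -half_range + k * dx
        w = shape(x)
        m0 += w
        m2 += w * x ** 2
        m4 += w * x ** 4
    return (m4 / m0) ** 0.25 / (m2 / m0) ** 0.5


def gaussian(x):
    return math.exp(-0.5 * x * x)


def rectangular(x):
    return 1.0 if abs(x) <= 1.0 else 0.0


rho_gaussian = moment_ratio(gaussian, 10.0)
rho_rectangular = moment_ratio(rectangular, 1.0)
print(f"gaussian line:    ratio = {rho_gaussian:.3f}")
print(f"rectangular line: ratio = {rho_rectangular:.3f}")
```

A flattened line thus gives a ratio well below the gaussian figure of 1.32, while a line with extended wings gives a larger one.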

Since the line shape is more or less isotropic, it may be attributed to simple exchange
interaction. The great variation of line shape between different salts is not surprising,
since under the influence of the crystalline electric field the orbital wave functions
are strongly directional. The wide variation in the directions of the tetragonal axes
from salt to salt indicates that a similar change in the overlapping of the orbital
wave functions of neighbours, and hence of the magnitude of the exchange forces,
may be expected. There is, however, no obvious correlation between line shape and
the angle $\alpha$ which determines the position of the 'tetragonal' axis. In view of the
presence of rhombic components of the crystalline field, this is not to be expected,
especially as intervening atoms other than the water molecules (which mainly
determine the crystalline field) will also exercise considerable influence on the
orbital wave functions.


10. Exchange interaction between the dissimilar ions

Since the unit cell of the Tutton salts contains two copper ions subject to differently
oriented crystalline fields, the spin vectors of these two ions will not precess about
parallel axes when a magnetic field is applied, and the angle between the vectors will
vary with the direction of the external field. The effects produced then by exchange
forces are illustrated by the (magnetically) very similar salt CuSO4·5H2O, in which
a remarkable anomaly has been discovered by Bagguley & Griffiths (1948). When
a magnetic field is applied in a direction making unequal angles with the two
tetragonal axes, as in experiment C of part I of this paper, the resonance lines due to the
two different ions are not resolved until magnetic fields of the order of 12 kG are used,
although the line widths are less than a hundred gauss. This effect is attributed to
exchange forces between the dissimilar ions (Pryce 1948) which are strong enough
to overcome the separation in energy which the ions should possess in smaller
magnetic fields. In addition, there is a broadening effect even when the two lines
coincide, unless the magnetic field is applied in a direction perpendicular to both
tetragonal axes, when even the dissimilar ions precess about identical axes.

* The rectangular shape may, however, be associated with the unresolved hyperfine
structure (see §11).

It is obvious that in the Tutton salts such exchange forces, if present, are much
weaker than in the single sulphate. Figure 3 shows how the lines are completely
separated in copper ammonium sulphate in fields of 6 to 7 kG, and even at 3 cm.
wave-length (fields of about 3 kG) the two peaks are resolved in all the seven Tutton
salts investigated (see, for example, figure 7). It may be, however, that the exchange
force between dissimilar ions does still produce a broadening of the lines, and in this
connexion it may be significant that the discrepancy between the experimental
root mean square widths and those calculated from magnetic dipole interaction
revealed by figure 6 for copper ammonium selenate vanishes near the $K_2$ axis. This
axis is perpendicular to both tetragonal axes, and exchange effects between
dissimilar ions would not contribute to the line width. Whether they are large enough
to explain the discrepancy when the magnetic field is applied in other directions
cannot be estimated before further theoretical work has been carried out. They
enhance only the contributions to $(\Delta H^2)_{\mathrm{av.}}$ from the dissimilar ions (B, C and F in
table 4) which are rather small. In some directions they would have to be increased
by a factor greater than 10 to account for the observed line widths in copper
ammonium sulphate. It would be rather surprising if the effect were so large, since
it arises only through the precession about slightly inclined axes.


11. Effect of unresolved hyperfine structure

Apart from these discrepancies it seems that none of the interactions considered
hitherto can explain the unequal widths of the resolved absorption lines shown in
figure 3. These lines are obtained by applying the magnetic field in a direction
parallel to the tetragonal axis of one ion, when it will be at an angle of about 80° to the
tetragonal axis of the other ion. The latter gives the narrower line at the higher field
(lower g-value). The spatial distribution of copper ions round each of these is
precisely the same except that wherever X has a neighbour of the type X, Y has a
neighbour of the type Y and vice versa. In this type of experiment, then, where the
external magnetic field is in the same direction relative to the crystallographic axes
for each ion, one would expect that the interactions with the neighbours should be
identical, except that there may be a variation in the magnetic interaction associated
with the different g-values of the two types of ion. The maximum difference this can



B. Bleaney, E. P. Penrose and Betty I. Plumpton 

cause is of the same order as the difference in the g-values (≈ 2.4/2.05), and its sign
will depend on whether the major contribution to the width comes from similar or
dissimilar ions. In this particular case, the contributions are roughly equal, and no
significant difference in width would be expected. The experimental values of the
line width, however, are 184 G for the parallel ion, and 112 G for the ion whose
tetragonal axis is nearly perpendicular to the magnetic field. Computation from the
curve gives 164 ± 20 and 114 ± 20 G respectively for the root mean square width
(ΔH²)av.^½, while the theoretical value is approximately 90 G for each ion. Similar
differences are found for the other salts, a narrower absorption curve being obtained
always for the perpendicular ion than for the parallel ion (see figure 7).



[Figure: absorption curves from the copper caesium salt plotted against magnetic field (kG), for the ion with its tetragonal axis parallel to H and for the ion with its tetragonal axis at 80° to H.]


Since the only difference between the two ions is that their tetragonal axes are
differently oriented with respect to the magnetic field, this difference in the mean
square moments cannot be ascribed to magnetic dipole interaction or to isotropic
exchange forces. The exchange forces between dissimilar ions may enhance the line
width, but may be expected to affect either line equally, since physically they
correspond to a rapid exchange of the two ions concerned. Thus there appears to be
some effect which contributes to the mean square moments of the absorption lines,
which is not explicable in terms either of magnetic interaction or exchange
interaction between the spins. The magnitude of the contribution varies with the direction
of the applied field, being greater where the g-values are higher. Further evidence



Paramagnetic resonance in the copper Tutton salts

in support of this last statement is provided by measurements in the {K^K^ plane. 
At equal angles to, but on either side of the K₃ axis, the line widths are different. In
copper ammonium sulphate at angles of +30° and −30° to K₃ the values of the line
width are 87 and 59 G respectively. Since K₃ is identical with the crystallographic b-axis,
contributions to the line width from magnetic dipole and exchange forces between
similar or dissimilar ions should be identical in such directions equally inclined to
it, as also should those from any effect with the tetragonal symmetry assumed for
the crystalline field. The difference in the g-values of the two lines (2.19 and 2.13
respectively) shows, however, that there is an appreciable rhombic component of
the crystalline field. The difference between the line widths is in the same direction
as that in the g-values and could be explained by an effect with the same anisotropy
as the crystalline field.

The nature of this effect has been revealed by subsequent work of Penrose
at Leiden University. On using a crystal of copper ammonium sulphate greatly
diluted with the corresponding magnesium salt, the single absorption line expected
in the ac-plane breaks up into four equally spaced narrow lines, each with a width of
about 15 G. The overall separation of the lines varies with the direction of the
magnetic field, the maximum value of about 300 G being obtained in the … direction,
while near … the lines are not completely resolved. This hyperfine structure is
attributed to interaction between the electron spin and the copper nucleus, both
of whose common isotopes have spin 3/2 and nearly equal magnetic moments (2.226
and 2.385 nuclear magnetons (Pound 1948)). With the magnetic fields used, the
spectrum corresponds to that of the Back-Goudsmit region in optical hyperfine
spectra. The separation of the four components varies because the magnetic
interaction between the electron and nuclear spins is averaged over the distribution of
the electron cloud density in space, i.e. the orbital wave functions, which are
determined by the crystalline electric field. If the latter has tetragonal symmetry, the
separation should vary as (1 + 3 cos²β)^½, where β is the angle which the magnetic
field makes with the tetragonal axis. This is roughly consistent with the initial
measurements of Penrose.
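The angular law quoted above can be tabulated with a short sketch. This is only an illustration under stated assumptions: the function name is hypothetical, the (1 + 3 cos²β)^½ law is taken to hold exactly, and the quoted maximum of about 300 G is assumed to correspond to β = 0, which the text does not establish.

```python
import math

def hyperfine_separation(beta_deg, max_separation=300.0):
    """Overall separation (gauss) of the four hyperfine lines at angle beta.

    Assumes the (1 + 3 cos^2 beta)^(1/2) law holds exactly and that the
    maximum separation (about 300 G in the text) occurs at beta = 0.
    """
    beta = math.radians(beta_deg)
    # The angular factor equals 2 at beta = 0, so divide by 2 to normalise.
    return max_separation * math.sqrt(1.0 + 3.0 * math.cos(beta) ** 2) / 2.0

for angle in (0, 30, 60, 90):
    print(f"beta = {angle:2d} deg: separation about {hyperfine_separation(angle):.0f} G")
```

On these assumptions the separation falls by a factor of two between the parallel and perpendicular directions.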

The separation of the hyperfine components is so large that they should be visible
in the undiluted salts. The absorption curves, of which examples are given in
figures 3, 5 and 7, show no trace of a fine structure. Its absence must be attributed
to exchange forces, and perhaps also to the fluctuating components of the local
magnetic field of neighbouring electron spins which shorten the lifetime of the
electronic state sufficiently to ‘average out’ the interaction with the nuclear spin.
The contribution of the latter to the mean square moment will remain, however,
and it has been estimated by Pryce from the initial results of Penrose as

(AjBr^)av. = X ^ gauss^. (8) 

This formula can be applied to the two resolved lines (figure 2, 6), from which
reasonable estimates of (ΔH²)av. can be obtained. The results are given in the table below.

The agreement is better than could be expected at this stage. Analysis of the
difference between the mean square moments observed for copper ammonium
selenate in the ac-plane (figure 6) and those calculated from magnetic interaction
shows that it also is of the same order as predicted by equation (8). The presence of
an unresolved hyperfine structure offers therefore an adequate explanation of the
deviation of the mean square moments from those expected from magnetic dipole
interaction between the electron spins.


                                                  (ΔH²)av., gauss²

                            angle with                              magnetic       magnetic interaction
                            tetragonal                              interaction    + hyperfine
salt                        axis         experimental               alone          structure

copper ammonium sulphate     0°          (2.7 ± 0.3) × 10⁴          0.8 × 10⁴      2.4 × 10⁴
                            76°          (1.3 ± 0.2) × 10⁴          0.8 × 10⁴      1.3 × 10⁴
copper caesium sulphate      0°          (2.3 ± 0.2) × 10⁴          0.7 × 10⁴      2.3 × 10⁴
                            80°          (1.3 ± 0.2) × 10⁴          0.7 × 10⁴      1.2 × 10⁴
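The tabulated mean square moments can be related to the root mean square widths quoted in §11 simply by taking square roots. The sketch below does this for the central values of the table; the names `ROWS`, `rms_width` and `hyperfine_part` are illustrative only, and the exponents are read as 10⁴ throughout.

```python
import math

# (salt, angle in degrees, experimental, magnetic interaction alone,
#  magnetic interaction + hyperfine structure); moments in gauss^2.
ROWS = [
    ("copper ammonium sulphate", 0, 2.7e4, 0.8e4, 2.4e4),
    ("copper ammonium sulphate", 76, 1.3e4, 0.8e4, 1.3e4),
    ("copper caesium sulphate", 0, 2.3e4, 0.7e4, 2.3e4),
    ("copper caesium sulphate", 80, 1.3e4, 0.7e4, 1.2e4),
]

def rms_width(mean_square_moment):
    """Root mean square width in gauss from a mean square moment in gauss^2."""
    return math.sqrt(mean_square_moment)

def hyperfine_part(row):
    """Part of the calculated moment ascribed to the hyperfine structure."""
    return row[4] - row[3]

for row in ROWS:
    salt, angle, experimental, dipole, total = row
    print(f"{salt:26s} {angle:2d} deg  rms(exp) = {rms_width(experimental):4.0f} G  "
          f"rms(dipole) = {rms_width(dipole):3.0f} G  "
          f"hyperfine part = {hyperfine_part(row):.1e} gauss^2")
```

For copper ammonium sulphate the square roots of the experimental entries reproduce the widths of 164 and 114 G obtained by computation from the curve, and the dipole entry corresponds to a width of roughly 90 G.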


12. Discussion 

The assumption of exchange forces in the copper Tutton salts is not new, since
they have been invoked by Opechowski (1948) to explain the anomalous specific heat
of the potassium sulphate discovered by de Klerk (1946). The constant of the
magnetic specific heat is 6.8 × 10⁻⁴, which is some five times greater than
the theoretical value for pure dipole interaction. Relaxation measurements by Bijl
(1941) and Broer & Kemperman (1947) confirm this high value. De Klerk finds also
that the susceptibility obeys a Curie-Weiss law with a Weiss constant of 0.062° K.
Using this value, a tentative estimate by Opechowski suggests that at least ten
neighbouring copper ions must co-operate in the exchange interaction. Reference
to the list of neighbours, table 4 (Opechowski's list omits the ions labelled C and F
in table 4) shows that this requires all the ions up to a distance of 9 Å. The interaction
is thus comparatively long range. Opechowski's theory neglects magnetic dipole
interaction, however, which the results given in this paper show to be at least as
important as exchange forces, as far as line width is concerned.

The hyperfine structure will also give rise to an additional specific heat, though
insufficient to account for the observed value in copper potassium sulphate. Since
the structure is ‘averaged out’ by the exchange interaction in the undiluted salt, its
contribution to the specific heat may also be reduced. On the whole it appears that
the abnormally high specific heat must be due in the main to the exchange
interaction. This is supported by paramagnetic relaxation measurements in the liquid
helium region by Benzie & Cooke (1949) who find the specific heat of copper rubidium
sulphate, which gives the broadest absorption lines in our experiments, to be
considerably smaller than that of the potassium or ammonium sulphates or the
potassium selenate, all of which give fairly narrow lines.

In copper sulphate pentahydrate the specific heat at helium temperature has …
temperatures (Ashmead 1939) and the Weiss constant … dominant as far as line
width is concerned, the lines being narrower than in any of the Tutton salts, though
the salt is magnetically more concentrated. The distance between nearest neighbours
is not much less (5.5 Å) than in the Tutton salts (6 Å); it is therefore not surprising
that exchange effects appear in the latter, though to a less degree, since there are
only two ions at 6 Å compared with eight in the single sulphate; in particular, the
least distance between ions of dissimilar kind is 7.5 Å in the Tutton salts, and there
is no evidence of an anomalous absorption spectrum attributed to exchange between
those ions, similar to that found by Bagguley & Griffiths.


13. Conclusions

(a) The change of line width with direction of the applied magnetic field is roughly
consistent with that expected from magnetic dipole interaction, but the mean square
moment is generally bigger.

(b) The line shape is more or less independent of the direction of the applied field
in the ac-plane, but varies markedly from salt to salt. In a number of cases the lines
are more peaked than can be explained by magnetic dipole interaction, an effect
which may be attributed to isotropic exchange forces.

(c) The presence of exchange interactions is consistent with the abnormal specific
heat of the spin-system, which is greater than can be accounted for by magnetic
dipole interaction.

(d) The measured mean square moments show a departure from the calculated
values which is greatest in directions where the g-values are greatest, i.e. the effect
seems to have the same anisotropy as the crystalline electric field. It may partly
be due to exchange interaction between dissimilar ions, but the presence of a
hyperfine structure ‘averaged out’ by the exchange forces would give a contribution to
the mean square moment of the required magnitude, and with the right anisotropy.

The authors wish to acknowledge their indebtedness to Professor M. H. L. Pryce
and Mr K. W. H. Stevens for their contributions to the theoretical aspects; and to
Mr D. M. S. Bagguley for help in growing crystals and for valuable discussion.


References


Ashmead, J. 1939 Nature, 143, 863.
Bagguley, D. M. S., Bleaney, B., Griffiths, J. H. E., Penrose, R. P. & Plumpton, B. I. 1948 Proc. Phys. Soc. 61, 542.
Bagguley, D. M. S. & Griffiths, J. H. E. 1948 Nature, 162, 638.
Bartlett, B. W. 1932 Phys. Rev. 41, 818.
Bartlett, B. W. 1933 Phys. Rev. 44, 687.
Benzie, R. J. & Cooke, A. H. 1949 (Unpublished.)
Bijl, D. 1941 Physica, 8, 461.
Bleaney, B. & Penrose, R. P. 1948 Proc. Phys. Soc. 60, 396.
Broer, L. J. E. & Kemperman, J. 1947 Physica, 13, 466.
de Klerk, D. 1946 Physica, 12, 613.
Gorter, C. J. & van Vleck, J. H. 1947 Phys. Rev. 72, 1128.
Hofmann, W. 1931 Z. Krystallogr. 78, 279.
Hupse, J. G. 1942 Physica, 9, 633.
Jackson, L. C. 1923 Phil. Trans. A, 224, 1.
Jackson, L. C. 1926 Phil. Trans. A, 226, 107.
Janes, R. B. 1935 Phys. Rev. 48, 78.
Jordahl, O. M. 1934 Phys. Rev. 45, 87.
Krishnan, K. S., Chakravorty, N. C. & Banerjee, S. 1933 Phil. Trans. A, 232, 99.
Krishnan, K. S. & Mookherji, A. 1938 Phil. Trans. A, 237, 135.
Mookherji, A. 1945 Indian J. Phys. 9, 63.
Opechowski, W. 1948 Physica, 14, 237.
Polder, D. 1942 Physica, 9, 709.
Pound, R. V. 1948 Phys. Rev. 73, 523.
Pryce, M. H. L. 1948 Nature, 162, 639.
Purcell, E. M., Bloembergen, N. & Pound, R. V. 1946 Phys. Rev. 70, 988.
Rabi, I. I. 1927 Phys. Rev. 29, 174.
Reekie, J. 1939 Proc. Roy. Soc. A, 173, 367.
van Vleck, J. H. 1948 Phys. Rev. 74, 1168.


The theory of plane plastic strain for anisotropic metals 

By R. Hill, Cavendish Laboratory, University of Cambridge


(Communicated by E. Orowan, F.R.S. Received 4 February 1949)


A yield criterion and plastic stress-strain relations are formulated for anisotropic metals
deformed under conditions of plane strain. The equations are shown to be hyperbolic, the
characteristics coinciding with the directions of maximum shear strain-rate. When the
anisotropy is uniformly distributed, the variation of the stresses along the characteristics
is expressed in terms of elliptic functions, and geometrical properties of the field of
characteristics are established. The theory is applied to the problem of indentation by a flat die.


1. Introduction

In an earlier paper (Hill 1948) a theory was proposed describing the macroscopic
plastic behaviour of polycrystalline anisotropic metals. The theory was shown to
be consistent with the experimental evidence then available, and has since been found
to be in accord with more recent data (an account will be presented elsewhere). It
seems worth while, therefore, to pursue the implications of the theory in greater
detail.

The present paper is concerned with the two-dimensional problem of plane plastic
strain. General methods of integrating the plane-strain equations for isotropic
metals are well understood; the equations are hyperbolic, the characteristics being
in the directions of maximum shear stress or shear strain-rate. Relations describing
the variation of the stress and velocity components along the characteristics,
together with various geometrical theorems, were established by Hencky (1923) and
Geiringer (1930). The equations of plane strain for anisotropic metals are also
hyperbolic, and the characteristics are in the directions of maximum shear strain-rate
(Hill 1948). The main purpose of the present paper is to obtain the relations holding
along the characteristics, and to examine whether there are simple geometrical
properties analogous to the theorems of Hencky for isotropic metals.



2. The equations referred to the axes of anisotropy


The anisotropy is regarded as being due to a preferred orientation of the grain 
texture, and is assumed to have three mutually orthogonal planes of symmetry at 
every point. Their lines of intersection are called the axes of anisotropy. In the 
following analysis it is supposed further, for simplicity, that the anisotropy is 
uniformly distributed in magnitude and direction. This is, for example, approxi- 
mately realized in a bar or strip cut from the central part of a cold-rolled sheet; the 
axes of anisotropy lie in the direction of rolling, transversely in the plane of the 
sheet, and normal to this plane. 

Let $(x, y, z)$ axes of reference be taken coincident with the axes of anisotropy.
Let the state of plane strain be such that the $z$ axis is normal to the planes of flow.
Then, according to the theory formulated in the earlier paper (Hill 1948), the yield
criterion takes the form

$$\frac{FG+GH+HF}{F+G}\,(\sigma_{xx}-\sigma_{yy})^2 + 2N\sigma_{xy}^2 = 1, \tag{1}$$

where $F$, $G$, $H$, $N$ are parameters expressing the state of anisotropy. If $X$, $Y$, $Z$ are
the uniaxial tensile yield stresses in the directions of the axes of anisotropy, and
$T$ is the yield stress in shear in the $(x, y)$ plane with respect to these axes, then


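$$2F = \frac{1}{Y^2}+\frac{1}{Z^2}-\frac{1}{X^2},\qquad
2G = \frac{1}{Z^2}+\frac{1}{X^2}-\frac{1}{Y^2},\qquad
2H = \frac{1}{X^2}+\frac{1}{Y^2}-\frac{1}{Z^2},\qquad
2N = \frac{1}{T^2}. \tag{2}$$

The stress $\sigma_{zz}$, normal to the planes of flow, is given by

$$\sigma_{zz} = (G\sigma_{xx}+F\sigma_{yy})/(G+F). \tag{3}$$

The stresses also satisfy the equations of equilibrium

$$\frac{\partial\sigma_{xx}}{\partial x}+\frac{\partial\sigma_{xy}}{\partial y}=0,\qquad
\frac{\partial\sigma_{xy}}{\partial x}+\frac{\partial\sigma_{yy}}{\partial y}=0. \tag{4}$$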


For reasons that will appear later it is convenient to write

$$c = 1-\frac{N(F+G)}{2(FG+GH+HF)} \qquad (-\infty < c < 1). \tag{5}$$

If $N$ is greater than both $F+2H$ and $G+2H$, $c$ is negative, while if $N$ is less than both
$F+2H$ and $G+2H$, $c$ is positive. States of anisotropy for which $N$ is intermediate
to $F+2H$ and $G+2H$ appear not to be easily realizable in practice. $c$ is zero if the
material is isotropic, and also for states of anisotropy such that

$$N = F+2H = G+2H \qquad (F \neq H).$$
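As a numerical illustration of these definitions, the following sketch computes $F$, $G$, $H$, $N$ from the yield stresses and then the constant $c$. It assumes that relations (2) and (5) read $2F = 1/Y^2 + 1/Z^2 - 1/X^2$ (and cyclically), $2N = 1/T^2$ and $c = 1 - N(F+G)/\{2(FG+GH+HF)\}$; the function names are hypothetical, and the isotropic check uses the von Mises relation $T = X/\sqrt{3}$.

```python
import math

def anisotropy_parameters(X, Y, Z, T):
    """F, G, H, N from the tensile yield stresses X, Y, Z and shear yield stress T."""
    F = 0.5 * (1.0 / Y**2 + 1.0 / Z**2 - 1.0 / X**2)
    G = 0.5 * (1.0 / Z**2 + 1.0 / X**2 - 1.0 / Y**2)
    H = 0.5 * (1.0 / X**2 + 1.0 / Y**2 - 1.0 / Z**2)
    N = 0.5 / T**2
    return F, G, H, N

def anisotropy_constant(F, G, H, N):
    """The constant c, taken as 1 - N(F + G) / (2(FG + GH + HF))."""
    return 1.0 - N * (F + G) / (2.0 * (F * G + G * H + H * F))

# Isotropic material: equal tensile yield stresses and T = X / sqrt(3).
F, G, H, N = anisotropy_parameters(1.0, 1.0, 1.0, 1.0 / math.sqrt(3.0))
print("isotropic c =", anisotropy_constant(F, G, H, N))

# N larger or smaller than both F + 2H and G + 2H (here both equal 3).
print("N = 4 gives c =", anisotropy_constant(1.0, 1.0, 1.0, 4.0))
print("N = 2 gives c =", anisotropy_constant(1.0, 1.0, 1.0, 2.0))
```

The signs obtained agree with the statement that $c$ is negative when $N$ exceeds both $F+2H$ and $G+2H$, and positive when it is less than both.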





With the notation of equation (5) the yield criterion can be rewritten as

$$\frac{(\sigma_{xx}-\sigma_{yy})^2}{4(1-c)}+\sigma_{xy}^2 = T^2. \tag{6}$$

The plane-strain tensile yield stress $\sigma$ in the direction making an angle $\theta$ with the
$x$ axis is found by substituting

$$\sigma_{xx} = \sigma\cos^2\theta,\qquad \sigma_{yy} = \sigma\sin^2\theta,\qquad \sigma_{xy} = \sigma\sin\theta\cos\theta,$$

in (6), leading to

$$\sigma = 2T\left(\frac{1-c}{1-c\sin^2 2\theta}\right)^{1/2}. \tag{7}$$

It is evident that $\sigma$ has equal values in any set of four directions $\pm\theta$, $\pm(\tfrac12\pi-\theta)$, and
hence that the angular variation of $\sigma$ is symmetrical about the axes of anisotropy
and about the 45° directions. The corresponding values (3) of $\sigma_{zz}$ are, however,
different unless $F = G$. If $c$ is positive, $\sigma$ has a minimum value $2T\sqrt{(1-c)}$ in the
directions of the axes of anisotropy, and a maximum value $2T$ in the 45° directions;
if $c$ is negative, $\sigma$ has a maximum value $2T\sqrt{(1-c)}$ in the directions of the axes of
anisotropy, and a minimum value $2T$ in the 45° directions.
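The angular variation described here is easily checked numerically. The sketch below assumes that equation (7) reads $\sigma = 2T\{(1-c)/(1-c\sin^2 2\theta)\}^{1/2}$; the function name and the sample values of $T$ and $c$ are illustrative only.

```python
import math

def plane_strain_yield_stress(theta, T, c):
    """Plane-strain tensile yield stress at angle theta (radians) to the x axis."""
    return 2.0 * T * math.sqrt((1.0 - c) / (1.0 - c * math.sin(2.0 * theta) ** 2))

T, c = 1.0, 0.4
for degrees in (0, 15, 30, 45, 60, 75, 90):
    theta = math.radians(degrees)
    print(f"theta = {degrees:2d} deg: sigma / 2T = "
          f"{plane_strain_yield_stress(theta, T, c) / (2.0 * T):.4f}")
```

With $c$ positive the printed ratio rises from $\sqrt{(1-c)}$ along the axes of anisotropy to unity at 45°, and is symmetrical about the 45° direction.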

The tensor $\dot\epsilon_{ij}$, representing the rate of strain referred to the axes of anisotropy,
is assumed to be related to the function $f(\sigma_{ij})$, governing yielding, by the equation

$$\dot\epsilon_{ij} = \lambda\,\frac{\partial f}{\partial\sigma_{ij}}, \tag{8}$$

where $\lambda$ is a positive factor of proportionality which varies both in space and time.
If other axes of reference are taken, the components of stress and strain-rate are
related by a similar equation, the form of the function $f$ being changed under the
transformation because of the anisotropy. If $v_x$ and $v_y$ denote the components of
velocity referred to the anisotropic directions, elimination of $\lambda$ from (8) gives

$$\frac{\partial v_x}{\partial x}+\frac{\partial v_y}{\partial y} = 0,\qquad
\frac{\partial v_x/\partial y+\partial v_y/\partial x}{\partial v_x/\partial x-\partial v_y/\partial y}
= \frac{2(1-c)\,\sigma_{xy}}{\sigma_{xx}-\sigma_{yy}}. \tag{9}$$

The first equation expresses the incompressibility of the material, and the second is
equivalent to a relation between the orientation $\psi$ of a principal stress direction,
with respect to the $x$ axis, and the orientation $\psi'$ of a principal strain-rate direction:

$$\tan 2\psi' = (1-c)\tan 2\psi. \tag{10}$$

Hence $\psi' = \psi$ only if $\psi$ = 0, 45 or 90°; this is a consequence of the fourfold symmetry
of the angular variation of the tensile yield stress.
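The relation between the two orientations can be illustrated by a short calculation. The sketch assumes that equation (10) reads $\tan 2\psi' = (1-c)\tan 2\psi$ and uses the two-argument arctangent so that $\psi'$ stays in the same quadrant as $\psi$; the function name is hypothetical.

```python
import math

def strain_rate_direction(psi, c):
    """Orientation psi' of a principal strain-rate direction (radians),
    given the orientation psi of a principal stress direction."""
    return 0.5 * math.atan2((1.0 - c) * math.sin(2.0 * psi), math.cos(2.0 * psi))

c = 0.5
for degrees in (0, 15, 30, 45, 60, 75, 90):
    psi = math.radians(degrees)
    psi_dash = strain_rate_direction(psi, c)
    print(f"psi = {degrees:2d} deg  psi' = {math.degrees(psi_dash):6.2f} deg")
```

The two directions coincide only at 0, 45 and 90°, as stated.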

The five equations (4), (6) and (9) between the five unknowns $\sigma_{xx}$, $\sigma_{yy}$, $\sigma_{xy}$, $v_x$
and $v_y$, involve only two parameters, namely, $T$, which is a measure of the average
resistance to deformation, and $c$, which specifies the state of anisotropy in the
planes of flow. $T$ and $c$ can be experimentally determined by two measurements,
for example, in compression tests (under conditions of plane strain) at 0 and 45° to
the axes of anisotropy. It is only necessary to know the separate magnitudes of the
four parameters $F$, $G$, $H$, $N$ (or $X$, $Y$, $Z$, $T$) if it is required to calculate $\sigma_{zz}$.



3. The existence of characteristics

It will now be shown that there exist curves (characteristics) across which certain
derivatives of the stress and velocity components may be discontinuous under
suitable boundary conditions. Suppose that the stress distribution has been determined
in some region to the left of a curve $C$ (figure 1), and that it is required to find whether
this solution can be continued to the right of $C$. Let Cartesian axes of reference
$(\xi, \eta)$ be taken coincident with the normal and tangent at some point $P$ on $C$, and
let $\phi_0$ be the anti-clockwise rotation of the $\xi$ axis from the $x$ direction of anisotropy.
The corresponding components of stress are denoted by $\sigma_{\xi\xi}$, $\sigma_{\eta\eta}$ and $\sigma_{\xi\eta}$.



Figure 1. Co-ordinate axes for characteristic directions.


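If the distribution of stress is assumed to be continuous, the derivatives $\partial\sigma_{\xi\xi}/\partial\eta$,
$\partial\sigma_{\eta\eta}/\partial\eta$ and $\partial\sigma_{\xi\eta}/\partial\eta$ at the point $P$ are continuous across $C$. Hence, from the equations
of equilibrium, $\partial\sigma_{\xi\xi}/\partial\xi$ and $\partial\sigma_{\xi\eta}/\partial\xi$ are also continuous across $C$ at $P$. Differentiation
of the yield criterion $f(\sigma_{\xi\xi}, \sigma_{\eta\eta}, \sigma_{\xi\eta}) = 0$ with respect to $\xi$ gives the equation

$$\frac{\partial f}{\partial\sigma_{\xi\xi}}\frac{\partial\sigma_{\xi\xi}}{\partial\xi}
+\frac{\partial f}{\partial\sigma_{\eta\eta}}\frac{\partial\sigma_{\eta\eta}}{\partial\xi}
+\frac{\partial f}{\partial\sigma_{\xi\eta}}\frac{\partial\sigma_{\xi\eta}}{\partial\xi} = 0, \tag{11}$$

from which to calculate $\partial\sigma_{\eta\eta}/\partial\xi$. The solution is unique, unless $\partial f/\partial\sigma_{\eta\eta} = 0$ at $P$, that
is, unless $\dot\epsilon_{\eta\eta} = 0$, according to (8). In this case, further boundary conditions must
be prescribed if the solution is to be continued across $C$; these may be such that
$\partial\sigma_{\eta\eta}/\partial\xi$ is discontinuous. $C$ is therefore a characteristic if it coincides at every point
with a direction of zero extension. Through each point pass two characteristics, the
orthogonal directions of zero extension. Since the material is incompressible these
are also the directions of maximum shear strain-rate (slip-lines), but not, in general,
maximum shear-stress directions.

If, then, the tangent at $P$ is a direction of zero extension, $\partial f/\partial\sigma_{\xi\xi} = -\partial f/\partial\sigma_{\eta\eta} = 0$.
From (11), $\partial\sigma_{\xi\eta}/\partial\xi = 0$, and similarly, by differentiating the yield criterion with respect
to $\eta$, it may be shown that $\partial\sigma_{\xi\eta}/\partial\eta = 0$. Hence, from the equations of equilibrium,

$$\frac{\partial\sigma_{\xi\xi}}{\partial\xi} = 0,\qquad \frac{\partial\sigma_{\eta\eta}}{\partial\eta} = 0, \tag{12}$$

when the $(\xi, \eta)$ axes coincide with the directions of zero extension at $P$.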

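The existence of characteristics for the velocity components $(v_\xi, v_\eta)$ may be
similarly demonstrated. Supposing the velocity to be continuous, the derivatives
$\partial v_\xi/\partial\eta$ and $\partial v_\eta/\partial\eta$ at $P$ must be continuous across $C$. From the incompressibility
equation it follows that $\partial v_\xi/\partial\xi$ is also continuous. However, if $\partial f/\partial\sigma_{\xi\xi} = -\partial f/\partial\sigma_{\eta\eta} = 0$,
there may be discontinuities in $\lambda$ and $\partial v_\eta/\partial\xi$. The directions of zero extension,
therefore, are also characteristics for the velocities. If the $(\xi, \eta)$ axes coincide with
the directions of zero extension at $P$,

$$\frac{\partial v_\xi}{\partial\xi} = 0,\qquad \frac{\partial v_\eta}{\partial\eta} = 0. \tag{13}$$

The inclinations $dy/dx$ of the characteristics relative to the $x$ axis of anisotropy
may easily be shown to be the roots of the equation

$$(\sigma_{xx}-\sigma_{yy})\left(\frac{dy}{dx}\right)^2 - 4(1-c)\,\sigma_{xy}\,\frac{dy}{dx} - (\sigma_{xx}-\sigma_{yy}) = 0. \tag{14}$$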

4. Relations along the characteristics

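For applications to special problems it is necessary to regard the characteristics
as curvilinear axes, and to introduce the corresponding stress components $\sigma_{\alpha\alpha}$, $\sigma_{\beta\beta}$, $\sigma_{\alpha\beta}$,
where the two families of characteristics are denoted by the symbols $\alpha$ and $\beta$.
The families are distinguished by the convention that $\sigma_{\alpha\beta}$ shall be a positive quantity,
to preserve the analogy with the isotropic theory. If $\phi$ is the anti-clockwise
orientation of an $\alpha$ characteristic to the $x$ axis of anisotropy, the yield criterion (6) becomes

$$f(\sigma_{\alpha\alpha}, \sigma_{\beta\beta}, \sigma_{\alpha\beta}) \equiv
\frac{1}{1-c}\left[(\sigma_{\alpha\alpha}-\sigma_{\beta\beta})\cos 2\phi - 2\sigma_{\alpha\beta}\sin 2\phi\right]^2
+\left[(\sigma_{\alpha\alpha}-\sigma_{\beta\beta})\sin 2\phi + 2\sigma_{\alpha\beta}\cos 2\phi\right]^2 = 4T^2$$

on transforming the components of stress. The condition for a characteristic, namely,
$\partial f/\partial\sigma_{\alpha\alpha} = -\partial f/\partial\sigma_{\beta\beta} = 0$, is

$$\frac{\cos 2\phi}{1-c}\left[(\sigma_{\alpha\alpha}-\sigma_{\beta\beta})\cos 2\phi - 2\sigma_{\alpha\beta}\sin 2\phi\right]
+\sin 2\phi\left[(\sigma_{\alpha\alpha}-\sigma_{\beta\beta})\sin 2\phi + 2\sigma_{\alpha\beta}\cos 2\phi\right] = 0.$$

Solving these two equations for $(\sigma_{\alpha\alpha}-\sigma_{\beta\beta})$ and $\sigma_{\alpha\beta}$,

$$\frac{\sigma_{\alpha\beta}}{T} = (1-c\sin^2 2\phi)^{1/2},\qquad
\frac{\sigma_{\alpha\alpha}-\sigma_{\beta\beta}}{T} = \frac{2c\sin 2\phi\cos 2\phi}{(1-c\sin^2 2\phi)^{1/2}},$$

where the positive square root is to be taken. It may be verified that

$$\sigma_{\alpha\alpha}-\sigma_{\beta\beta} = -\,d\sigma_{\alpha\beta}/d\phi.$$

Hence

$$\sigma_{\alpha\alpha} = -p-\tfrac12 T h'(\phi),\qquad
\sigma_{\beta\beta} = -p+\tfrac12 T h'(\phi),\qquad
\sigma_{\alpha\beta} = T h(\phi), \tag{16}$$

where $h(\phi) = (1-c\sin^2 2\phi)^{1/2}$,
and $p = -\tfrac12(\sigma_{\alpha\alpha}+\sigma_{\beta\beta})$ is the mean compressive stress.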



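To express the relations (12) along the characteristics in terms of the variation
of $p$ and $\phi$, substitute

$$\sigma_{\xi\xi} = -p+\tfrac12(\sigma_{\alpha\alpha}-\sigma_{\beta\beta})\cos 2(\phi-\phi_0)-\sigma_{\alpha\beta}\sin 2(\phi-\phi_0),$$

$$\sigma_{\eta\eta} = -p-\tfrac12(\sigma_{\alpha\alpha}-\sigma_{\beta\beta})\cos 2(\phi-\phi_0)+\sigma_{\alpha\beta}\sin 2(\phi-\phi_0).$$

With the use of (16), equations (12) become

$$\left[\frac{\partial p}{\partial\xi}+T\left(2h+\tfrac12 h''\right)\frac{\partial\phi}{\partial\xi}\right]_{\phi=\phi_0} = 0,\qquad
\left[\frac{\partial p}{\partial\eta}-T\left(2h+\tfrac12 h''\right)\frac{\partial\phi}{\partial\eta}\right]_{\phi=\phi_0} = 0.$$

Hence, if $g(\phi)$ is a function such that $g'(\phi) = h+\tfrac14 h''$,

$$\frac{\partial}{\partial\xi}\left(\frac{p}{2T}+g\right) = 0,\qquad
\frac{\partial}{\partial\eta}\left(\frac{p}{2T}-g\right) = 0$$

at $P$. Since $P$ is a general point,

$$\frac{p}{2T}+g = \text{constant on an } \alpha\text{-characteristic},\qquad
\frac{p}{2T}-g = \text{constant on a } \beta\text{-characteristic}, \tag{17}$$

where

$$g(\phi) = \tfrac14 h'(\phi)+\int_0^{\phi} h(\phi)\,d\phi.$$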


By the definition (16) of $h(\phi)$, $g$ can be expressed as

$$g(\phi) = -\frac{\tfrac12 c\sin 2\phi\cos 2\phi}{(1-c\sin^2 2\phi)^{1/2}}+\tfrac12 E(u, k), \tag{18}$$

where $\sin 2\phi = \operatorname{sn}(u, k)$ and $k^2 = c$.
$\operatorname{sn}(u, k)$ is the Jacobian elliptic function with modulus $k$ (which takes complex
values when $c$ is negative), and $E(u, k)$ is the standard elliptic integral of the second
kind*

$$E(u, k) = \int_0^u \operatorname{dn}^2(u, k)\,du = \int_0^{2\phi}(1-k^2\sin^2\theta)^{1/2}\,d\theta.$$

For values of $\phi$ in the range $(-\tfrac14\pi, \tfrac14\pi)$ $u$ lies in the range $(-K, K)$, where $K$ is the
quarter-period of the elliptic function. When $\phi$ lies in the range $(\tfrac14\pi, \tfrac12\pi)$ $u$ must be
taken in the range $(K, 2K)$, and so on for other values of $\phi$. When $c = 0$, equations
(17) reduce to the well-known relations for isotropic metals, due to Hencky (1923):

$$\frac{p}{2T}+\phi = \text{constant on an } \alpha\text{-characteristic},\qquad
\frac{p}{2T}-\phi = \text{constant on a } \beta\text{-characteristic}.$$

* Tabulated in Jahnke & Emde (1933).
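Where tables of the elliptic integral are not to hand, $g(\phi)$ can be evaluated by direct quadrature. The sketch below is a minimal illustration, assuming $g(\phi) = \tfrac14 h'(\phi) + \int_0^\phi h\,d\phi$ with $h(\phi) = (1 - c\sin^2 2\phi)^{1/2}$; the function names are hypothetical and Simpson's rule is used in place of tabulated values.

```python
import math

def h(phi, c):
    """h(phi) = (1 - c sin^2 2phi)^(1/2)."""
    return math.sqrt(1.0 - c * math.sin(2.0 * phi) ** 2)

def h_prime(phi, c):
    """dh/dphi, obtained by differentiating h directly."""
    return -2.0 * c * math.sin(2.0 * phi) * math.cos(2.0 * phi) / h(phi, c)

def integral_of_h(phi, c, n=2000):
    """Integral of h from 0 to phi by composite Simpson's rule (n even)."""
    step = phi / n
    total = h(0.0, c) + h(phi, c)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * h(i * step, c)
    return total * step / 3.0

def g(phi, c):
    """g(phi) = h'(phi)/4 + integral of h from 0 to phi (assumed form)."""
    return 0.25 * h_prime(phi, c) + integral_of_h(phi, c)

# Isotropic limit: g reduces to phi.
print(g(0.3, 0.0))
# Change in g over a right angle for c = 0.5, from two different starting angles.
print(g(0.1 + math.pi / 2, 0.5) - g(0.1, 0.5))
print(g(0.7 + math.pi / 2, 0.5) - g(0.7, 0.5))
```

For $c = 0$ the function reduces to $\phi$, recovering the Hencky relations, while the change in $g$ over a right angle is the same from any starting angle and equals the complete elliptic integral $E$ with $k^2 = c$.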



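Alternatively, $u$ may be regarded as the independent variable instead of $\phi$ and the
formulae written in the form

$$\frac{p}{T}-\frac{k^2\operatorname{sn}u\operatorname{cn}u}{\operatorname{dn}u}+E(u) = \text{constant on an } \alpha\text{-characteristic},\qquad
\frac{p}{T}+\frac{k^2\operatorname{sn}u\operatorname{cn}u}{\operatorname{dn}u}-E(u) = \text{constant on a } \beta\text{-characteristic}. \tag{19}$$

To transform (13) in a similar way, substitute

$$v_\xi = v_\alpha\cos(\phi-\phi_0)-v_\beta\sin(\phi-\phi_0),\qquad
v_\eta = v_\alpha\sin(\phi-\phi_0)+v_\beta\cos(\phi-\phi_0),$$

where $v_\alpha$ and $v_\beta$ are the components of velocity referred to the $(\alpha, \beta)$ curvilinear axes.
Then

$$\left[\frac{\partial}{\partial\xi}\{v_\alpha\cos(\phi-\phi_0)-v_\beta\sin(\phi-\phi_0)\}\right]_{\phi=\phi_0} = 0,\qquad
\left[\frac{\partial}{\partial\eta}\{v_\alpha\sin(\phi-\phi_0)+v_\beta\cos(\phi-\phi_0)\}\right]_{\phi=\phi_0} = 0.$$

These reduce to

$$dv_\alpha - v_\beta\,d\phi = 0 \text{ along an } \alpha\text{-characteristic},\qquad
dv_\beta + v_\alpha\,d\phi = 0 \text{ along a } \beta\text{-characteristic}. \tag{20}$$

These are identical in form with the equations of Geiringer (1930) for isotropic
metals.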


5. Properties of the field of characteristics

In the isotropic theory of plane plastic strain several geometrical properties of
the field of characteristics were established by Hencky. Since they are indispensable
for the solution of special problems, it is natural to examine whether analogous
properties exist in the anisotropic theory.

It is convenient to introduce the quantities $(\alpha, \beta)$ defined by the relations

$$2\alpha = \frac{p}{2T}-g,\qquad 2\beta = \frac{p}{2T}+g. \tag{21}$$

According to (17), $\alpha$ is constant along a $\beta$-curve, and $\beta$ is constant along an $\alpha$-curve.
$\alpha$ and $\beta$ may therefore be regarded as curvilinear co-ordinates for the field of
characteristics. From (21)

$$g = \beta-\alpha,\qquad \frac{p}{2T} = \alpha+\beta. \tag{22}$$

Consider any pair of $\beta$-curves, $\alpha = \alpha_1$ and $\alpha = \alpha_2$, say. The difference in the values of
$g$ where an $\alpha$-curve ($\beta$ = constant) cuts the two $\beta$-curves is, by (22), equal to $\alpha_2-\alpha_1$,
and is therefore independent of $\beta$. Thus, the difference in the values of $g$ (or $p$),
where two given curves of one family are cut by any curve of the other family, is a
constant. Conversely, any two orthogonal families of curves possessing this
property constitute a characteristic field for a plastic mass in equilibrium under certain
boundary conditions. In Hencky's theorem it is the difference in the values of $\phi$
that is constant along two given characteristics. This is not true for anisotropic
material except in the special case where the difference is $\tfrac12\pi$.

If, now, a section of an $\alpha$-curve, say, is straight, $\phi$ is constant along it; hence $g$, $p$
and $\alpha$ (as well as $\beta$) are all constant along the section. It follows from the previous
theorem that the corresponding sections of all $\alpha$-curves are also straight, and so,
as a simple consequence, that they are of equal length. An example of such a field is
that consisting of radii and concentric circular arcs, whose common centre is a point
singularity for the stress distribution. This field appears in the problem solved
below.


6. Indentation by a flat rigid die

It is evident that the present theory is only applicable throughout a process of
plastic deformation so long as the state of anisotropy does not change appreciably,
or changes in such a way as to remain uniformly distributed. The indentation of the
plane surface of a block of metal by a flat rigid die satisfies the first condition, since
the amount of plastic strain in the material near the die is restricted by the
surrounding elastic material. The indentation of an isotropic mass was first discussed
by Prandtl (1920), and recently by the writer (Hill 1949). In the latter paper it is
shown that, if elastic strains are neglected, indentation cannot begin until the plastic
region has spread sufficiently far to include the two velocity characteristics extending
from some point on the die to the free surface. The development of the plastic zone,
as the load on the die is increased, is governed by the state of stress in the elastic,
or non-plastic, material. However, with a natural assumption about the general
direction in which the plastic zone spreads, the distribution of stress on the die can
be determined at the moment when indentation begins, without a detailed knowledge
of the stress in the non-plastic material. These considerations are equally applicable
to the indenting of an anisotropic mass.



Figure 2. Plastic region and characteristics for indenting of block
with $x$ axis of anisotropy inclined at an angle $\gamma$.

In figure 2 let $OO'$ be the surface of contact (assumed frictionless) between the die
and the material. The dimensions of the block, and the length of the die in the
direction normal to the plane of the paper, are supposed to be large compared with
the width $OO'$ of the die. At the moment when indentation becomes possible, it is
assumed that the plastic zone covers the area between the broken curve and the
section $PP'$ of the surface. By the properties of characteristics the state of stress
is uniquely determined within the triangles $OPQ$ and $O'P'Q'$ formed by the
intersecting pairs of characteristics through $O$, $P$ and $O'$, $P'$, respectively. The state of
stress in each of these regions is a uniform compression parallel to the surface. Let
$\gamma$ ($0 < \gamma < \tfrac12\pi$) be the angle between the surface and the $x$ axis of anisotropy (in the
sense indicated in figure 2), and let angles $POQ$ and $O'P'Q'$ be denoted by $\delta$. Then,
according to (7) and (14),

$$p_0 = T\left(\frac{1-c}{1-c\sin^2 2\gamma}\right)^{1/2},\qquad
\delta = \gamma+\tfrac12\cot^{-1}\{(1-c)\tan 2\gamma\}, \tag{24}$$

where $p_0$ is the mean compressive stress in these regions and the inverse cotangent
is an angle in the interval $(-\tfrac12\pi, \tfrac12\pi)$. It is easy to show that, no matter what the
value of $\gamma$, $\delta$ lies between $\cot^{-1}\sqrt{(1-c)}$ and $\tfrac12\pi-\cot^{-1}\sqrt{(1-c)}$.
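The stated limits on $\delta$ can be confirmed numerically. The sketch below assumes that equation (24) gives $\delta = \gamma + \tfrac12\cot^{-1}\{(1-c)\tan 2\gamma\}$ with the inverse cotangent taken in $(-\tfrac12\pi, \tfrac12\pi)$; the function names are hypothetical, and the two-argument arctangent is used only to avoid the singularity of $\tan 2\gamma$ at $\gamma = \tfrac14\pi$.

```python
import math

def characteristic_angle(gamma, c):
    """Angle delta (radians) between the surface and a characteristic."""
    # cot^-1{(1 - c) tan 2gamma} on the branch (-pi/2, pi/2), written with
    # atan2 so that gamma = pi/4 (infinite tangent) needs no special case.
    inverse_cot = math.atan2(math.cos(2.0 * gamma), (1.0 - c) * math.sin(2.0 * gamma))
    return gamma + 0.5 * inverse_cot

def delta_limits(c):
    """The two limits cot^-1 sqrt(1 - c) and pi/2 - cot^-1 sqrt(1 - c), sorted."""
    first = math.atan2(1.0, math.sqrt(1.0 - c))
    second = 0.5 * math.pi - first
    return min(first, second), max(first, second)

c = 0.5
low, high = delta_limits(c)
values = [characteristic_angle(i * math.pi / 360.0, c) for i in range(181)]
print(f"limits: {math.degrees(low):.2f} to {math.degrees(high):.2f} deg")
print(f"range of delta: {math.degrees(min(values)):.2f} to {math.degrees(max(values)):.2f} deg")
```

For $c = 0$ the sketch returns $\delta = \tfrac14\pi$ for every $\gamma$, the isotropic value.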

The characteristic fields in the regions $OQR$ and $O'Q'R'$, defined respectively by
the singularities $O$ and $O'$ and the positions of the characteristics $OQ$ and $O'Q'$,
consist of radii and concentric circular arcs. The positions of $OR$ and $O'R'$ are
determined by the condition that the surface $OO'$ is free from friction. The regions $ORS$
and $O'R'S$ are therefore uniformly stressed, the principal axes of stress being
parallel and perpendicular to the surface. Hence the orientation of the
characteristics is the same as in regions $OPQ$ and $O'P'Q'$, the angles $QOR$ and $Q'O'R'$ being
$\tfrac12\pi$. According to (17) and (18), the value of $p$ in regions $ORS$ and $O'R'S$ is equal to
$p_0+2TE$, where

$$E = E(K, k) = \int_0^{\frac12\pi}(1-k^2\sin^2\theta)^{1/2}\,d\theta$$

is the complete elliptic integral of the second kind. The pressure on the die is therefore
uniformly distributed, and is of amount $P$, where

$$\frac{P}{2T} = \left(\frac{1-c}{1-c\sin^2 2\gamma}\right)^{1/2}+E. \tag{25}$$

When $c = 0$ this reduces to the Prandtl formula $P = 2T(1+\tfrac12\pi)$. For a small degree
of anisotropy, $E$ can conveniently be calculated from the series

$$E = \tfrac12\pi\,F(-\tfrac12, \tfrac12; 1; c) = \tfrac12\pi\left(1-\frac{c}{4}-\frac{3c^2}{64}-\frac{5c^3}{256}-\dots\right), \tag{26}$$

where $F$ is the hypergeometric function. The series expansion for $P$ is

$$\frac{P}{2T} = 1+\tfrac12\pi-\tfrac12 c\left(\tfrac14\pi+1-\sin^2 2\gamma\right)+\dots. \tag{27}$$

Tables for $E$ as a function of $k^2$ are available (Jahnke & Emde 1933) for the
calculation of $P$ for finite degrees of anisotropy (figure 3). If $c$ is positive, $P$ is always
less than $2T(1+\tfrac12\pi)$; if $c$ is negative, $P$ is greater than $2T(1+\tfrac12\pi)$. Furthermore,
$P$ and the configuration of characteristics are the same for orientations $\gamma$ and
$\tfrac12\pi-\gamma$ of the axes of anisotropy; this is due to the symmetry of the anisotropy about
directions making 45° with the axes of anisotropy (§2).
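The indentation pressure can also be evaluated without tables. The following sketch assumes that equation (25) reads $P/2T = \{(1-c)/(1-c\sin^2 2\gamma)\}^{1/2} + E$ and that the first-order expansion is $P/2T = 1 + \tfrac12\pi - \tfrac12 c(\tfrac14\pi + 1 - \sin^2 2\gamma)$; the function names are hypothetical and $E$ is obtained by Simpson's rule.

```python
import math

def complete_E(c, n=2000):
    """Complete elliptic integral of the second kind with k^2 = c,
    by composite Simpson's rule on (0, pi/2); n must be even."""
    step = 0.5 * math.pi / n
    f = lambda t: math.sqrt(1.0 - c * math.sin(t) ** 2)
    total = f(0.0) + f(0.5 * math.pi)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(i * step)
    return total * step / 3.0

def die_pressure(gamma, T, c):
    """Indentation pressure P from the assumed form of equation (25)."""
    yield_term = math.sqrt((1.0 - c) / (1.0 - c * math.sin(2.0 * gamma) ** 2))
    return 2.0 * T * (yield_term + complete_E(c))

def die_pressure_series(gamma, T, c):
    """First-order expansion of P in c, as assumed for equation (27)."""
    bracket = 0.25 * math.pi + 1.0 - math.sin(2.0 * gamma) ** 2
    return 2.0 * T * (1.0 + 0.5 * math.pi - 0.5 * c * bracket)

T = 1.0
prandtl = 2.0 * T * (1.0 + 0.5 * math.pi)
for c in (-0.3, 0.0, 0.1, 0.3):
    exact = die_pressure(0.3, T, c)
    approx = die_pressure_series(0.3, T, c)
    print(f"c = {c:+.1f}: P = {exact:.4f}, series = {approx:.4f}, Prandtl = {prandtl:.4f}")
```

The first-order series is adequate only for small $c$; for larger anisotropy the full expression should be used.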



The position of the point $S$ has so far not been specified. For isotropic material
symmetry requires that $S$ should be the midpoint of the die face. For anisotropic
material, however, a priori considerations cannot decide the position of $S$, since it
is dependent upon the state of stress in the non-plastic material, for it is this that
controls the development of the plastic zone.



Figure 3. Relation between indentation pressure and orientation
of axes of anisotropy for various values of $c$.


As in the isotropic theory, the flow streamlines coincide with the characteristics
parallel to $PQRS$ and $P'Q'R'S$, respectively. This follows from the property of
equations (20) that the component of velocity in the direction of straight characteristics
remains constant along them. The plastic material below $PQRS$ and $P'Q'R'S$,
though stressed to the yield limit, is constrained to remain rigid by the surrounding
non-plastic material. Hence the constant value of the velocity component along
each of the characteristics normal to $PQRS$ and $P'Q'R'S$ must be zero, proving
the stated result. It follows that the resultant velocity is of magnitude V sec S in
$O'P'Q'R'S$, and of magnitude V cosec S in $OPQRS$, where $V$ is the downward speed
of the die.


References 

Geiringer, H. 1930 Proc. 3rd Int. Congr. Appl. Mech. 2, 185.
Hencky, H. 1923 Z. angew. Math. Mech. 3, 241.
Hill, R. 1948 Proc. Roy. Soc. A, 193, 281.
Hill, R. 1949 Quart. J. Mech. Appl. Math. 2, 40.
Jahnke, E. & Emde, F. 1933 Funktionentafeln, 2nd ed. p. 141.
Prandtl, L. 1920 Nachr. Ges. Wiss. Göttingen, p. 74.






A theory of the film phenomena of liquid helium II

By H. N. V. Temperley, King's College, University of Cambridge
(Communicated by D. R. Hartree, F.R.S. Received 28 February 1949)

A theory of the liquid helium film on the general lines of that due to Schiff (1941) is
proposed, the attraction between the walls of the container and the helium atoms being balanced
against gravity, but the wave-like nature of the helium atoms and their mutual attractions
are now considered. The predictions of the variation of film thickness with height agree
with experiment in order of magnitude. A start is made on the problem of calculating the
variation of film thickness with temperature, and a new interpretation of the rate of transfer
of helium by the film is suggested.

1. Introduction

Theories of the formation of the Rollin film have been given by Frenkel (1940),
Schiff (1941) and by Bijl, de Boer & Michels (1941). The first two authors gave theories
which are based on classical considerations, except that they assume an attractive
force of London type between the helium atoms and the atoms of the walls. Frenkel
(1940) attempts to account for the fact that ordinary liquids do not form such thick
films by postulating that the relations between the various surface energies involved
may be different in helium from what they are in ordinary liquids, while Schiff
(1941) points to the very small viscosity of liquid helium II, which might mean that
ordinary liquids are unable to form such films because they would evaporate faster
than fresh liquid could flow into them from the bulk liquid, owing to the retarding
effect of viscosity. Bijl et al. (1941) postulate that the abnormal thickness of the
helium film is a consequence of the existence of zero-point energy, which 'blows out'
the lattice-spacing of the atoms in the film in much the same way that the lattice-spacing
of the atoms in the liquid is greater than can be accounted for on the basis
of the interatomic forces (Simon 1934). However, they assume for this zero-point
energy of the film the expression $h^2/8mt^2$ per atom, where $t$ is the thickness of the
film, which is obtained by neglecting the interactions of the atoms with one another,
and assuming that they are described by a wave function of sine type, vanishing at
the wall and at the outside boundary of the film. (In what follows, the 'inside' of
the film is always to be taken to mean that part nearest the wall.) The form assumed
for this zero-point energy has been severely criticized by Mott (1949) on the ground
that such a term would be completely absent if interactions were taken into account
in any reasonable way. For a film about 100 atoms thick, it certainly seems that
any zero-point energy would be mainly settled by the lattice-spacing, which would
fix the volume available per atom, rather than directly by the film thickness.

2. Description of the theory

The energy-levels available to a helium atom moving in an inverse-cube law
potential due to the attraction of the wall are first considered. The strength of this
attraction can be estimated from the adsorption energy, and it is shown that, by
itself, this field is insufficient to provide bound states for atoms at distances of the
order of $3 \times 10^{-6}$ cm. from the wall. A similar problem is then considered for a helium
atom moving in the combined field of the wall, and of layers of atoms covering it,
the attraction of the latter being estimated from the interaction potential for helium
atoms. The behaviour of such a film, in equilibrium with a liquid above and below
the $\lambda$ point, is next studied, on the assumption that the liquid undergoes a 'Bose-Einstein
condensation', though a detailed knowledge of the energy-levels of the
liquid is not required. It is found that the fact that the film is thick only below (and
possibly for a short range above) the $\lambda$ point can be satisfactorily explained.
The 'bound' states, representing an atom adsorbed on top of the film, are relatively
few in number compared with the 'volume' states, representing an atom moving
about in the liquid or vapour phase, but they are, in general, of lower energy. As the
film becomes thicker, the energy gain on going into a bound state becomes less,
because of the rapidly diminishing attraction of the walls. Below the $\lambda$ point the
film may continue to build up nearly to the thickness at which bound states are no
longer possible, but above the $\lambda$ point the main consideration is that there are
relatively few bound states, and they will only fill as long as there is an appreciable
energy gain to compensate for the small number of states. A theory of the variation
of thickness of film with temperature is given, but there appear to be no reliable data
available at present. However, by balancing the change of gravitational energy with
height against the change of adsorption energy with thickness, it seems to be possible
to get agreement with some unpublished data of L. C. Jackson on the variation of
film thickness with height. Lastly, it seems possible to interpret physically the
observed 'velocity of transfer' and the observed effect of a constriction in the tube
through which the film is flowing.

These bound states are presumably present on any boundary wall, even if it is 
submerged in the liquid, the bound states being sharply distinguished from the liquid 
states by the fact that their wave functions vanish except in the immediate neigh- 
bourhood of the wall. The existence of these additional states may not have any 
great effect on the equilibrium properties of the liquid, but certainly will affect the 
transport properties profoundly. It is hoped to examine this latter point in another 
paper. 

It should be pointed out that another investigation of the writer’s (to be published 
shortly) has shown that existing theories of the ‘Bose-Einstein condensation’ are 
unreliable in the transition region, though they give correct results at high and at 
low temperatures, and the qualitative prediction of a condensation effect is justified 
by more exact investigation. It is therefore likely that the theory given in this paper 
of the variation of thickness of film with temperature is unreliable near the transition 
temperature, but it is put on record pending the discovery of an exact method of 
handling such theories in the transition region. 

3. The motion of a helium atom in the attractive
field due to the wall

It is assumed that the attraction of a wall molecule for a helium atom is of the
London inverse sixth-power type, which leads to an inverse-cube law for the
potential of a helium atom near a semi-infinite wall. The corresponding wave
equation is

$$\frac{d^2\psi}{dx^2} + \left(\frac{8\pi^2 m E}{h^2} + \frac{k^2}{(x+p)^3}\right)\psi = 0, \tag{1}$$

where the closest distance of approach of the atom to the wall is the quantity $p$, so
that $\psi$ must vanish for $x$ zero. If the wall is bare, we take $p$ equal to $s$, the atomic
diameter (not the radius because the radius of an atom of the wall must be allowed
for), while if the wall is covered with a film of thickness $t$, the closest distance of
approach is then $s + t$. For $E$ zero, equation (1) can be solved exactly, the solution
being $\psi = (x+p)^{1/2} J_1\{2k/(x+p)^{1/2}\}$ (the other solution becoming infinite at infinity). We can
estimate the constant $k^2$ determining the strength of the attractive field, by the
following argument. The adsorption energy of helium on glass is known (Keesom
& Schweers 1941) to be in the region of 90 cal./mole. On a classical basis this
would suggest a value of $\dfrac{\ldots}{6.06 \times 10^{23}} \times (2.6 \times 10^{-8})^3 = 9.16 \times 10^{-38}$ for $h^2k^2/8\pi^2 m$, or
$k^2 = 1.11 \times 10^{-6}$ if $s$, the atomic diameter, is taken as $2.6 \times 10^{-8}$ cm., but we have
still to verify that this field is sufficient to give a strongly bound state, so that the
actual energy lies near the bottom of the potential well. If $p$ is the closest distance of
approach, the wave function must vanish at $x$ zero. The first non-trivial zero of
$J_1(q)$ is at $q = 3.83$, so that the condition for a bound state to be possible is $2k/p^{1/2} > 3.83$,
or $p < 3.02 \times 10^{-7}$ cm. We thus conclude that for $p = s$, the effective diameter of a
helium atom (taken as $2.6 \times 10^{-8}$ cm.), the atom is very strongly bound and our
assumption is justified. (As a matter of fact, there is a second bound state lying above
the one in question.) It is possible to determine, by a variation method, the height of
the bound state above the bottom of the well, and thus to compute a correction to $k^2$,
but since $k^2$ is not required very accurately, this will not be done here. For this value
of $k^2$, the maximum value of $p$ for which a bound state is possible is $3.02 \times 10^{-7}$ cm.
To account for a film of the order of $3 \times 10^{-6}$ cm. thick, $k^2$ would have to be increased
by a factor of 10, which may be ruled out as quite impossible, in view of the adsorption
data.

We therefore conclude that the attraction of the walls is incapable, by itself, of
holding films of the observed thickness.
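
The numbers in this section can be reproduced with a short calculation. The sketch below is illustrative and not part of the original paper: it assumes the zero-energy solution has the form $\psi = u^{1/2}J_1(2k/u^{1/2})$ with $u = x + p$, checks by finite differences that this form satisfies $\psi'' + k^2\psi/u^3 = 0$, locates the first zero of $J_1$ by bisection, and evaluates the largest closest-approach distance $p = 4k^2/q_1^2$ for the quoted $k^2 = 1.11 \times 10^{-6}$ cm. All function names are hypothetical.

```python
import math


def bessel_j1(x, terms=40):
    """J1(x) from its power series; adequate for moderate |x|."""
    total = 0.0
    for m in range(terms):
        total += ((-1) ** m) * (x / 2.0) ** (2 * m + 1) / (
            math.factorial(m) * math.factorial(m + 1)
        )
    return total


def first_zero_j1(lo=3.0, hi=4.5, tol=1e-12):
    """Bisection for the first positive zero of J1 (J1 changes sign on [3, 4.5])."""
    f_lo = bessel_j1(lo)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        f_mid = bessel_j1(mid)
        if f_lo * f_mid <= 0:
            hi = mid
        else:
            lo, f_lo = mid, f_mid
    return 0.5 * (lo + hi)


def psi(u, k):
    """Assumed zero-energy solution, with u = x + p."""
    return math.sqrt(u) * bessel_j1(2.0 * k / math.sqrt(u))


def ode_residual(u, k, h=1e-4):
    """Finite-difference value of psi'' + k^2 psi / u^3, which should be close to zero."""
    second = (psi(u + h, k) - 2.0 * psi(u, k) + psi(u - h, k)) / h ** 2
    return second + k ** 2 * psi(u, k) / u ** 3


def max_closest_approach(k_squared, q1):
    """Largest p for which 2k / sqrt(p) exceeds the first zero q1 of J1."""
    return 4.0 * k_squared / q1 ** 2


if __name__ == "__main__":
    q1 = first_zero_j1()
    p_max = max_closest_approach(1.11e-6, q1)  # k^2 in cm, as quoted in the text
    print("first zero of J1:", q1)
    print("residual of the ODE at u = 0.5, k = 1:", ode_residual(0.5, 1.0))
    print("maximum p (cm):", p_max)
    print("factor needed to reach 3e-6 cm:", 3e-6 / p_max)
```

Because the limiting $p$ scales linearly with $k^2$, holding a film of $3 \times 10^{-6}$ cm by the wall alone would need $k^2$ roughly ten times larger, which is the point made in the text.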


4. Attraction by the combined field of wall and helium film
The wave equation may now be written

$$\frac{\partial^2\psi}{\partial x^2} + \left(\frac{8\pi^2 m E}{h^2} + \frac{k^2}{(s+t+x)^3} + \frac{l^2}{(s+x)^3} - \frac{l^2}{(s+t+x)^3}\right)\psi = 0, \tag{2}$$

where the last two terms represent the van der Waals attraction due to the film of
thickness $t$. The value of $l^2$ is needed as accurately as possible, as the predictions of
the theory prove to be sensitive to its precise value. Unfortunately, there are two
sources of uncertainty. In the first place, the attractive force between two helium
atoms is not known very accurately; in the second place we have no knowledge of
the density of helium atoms in the film. We shall take the value $1.5 \times 10^{-60}/r^6$ erg for the
attraction potential between two atoms, which Keesom's discussion (chap. II of his
book Helium) seems to show to be reasonably consistent with most of the evidence.
As we are regarding the atoms as impenetrable spheres, we are not interested in the
various repulsion potentials that have been proposed. We shall take the density to
be the same as that of the bulk liquid,* the reason for this being that the density of
most of the film is probably fixed by the same consideration that fixes that of the
bulk liquid, namely, a balance between the attractive forces and the 'zero-point
repulsion' (Simon 1934), rather than between the attractive and repulsive forces.
No doubt, in the first few inside layers, the density is increased by the effect of the
powerful field of the wall to a value approximating to that of the solid, but a rough
calculation indicates that this effect can hardly be appreciable beyond the third or
fourth layer. (In the first layer the potential energy due to the wall is 90 cal./mole
compared with a value of 70 cal./mole for the estimated zero-point energy of the
bulk liquid.)

By two simple integrations, we find that the energy of a single atom due to the
attraction of a slab of liquid of thickness $t$ is given by the expression

$$\frac{\pi N}{6}\left(\frac{K}{(x+s)^3} - \frac{K}{(x+s+t)^3}\right),$$

if the attraction of a single atom is $K/r^6$ and the slab contains $N$ atoms/cm.$^3$ Inserting
the value 0.146 for the density, we obtain the value $1.73 \times 10^{-38}$ for $h^2l^2/8\pi^2 m$, or
$l^2 = 2.10 \times 10^{-7}$.
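
The 'two simple integrations' and the resulting constants can be checked numerically. The sketch below is illustrative only. It integrates the pair attraction $K/r^6$ first over a plane layer (done analytically, giving $\pi K/2z^4$ per unit area density) and then over the slab thickness by a midpoint rule, and compares with the closed form. The values taken for Avogadro's number, the atomic weight of helium and Planck's constant are assumptions chosen to resemble constants in use around 1949; they are not stated in the paper.

```python
import math

K = 1.5e-60            # erg cm^6, pair attraction constant quoted in the text
DENSITY = 0.146        # g/cm^3, bulk liquid density quoted in the text
AVOGADRO = 6.06e23     # per mole (assumed value)
ATOMIC_WEIGHT = 4.003  # g/mole for helium (assumed value)
PLANCK = 6.62e-27      # erg s (assumed value)


def slab_energy_closed(d, t, n_density, k_pair):
    """Closed form (pi N K / 6) * (1/d^3 - 1/(d+t)^3) for an atom at distance d."""
    return math.pi * n_density * k_pair / 6.0 * (1.0 / d ** 3 - 1.0 / (d + t) ** 3)


def slab_energy_numeric(d, t, n_density, k_pair, steps=2000):
    """Midpoint-rule integration over plane layers of the slab.

    A plane layer at depth z contributes n_density * pi * k_pair / (2 z^4) per unit thickness.
    """
    h = t / steps
    total = 0.0
    for i in range(steps):
        z = d + (i + 0.5) * h
        total += n_density * math.pi * k_pair / (2.0 * z ** 4)
    return total * h


def film_constants():
    """Return (pi N K / 6, l^2) using the constants assumed above."""
    n_density = DENSITY * AVOGADRO / ATOMIC_WEIGHT   # atoms per cm^3
    mass = ATOMIC_WEIGHT / AVOGADRO                  # grams per atom
    coefficient = math.pi * n_density * K / 6.0      # erg cm^3
    l_squared = coefficient * 8.0 * math.pi ** 2 * mass / PLANCK ** 2  # cm
    return coefficient, l_squared


if __name__ == "__main__":
    print(slab_energy_closed(1.0, 2.0, 1.0, 1.0), slab_energy_numeric(1.0, 2.0, 1.0, 1.0))
    print(film_constants())
```

With these assumed constants the coefficient comes out near $1.73 \times 10^{-38}$ and $l^2$ near $2.1 \times 10^{-7}$ cm, in line with the values quoted in the text.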

We now determine the binding energy by a variational method. We are interested
mainly in the case of small binding energies and $t \gg s$. This suggests that we
use $(x+s)^{1/2} J_1\{3.83\,s^{1/2}/(x+s)^{1/2}\}\,e^{-\alpha x}$ as wave function, as it satisfies the
boundary conditions at the origin and at infinity, is obviously of the right form
$e^{-\alpha x}$ for $x$ large, and is of nearly the right form for $x$ small provided that the binding
energy is small. Inserting this function in the usual formula $E = \int \psi^* H \psi\, d\tau \big/ \int \psi^* \psi\, d\tau$
and minimizing with respect to $\alpha$, we obtain the following expression for the binding
energy when this quantity is small compared with the depth of the potential 'wells':


E = ^ 


Z 2 Trhn,\_ \(3*83)*s2 


\ 22-12 


(3-83)2 5 , 


provided 


(3) 


The terms in $1/s^2$ and $1/t^2$ represent respectively the effect of the attractive terms
$l^2/(x+s)^3$ and $-l^2/(x+s+t)^3$, while the term in $1/s$ represents the kinetic energy
that the atom possesses through being in a bound state. It is interesting to see that
this formula contains a term in $1/t^2$ (strictly speaking it is in $1/(s+t)^2$), just like the
de Boer-Michels-Bijl theory, but its origin is entirely different from the term in
the latter theory.

* I am very much indebted to Dr K. R. Atkins for this suggestion.


5. Motion along the film boundary: the effect of gravity

Since the effect of an inverse-cube law of attraction is not the same as what one
deduces on a purely classical basis, we have to examine whether we can use the
classical expression for the gravitational energy. We suppose first that gravity acts
along the $z$ axis, but that the attractive forces do not vary significantly as we move
parallel to the surface. In this case, the wave equation in the $z$-direction can be
solved exactly in terms of Bessel functions of order one-third, the wave function
oscillating extremely rapidly with distance as long as the momentum along the $z$ axis
is real, but dying away to zero extremely rapidly as soon as the momentum becomes
imaginary, so that it is quite safe to use the classical expression for the gravitational
energy. If we make the opposite assumption that the attraction of the film does
vary periodically with the $y$ and $z$ co-ordinates, so that, at any instant, there are,
so to speak, adsorption sites at the surface of the film, then again it is not difficult
to show that the classical expression for the gravitational energy is adequate.
The reasonable assumption is made that, travelling up the film, the binding energy 
of the atoms in the outside layer remains the same, so that the energy mgz has to be 
balanced against the effect of diminishing t in expression (3). Inserting the values 
^ = S-Sx 10”“®cm., = 2-lOx 10“’cm., a = 2-6x 10~®cm., the value for t being 

taken from Dr L. C. Jackson's unpublished measurements,* we arrive at the following
theoretical relation between $z$ and $t$:

h? r . 0*90 xion^ 

32^1_2-06x10-’ + ^ J = constant. (4) 

From Jackson’s measurements, extending to a range of 1 to 2 cm. above the liquid 
surface, one can conclude that dtjdz 10~®. The spread of his measurements prevents 
us from determining the second derivative, but it appears to be small and positive. 
The value of the first derivative deduced from equation (4) is 1*8 x 10“'^, which agrees 
in order of magnitude with observation, but it is doubtful how much significance 
can be attached to this, because the theoretical value changes extremely rapidly 
with the assumed values of $s$ and $l^2$ because of the subtraction of the two terms in
expression (3) involving $1/s^2$ and $1/s$; $l^2$ is, in turn, sensitive to the assumed spacing
of the atoms in the film. Slight disturbing effects due to the attraction of the main
body of the liquid are possible very near its surface, but there does not seem to be
a very pronounced meniscus in a stationary film. Dr Atkins,* from a consideration
of his results on the oscillation of helium films, was led to postulate a law of the simple
type $t \propto A/z$, but the constant $A$ is left undetermined on account of the fact that the
distribution of velocity through the thickness of a moving film is unknown. A law
of the type (4) would conform approximately to a reciprocal law over a limited
range of $z$.

* I am very grateful to Dr Jackson and Dr Atkins for this information.



6. A theory of the variation of film thickness with temperature

The constant that appears in equation (4) may be expected to be some function of 
temperature. At absolute zero, we should expect all bound states whose energies 
are lower than that of the lowest liquid state to be occupied, and the film to thicken 
until they are no longer possible. Our imperfect knowledge of the constants involved 
makes an attempt to improve expression (3) by proceeding to higher approxima- 
tions, as would be necessary in order to calculate such a limiting thickness, hardly 
worth while. As the temperature rises, a slight energy defect will be needed to hold
the atoms on the surface of the film against thermal agitation, and it is this effect 
that we want to examine. 

We take the following simplified model of the film, supposing that in the $y$ direction
it consists of rows of sites of equal energy, each row containing $M$ sites, each of
which sites contains $r$ states of equal energy. (This is equivalent to assuming that
the variation in the attractive field as we move parallel to the surface of the film is
not intense enough to give more than a single, but perhaps $r$ times degenerate, bound
state for each site.) We take account of gravity by supposing that the binding energy
decreases as $z$ increases, and of the diminishing effect of the attraction of the walls
by assuming a decrease in the binding energy as $x$ increases. The energy of a site
whose co-ordinates are $(x, y, z)$ is therefore assumed to be

$$E(x, y, z) = \text{const.} + mgz + q(x). \tag{5}$$

We suppose that the states described by equation (5) are in equilibrium with
a liquid that is capable of a Bose-Einstein condensation, the fundamental property
required being that the lowest energy state in the liquid must be capable of being
occupied any number of times up to $N$, the total number of atoms present. This state
may or may not be separated by an energy gap from the excited liquid states, but
the distribution of these states must be such as to give the condensation effect at
the observed temperature. Now we plainly cannot ascribe to the bound states the
property of being occupiable any number of times, because the formation of a
monomolecular layer only requires a number of atoms of the order of $N^{2/3}$, so that the
formation of a complete film only takes a relatively small number. We therefore make
the assumption that each site can only be occupied once, the occupation of one of
the $r$ states associated with that site preventing the occupation of any of the others.
This assumption is not in any sense equivalent to assuming that the atoms in the
film obey Fermi-Dirac statistics, for it is quite compatible with their description
by symmetrical wave functions. The states in which one or more sites are occupied
twice are not ruled out by any symmetry restrictions, but by the fact that the
repulsive fields of the atoms might be expected to give such states very high
energies.

It is also obvious that the number of sites available in any one layer must depend
on the state of occupation of the layers between it and the wall. We make the
simplest possible assumption, namely, that the number of available sites in the $y$ row
with co-ordinates $(x, z)$ is just the number of atoms occupying the adjoining $y$ row
$(x - q, z)$, where $q$ is the spacing of the rows, so that a vacant site in any layer implies
vacancies in corresponding places in all layers farther away from the wall. We
consider the case of $r$ equal to unity, the generalization for other values being
easily made. Consider the array of sites obtained by taking a section of the film
in an $xy$ plane, so that $z$ is the same for all of these sites. Let $M$ be the number of sites
in a $y$ row. Suppose that the number of occupied sites in the rows counting outwards
from the wall are $N_1, N_2, N_3, \ldots$, and that the energies associated with the occupation
of these sites are $E_1, E_2, E_3, \ldots$. The total number of possible sites in each of these
rows are respectively $M, N_1, N_2, \ldots$, so that the number of possible ways in which an
arrangement with these values of the $N$'s can be realized is
$\binom{M}{N_1}\binom{N_1}{N_2}\binom{N_2}{N_3}\cdots$, or
$\dfrac{M!}{(M-N_1)!\,(N_1-N_2)!\,(N_2-N_3)!\cdots}$, which is the coefficient of
$a^{N_1} b^{N_2} c^{N_3}\cdots$ in the expression $(1 + a + ab + abc + \ldots)^M$. A configuration of the type
$(N_1, N_2, N_3, \ldots)$ contributes $(N_1 + N_2 + \ldots)$ to the total number of atoms, and
$N_1E_1 + N_2E_2 + \ldots$ to the total energy. The array of sites in the $xy$ plane that we are
considering therefore contributes a factor

$$\left(1 + a\,e^{-E_1/kT} + ab\,e^{-(E_1+E_2)/kT} + abc\,e^{-(E_1+E_2+E_3)/kT} + \ldots\right)^M \tag{6}$$

to the partition function. Provided that we are in a region of temperature where
the saddle-point method of Fowler and Darwin can be applied, the condition that
this array should be in equilibrium with the volume states constituting the liquid
is simply $a = b = c = \ldots = \lambda$, where $T$ is the temperature of the liquid and $\lambda$ is the
degeneracy parameter associated with the liquid states. Since the film only contains
a relatively small number of atoms, the mode of occupation of the bound states will
not have any significant effect on the equilibrium properties of the liquid. They will,
however, affect the transport properties of the liquid profoundly, as they may be
expected to occur on any surface to which the liquid has access, even though it is
covered by the liquid. It is hoped to study this point in another paper. Here, we are
only concerned to obtain a prediction of the way in which film thickness may be
expected to vary with temperature, for which purpose we need an estimate of
the degeneracy parameter $\lambda$ for the liquid. In the Bose-Einstein gas model the
degeneracy parameter is not significantly different from unity below the transition
temperature, but this is only because the energy of the very lowest state is practically
zero. The occupation of the lowest state of the liquid varies as $\lambda/(e^{E_0/kT} - \lambda)$, where
$E_0$ is the energy of this state, so that below the transition temperature $\lambda$ should be
practically equal to $e^{E_0/kT}$, while it will fall rapidly from this value as the temperature
rises above the transition point.
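
The counting argument can be verified by brute force for a small array. In the sketch below (illustrative only, with hypothetical function names) each of the $M$ inner sites carries a column of filled sites of some height, which encodes the rule that a site exists only if the one beneath it is occupied. Enumerating all columns and tallying the layer occupations $(N_1, N_2, \ldots)$ should reproduce the product of binomial coefficients.

```python
from collections import Counter
from itertools import product
from math import comb


def count_by_enumeration(m_sites, layers):
    """Tally layer occupations over every assignment of column heights 0..layers."""
    counts = Counter()
    for heights in product(range(layers + 1), repeat=m_sites):
        occupation = tuple(
            sum(1 for h in heights if h >= j) for j in range(1, layers + 1)
        )
        counts[occupation] += 1
    return counts


def count_by_formula(m_sites, occupation):
    """Product C(M, N1) * C(N1, N2) * C(N2, N3) * ... for a given occupation tuple."""
    ways = 1
    available = m_sites
    for n in occupation:
        ways *= comb(available, n)  # comb returns 0 when n exceeds available
        available = n
    return ways


if __name__ == "__main__":
    tallies = count_by_enumeration(4, 3)
    for occupation, ways in sorted(tallies.items()):
        print(occupation, ways, count_by_formula(4, occupation))
```

Occupations that violate $N_1 \ge N_2 \ge \ldots$ never appear in the enumeration, and the formula gives zero for them, so the two counts agree for every tuple.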

Examination of the method of deriving expression (6) shows that each of the $M$
factors is the partition function for a single site on the inside layer nearest the wall,
together with the sites in the outer layers which do not come into existence until
this one is occupied. Thus, the first term corresponds to this site being empty, the
second one to just this site being filled, the third term to this site and the one
immediately outside it being filled, the fourth term to three sites along a line drawn
outwards in the $x$ direction being filled, and so on. The probability of a site in the
$r$th layer, counting outwards, being occupied is thus simply

$$\frac{\sum_{n \ge r} \lambda^n \exp\{-(E_1+E_2+\ldots+E_n)/kT\}}{\sum_{n \ge 0} \lambda^n \exp\{-(E_1+E_2+\ldots+E_n)/kT\}}, \tag{7}$$

the series being continued only until positive values of $E_n$ appear. Unfortunately,
the energies $E_1, E_2, \ldots$ associated with the addition of extra atoms in each layer
cannot be inferred directly from equations (3) and (4), because we have neglected
first of all the attractive forces between atoms in the same layer, secondly, the terms
arising from $\partial^2\psi/\partial y^2$ and $\partial^2\psi/\partial z^2$ (which may also be regarded as zero-point energy
terms analogous to the term involving $1/s$ in equation (3)), and lastly the attraction
of atoms on those nearer the wall, which means that the potential energy of an atom
newly added to a layer is not given precisely by equation (4), but that a correction is
necessary to allow for the 'loosening' effect of the attraction of this atom on the
layers already formed. (Incidentally the last correction is in the right direction to
account for the discrepancy between the calculated and observed 'slope' $dt/dz$ of
the film.) These points have been taken care of qualitatively by the postulate we
made above that the spacing of the atoms in the film is the same as that in the
liquid and is due to effectively the same balance between attractive forces and the
increase of zero-point energy with density. It therefore seems worth while examining
the consequences of various assumptions on these lines.

Assumption A. The energy $E_0$ lies above the energies $E_n$ associated with the film
states. This would result in all the possible bound states being occupied below the
$\lambda$ point so that the thickness of the film should be constant until one goes above this
temperature.

Assumption B, that $E_0$ lies below all the energies $E_n$ associated with the film states,
would mean that the film would disappear at absolute zero because all the atoms
would go into the state $E_0$, so that we seem forced to make

Assumption C, that the energy $E_0$ lies somewhere within the range of energies $E_n$.
At absolute zero the only liquid state occupied is the state $E_0$, and just those film
states will be occupied that result in a gain of energy if an atom is transferred from
the liquid to the film. As the temperature rises, atoms are excited out of the state $E_0$
into higher liquid states, and some of the film states of higher energy than $E_0$ become
occupied, that is, we should expect the film to thicken as the temperature rises,
becoming thinner as we pass through the transition temperature, that is, the film
thickness should pass through a maximum.

As $E_0$ is probably of the order of $-15$ cal./mole, that is, numerically very much
greater than the changes in energy in most of the film layers due to the effects of the
wall and gravity, it is not possible to predict a priori which of these three assumptions
should be made. In § 7 we shall suggest a little evidence in favour of assumption C.
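
Expression (7) is easy to evaluate once a set of layer energies is chosen. The sketch below is a minimal illustration and assumes the form in which the numerator sums $\lambda^n \exp\{-(E_1+\ldots+E_n)/kT\}$ over $n \ge r$ and the denominator sums the same weights over $n \ge 0$. The energies used in the example are hypothetical numbers in units of $kT$, not values from the paper.

```python
import math


def layer_occupation(energies_over_kt, lam):
    """Occupation probability of layers r = 1, 2, ... from expression (7).

    energies_over_kt[j] is E_(j+1)/kT for the (j+1)th layer counting outwards;
    lam is the degeneracy parameter of the liquid.
    """
    weights = [1.0]          # weight of the empty column (n = 0)
    running_energy = 0.0
    for n, energy in enumerate(energies_over_kt, start=1):
        running_energy += energy
        weights.append(lam ** n * math.exp(-running_energy))
    total = sum(weights)
    # A site in layer r is occupied whenever the column height is at least r.
    return [sum(weights[r:]) / total for r in range(1, len(weights))]


if __name__ == "__main__":
    example_energies = [-2.0, -1.0, -0.5]   # hypothetical values of E_n / kT
    for lam in (0.5, 1.0):
        print(lam, layer_occupation(example_energies, lam))
```

In this model the occupation can only decrease from one layer to the next, and lowering $\lambda$ (as happens above the transition temperature) thins the outer layers first.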



7. The 'characteristic velocity' of the film

Various experiments, such as those by Daunt & Mendelsohn (1939), indicate that
the rate of transfer of helium along a surface of given perimeter varies with
temperature, but is almost independent of the pressure-head between the two ends of
the film. On the picture we have drawn, this process should consist mainly of the
jumping of atoms from their existing adsorption sites to neighbouring ones which
are empty, and possibly some 'leap-frogging' to empty sites in layers nearer the wall.
Ceteris paribus, such an effect should be proportional to the minimum perimeter
of the surface. The transition probability for such a process could be calculated
quantum-mechanically if the height of the potential barriers between neighbouring
sites could be estimated, but one would expect it to increase as the film becomes
thicker and one recedes from the walls, when the spacing of the atoms probably
becomes greater, and the barriers consequently lower. If so, most of the transfer
is due to the motion of atoms in the outer layers, and the observed rate of transfer
should provide a rough measure of the height of the potential barriers in the outer
layers, and thus indirectly a measure of the thickness of the film. Thus, the fact that
the rate of transfer seems to pass through a maximum at 1.6 to 1.8° K suggests that
the film thickness passes through a maximum here also. The rate of transfer is
probably a rather sensitive measure of the thickness, and the fact that it falls to
unobservably small values at the transition temperature does not mean that the
thickness has fallen to zero, but merely that the potential barriers are preventing
the transfer from being rapid. The experiments of Kistemaker (1947) prove that
films about 30 atoms thick can be formed even above the transition point.

[Note added in proof. Since the above was written, Dr L. C. Jackson has
kindly shown me the results of some preliminary measurements on the variation
of film thickness with temperature. These do seem to be consistent with the
hypothesis that the thickness passes through a flat maximum somewhere between
1 and 2° K, and thus afford some slight confirmation of the suggestions made
in § 7.]

I wish to thank the Royal Society for the award of the Smithson Research Fellow- 
ship, during the tenure of which this work was begun. I also wish to thank Dr L. C. 
Jackson and Dr K. R. Atkins for communicating their results to me before
publication. 


References

Bijl, A., de Boer, J. & Michels, A. 1941 Physica, 8, 655.
Daunt, J. G. & Mendelsohn, K. 1939 Proc. Roy. Soc. A, 170, 423, 430.
Frenkel, J. 1940 J. Phys. 2, 365.
Keesom, W. H. & Schweers, J. 1941 Physica, 8, 1020.
Kistemaker, J. 1947 Physica, 13, 81.
Mott, N. F. 1949 Phil. Mag. 40, 61.
Schiff, L. I. 1941 Phys. Rev. 59, 838.
Simon, F. 1934 Nature, 133, 529.



Spontaneous emulsification of pure xylene in an aqueous 
solution through mere adsorption of a detergent 
in the interface 

By A. Kaminski and J. W. McBain, F.R.S.

Department of Chemistry, Stanford University, Stanford, California 

{Received 8 March 1949) 

[Plate 16]


Spontaneous emulsification of a pure liquid can occur. This is demonstrated when liquid
hydrocarbons are quietly placed upon the surface of the solution of a suitable detergent.
Here special attention has been given to xylene placed upon moderately dilute solutions of 
dodecylamine hydrochloride. 

In many cases violent disruption of the pure liquid occurs, when it is quietly placed upon 
a soap or detergent solution. 

Examination shows that the emulsified droplets still consist of pure solvent stabilized 
by a coating of adsorbed protective colloid. 

The source of the required energy is the energy of adsorption, as well as solubilization of 
hydrocarbon in the aqueous detergent. 

The emulsion formed consists of spherical droplets which therefore retain a positive 
interfacial tension. 


Introduction

Pure xylene placed upon a detergent solution spontaneously descends into the 
detergent solution and breaks itself up into fine emulsified droplets whilst retaining 
a positive interfacial tension. Here a lighter liquid goes of itself into a heavier 
liquid and enormously increases its surface. 

Having observed this phenomenon, we had to find a source for the energy 
required. 

McBain & Woo (1937), in describing instances of true spontaneous emulsification, 
gave references to the previous literature on this little-known subject, which 
began with the work of Gad in 1878 and Quincke in 1904. 

There have been only two recognized causes of true spontaneous emulsification 
occurring without any external agitation or force. Many illustrations of each were 
found by McBain & Woo. The first group is where at the interface a substance is 
formed which locally reduces the surface tension so markedly and quickly as to 
produce turbulent motion leading to emulsification. This occurs, for example,
when an oil containing a fatty acid is placed on an aqueous solution of alkali, and
soap therefore forms at the interface.

A second group was theoretically envisaged by Rashevsky (1928), where a
substance is dissolved in one phase but is more soluble in the other. Hence, the
dissolved substance diffuses across the interface, but some of the original solvent is
carried by collision and viscous drag into the second liquid, where it remains at
least temporarily emulsified. Examples of this had already been discovered by
Gurwitsch (1913).


Vol. 198. A. (7 September 1949) 




A. Kaminski and J. W. McBain 

The present case is completely different, because no new substance forms at 
the interface, and nothing diffuses across it. 

It may be emphasized that we are using the term ‘spontaneous emulsification’
in its strict sense, in the absence of even the slightest agitation or external
disturbance. This distinguishes it from mere easy emulsification where with low
interfacial tension the liquids are easily emulsified by external mechanical means.

While still at Stanford University, Dr J. Vinograd obtained a doubtfully positive
result from placing Nujol on an aqueous solution of sodium oleate. Miss L. A.
Ehrhardt then discovered, in continuation of such experiments, that pure xylene
placed on an aqueous solution of dodecylamine hydrochloride spontaneously emulsified.
Then in the study of emulsion polymerization at the Shell Development Company,
Vinograd, Fong, Ronay & Sawyer (1944) found that some other hydrocarbons
exhibit the phenomenon. For example, a lens of ethyl benzene placed upon 5 %
aqueous tergitol, or mesitylene placed upon 5 % Intral 231, spontaneously splits
up into innumerable emulsion droplets.

Qualitative observations by Miss L. A. Ehrhardt

With N/10 to N/2 dodecyl ammonium chloride a uniform layer of emulsion forms
at the interface and rapidly penetrates the aqueous phase to a depth of 1 cm. or
more. This can occur within a few seconds. On standing for weeks or months, the
emulsion in N/2 dodecylamine hydrochloride may change into a rigid anisotropic
liquid crystal, with a clear aqueous layer below and a clear layer of excess xylene
above.

The present investigation 
Hydrocarbons other than xylene 

Benzene, toluene, mesitylene and cyclohexane produced spontaneous emulsions
on the surface of an N/5 dodecylamine hydrochloride solution. The rate of emulsification
and the extent of the emulsion layer formed decreased in the following
order: cyclohexane, benzene, toluene, xylene, mesitylene (this is also the order of 
decreasing solubility and also of decreasing solubilization). The stability of the 
emulsion layers formed was greatest for mesitylene and least for cyclohexane. 
With benzene and toluene the emulsion layer penetrates much deeper into the 
aqueous phase than with xylene, but has a much greater tendency to change into 
a rigid gel. 

All the remaining experiments were carried out with xylene (Kahlbaum’s 
purest) and dodecylamine hydrochloride supplied by Armour and Company. 

Hydrocarbons in other detergents 

Spontaneous emulsions have been observed to form in aqueous solutions of 
several other detergents, all of which are good solubilizing agents. Cetyltrimethyl
ammonium bromide, Detergent ‘X’ (a non-ionic polymer of ethylene oxide with
an iso-octyl phenol), and Triton X-100 (similar and also non-ionic) gave violent
emulsification with xylene and cyclohexane. Emulsification proceeded much more 



Spontaneous emulsification of pure xylene 449 

violently in aqueous solutions of Detergent ‘X’ and Triton X-100 than in dodecyl- 
amine hydrochloride. In the solution of cetyltrimethyl ammonium bromide, 
emulsification of xylene was much slower. M. H. McHan at Stanford University 
had noted that toluene emulsifies in aqueous solutions of Aresklene. 

Immobilization of aqueous layer by gelatin 

To avoid any possibility of stirring and convection, N/5 dodecylamine hydrochloride
solution was completely rigidified by the addition of gelatin. When xylene was 
placed upon it, the emulsification nevertheless proceeded, but at a much slower 
rate. It required 4 days to produce an emulsified layer comparable in depth to 
that which forms in the ordinary liquid solution in a quarter of an hour. However, 
the emulsification process did not terminate at this point. Columns of emulsion 
started to shoot out of the emulsion layer and went down to the very bottom of 
the container. After 10 days the whole gelatinized solution had become turbid. 
The presence of emulsion droplets in the rigid jelly was confirmed by microscope
examination. 

Effect of concentration of the detergent 

Spontaneous emulsification is not noticeable in solutions of dodecylamine hydro- 
chloride less than decinormal. 

With decinormal detergent, the emulsification is very slow, requiring hours or 
days for completion. Moreover, the emulsion layer penetrates downwards only 
0.5 to 2 mm. The first signs of emulsification appear about 30 min. after contact
with the xylene in the form of a very faint, hardly perceptible boundary at a
distance of 0.5 to 2 mm. below the interface. The boundary becomes more evident,
and a milky white layer of emulsion begins to form in the space between this 
boundary and the interface. The emulsification continues more and more slowly 
for at least 1 day until a viscous layer of emulsion has formed. The amount of 
emulsion formed is independent of the amount or excess of xylene, provided the 
whole surface is covered. With a microscope it can be seen that drops of emulsion 
shoot out of the interface, travel a certain depth down into the aqueous phase and 
then return up into the xylene. It is during the downward motion of the larger 
drops that smaller ones detach themselves and are left to form the opaque emulsion 
layer. Seen from above the emulsion is white and opaque, but at a glancing angle 
the interface presents a mirror surface. 

With N/5 detergent the emulsion layer is much deeper and it forms in a few
minutes. Emulsification proceeds as long as the aqueous phase is in contact with 
the xylene and is not separated from it by a viscous emulsion layer. 

The process of emulsification of a drop of xylene on N/3 dodecylamine hydrochloride
is shown in figure 1. It illustrates the turbulent flow set up in the detergent
solution. The flow is outwards from the drop of xylene, producing a rising column 
of the solution up to the drop, where the emulsion is cast off. 

With N/2 dodecylamine hydrochloride the whole process is much more violent
and is completed in a few seconds. Even with 0.42 N solution, this rigid layer
begins to form after a few days. 




Figure 2, plate 16, is a photographic record of the emulsification that takes place
when xylene is placed upon four different concentrations of solutions, each in a 
narrow cylinder. Each experiment is in duplicate, one, a, with a few drops of 
xylene covering only a small fraction of the surface, and the other, b, with a column 
of xylene above the aqueous liquid. 

The photographs illustrate vividly the pronounced effect which the concentration 
of the detergent solution exerts upon the rate and intensity of the emulsification 
process. With an N/2 or an N/4 dodecylamine hydrochloride solution, the formation
of the emulsion is noticeable at the instant the xylene is deposited on the surface.
In solutions which are less than N/8 the emulsion layer is perceptible only after
several hours, and it is completely formed only after several days. 



Figure 1. Diagrammatic representation in cross-section of the convection currents set up
when a drop of pure xylene is placed upon a solution of dodecyl ammonium chloride and
begins to emulsify. Labels in the diagram: column of rising C12H25NH2·HCl solution;
clear C12H25NH2·HCl solution (N/3).


Spontaneous emulsification from a small lens of pure xylene on an extended surface 

The foregoing described examples in which a layer of xylene was placed upon 
a layer of aqueous solution in a cylindrical vessel. Other observations were carried 
out on a drop of xylene placed upon a large free surface of aqueous solution. 

Fate of a small drop of hydrocarbon placed upon the surface of an aqueous detergent 

When a drop of hydrocarbon or other only slightly soluble organic liquid is 
placed upon the surface of an aqueous solution of soap, bile salt, or detergent, it 
need not necessarily emulsify. It will be solubilized if its molecular weight is not
too great. In the simplest cases, as with styrene placed upon a soap solution
(Vinograd et al. 1944), the droplet of styrene will retain the shape of a round lens
whilst gradually shrinking until it vanishes, having been completely solubilized.
No emulsion is formed unless external agitation is employed and excess of styrene
is present. 




Figure 2 (plate 16). Spontaneous emulsification after 13 minutes, (a) when a few drops
of xylene are employed and (b) when a complete layer of xylene is employed.






However, in the interesting cases discussed here, the droplet may seem to 
explode in a violent manner to form the emulsion. Alternatively, its boundaries 
may break very irregularly and shoot off a multitude of emulsion droplets. 

The concentration of the detergent solution has an effect not only on the amount 
of emulsion formed, and the rate of emulsification, but also upon the form which 
the dispersed phase takes. If xylene forms a spontaneous emulsion in a dilute 
dodecylamine hydrochloride solution (about N/10), no disruption of the interface
can be observed under a microscope, except in the case of drops of at least several
millimetres in diameter. These break up into smaller drops. The small emulsion
droplets which are eventually formed are perfectly spherical and average about
50 μ in diameter.

In more concentrated solutions of dodecylamine hydrochloride (N/5 to N/2) the
interface between the aqueous phase and the xylene is broken up in many places,
and the xylene streams through the holes into the water phase only to form a new
droplet a certain distance from the old interface. The new drops formed in this
manner are often irregularly shaped; and with very concentrated solutions (0.83 N)
the drops assume myeline forms.

In the cases described, the lens does not spread on the aqueous surface to form 
a uniform visible film, and, under the circumstances, it is difficult to ascertain 
whether it attempts to form a monolayer. Burdon (1940) describes the behaviour 
of a drop of the soluble substance, aniline, on pure water. Here the same violent 
commotion is displayed. But if droplets of emulsion size are thus injected into the 
water, they are not apparent because they dissolve. If the water is already satur- 
ated, the aniline drop remains at rest. 

Temperature and stability of the emulsions 

The effect of temperature has not been fully explored. However, it may determine
whether or not a particular liquid will emulsify spontaneously. In the present
case with xylene on dodecylamine hydrochloride solution there is no spontaneous
emulsification above 30° C, and an emulsion made by agitation tends to break
down. At very low temperatures the emulsion is again unstable. The emulsion is
more stable above 15° C, but the rate of formation is less than at room temperature.
At room temperature the emulsions are extremely stable and can be
broken up only with difficulty, in a powerful centrifuge.

Discussion 

The main factor that produces spontaneous emulsification is the turbulence 
caused by temporary inequalities of interfacial tension. Eddy currents then tear 
off droplets of the oil. Since the detergents under discussion are all good protective 
or stabilizing agents, any small oil droplets will remain suspended as an emulsion.

There is a parallelism between spontaneous emulsification and solubilization, 
and probably also solubility. The oil begins to leave the continuous oil phase, 
partly as a result of its tendency to be solubilized by the aqueous detergent. More- 
over, the detergent, due to adsorption, is much more concentrated in the initial 




interface, and the solubilized complex is known to be formed with positive affinity, 
that is, with decrease of free energy (McBain & McBain 1936; O’Connor & McBain 
1940). 

Source of the energy causing spontaneous emulsification of a pure liquid 

One source of the energy required to submerge a light liquid and enlarge its
interface was determined by the results of a quantitative analysis for each constituent
of the system before and after emulsification.

Data for quantitative determination of each component before and after spontaneous
emulsification to form three layers: xylene, emulsion, clear aqueous layer underneath

A weighed amount of 0.23 N dodecylamine hydrochloride solution was introduced
into a separating funnel and a known amount of p-xylene was deposited on the
surface. The emulsification process was allowed to proceed undisturbed for 24 hr.
Clear xylene was next removed with a hypodermic syringe whose needle was bent
at the end. The clear dodecylamine hydrochloride solution was drawn off from the
bottom and the emulsion layer was washed out with acetone. Two such operations
were necessary to complete the analysis: one for the determination of the amount
of hydrochloric acid in the aqueous phase, and another for the determination of
the total solids content of the aqueous phase. The amount of hydrochloric acid
present was determined by potentiometric titration. The solid content was obtained
from the weight of the residue left on evaporating the aqueous phase to dryness with
an excess of hydrochloric acid. (The evaporation was carried out by heating the
sample in a Petri dish for several hours under a porcelain radiating plate.) The
data are tabulated in table 1.

The result of the quantitative examination is to show that the xylene remains 
pure xylene; the aqueous solution of detergent becomes more dilute; the dodecyl- 
amine and hydrochloride are abstracted into the interface in exactly equivalent 
amounts leaving the solution of the same pH as before. Hence, unchanged deter- 
gent is adsorbed in the interface. 

Concurrently, xylene is being solubilized in the colloidal particles of the detergent.
For this to occur, xylene molecules must move across the interface. It has been 
shown (McBain et al. 1937) that moving molecules carry with them other neigh- 
bouring molecules and can even carry an excess of other molecules into a second 
phase, which would finally reject them. Here such rejection would take the form 
of emulsification. 

Another somewhat similar case of spontaneous emulsification, observed but not
studied, is that indicated by Cofman (1929) only in the title of figure 13 of his
article. We confirm that water placed upon a 10 % solution of ferric chloride
(FeCl3.6H2O) in nitrobenzene exhibits turbulence at the interface and produces
an emulsion of water in the heavier liquid, leaving a clear layer of water on top.

When the detergent solution is already saturated with the hydrocarbon (by
solution and solubilization), liquid hydrocarbon placed thereon does not spontaneously
emulsify.

Previous investigators have recorded a high positive temperature coefficient for 
solubilization, but it is to be noted that their measurements referred only to dyes. 




Table 1. Quantitative analysis of original and final distribution
of components among the phases present

                          weight of components (g.)
                  (1)       (2)       (3)             (4)      (5)
phase             xylene    water     dodecylamine    HCl      total (for      pH
                                                               each phase)

original composition
(A) xylene        12.013    -         -               -        12.013
(B) aqueous       -         73.422    3.057           0.654    77.133          2.1
    total         12.013    73.422    3.057           0.654    89.146

final compositions
(C) xylene        10.199    0.001     0               0        10.200
(D) emulsion       1.464    6.299     0.433           0.087     8.283          2.2
(E) aqueous        0.350    67.122    2.621           0.567    70.660          2.1
    total         12.013    73.422    3.054           0.654    89.143



Calculations and sources of results

A (1) 12.013 g. weighed directly.
B (2) 77.133 - 3.711 = 73.422 g. 3.711 = (0.0481) (77.133). 0.0481 is the ratio of solid to
      liquid content in the solution.
B (3) 3.711 - 0.654 = 3.057 g.
B (4) (0.233) (77.1/1000) (36.5) = 0.654 g.
C (1) 10.199 g. obtained by direct weighing.
C (2) 0.001 g. obtained from the International Critical Tables.
C (3) 0. Evaporation of the extracted xylene layer did not leave any residue.
C (4) 0. If any xylene layer is extracted from the system after the emulsification process
      has reached equilibrium and shaken with some distilled water, the pH of the water
      does not change.
C (5) 10.200 g. obtained by addition.
D (1) 12.013 - 10.199 - 0.350 = 1.464 g.
D (2) 73.422 - 67.122 - 0.001 = 6.299 g.
D (3) 0.520 - 0.087 = 0.433 g.
D (4) 0.654 - 0.567 = 0.087 g.
D (5) 8.283 g. obtained by addition.
E (1) (0.50) (70/100) = 0.350 g. 0.50 g. is the amount of xylene solubilized by 100 g. of the
      dodecylamine hydrochloride solution.
E (2) 70.660 - 3.188 - 0.350 = 67.122 g. 3.188 g. is the weight of the solid residue remaining
      after the extracted aqueous phase has been evaporated with an excess of HCl.
E (3) 3.188 - 0.567 = 2.621 g.
E (4) (0.220) (70.66/1000) (36.46) = 0.567 g. 0.220 is the number of equivalents of HCl
      per 1000 g. of solution remaining in the aqueous phase after the emulsification
      process has reached equilibrium.
E (5) 70.660 g. weighed directly.

Conclusions

(a) The xylene phase remains unchanged by the emulsification process.

(b) The dodecylamine hydrochloride is adsorbed as such from the aqueous solution into the
    interfaces of the emulsion of pure xylene, leaving the pH of the aqueous layer
    unchanged.
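
The bookkeeping of table 1 can be checked mechanically. The following is a minimal sketch, not part of the original analysis: it sums each component over the phases before and after emulsification and compares the amine and acid taken up by the emulsion on a molar basis. The weights are those of the table; the molar masses (185.35 for dodecylamine, 36.46 for HCl) and all function names are assumptions introduced here for illustration.

```python
# Weights in grams, transcribed from table 1 (columns 1 to 4).
COMPONENTS = ("xylene", "water", "dodecylamine", "HCl")

ORIGINAL = {
    "A xylene": (12.013, 0.0, 0.0, 0.0),
    "B aqueous": (0.0, 73.422, 3.057, 0.654),
}
FINAL = {
    "C xylene": (10.199, 0.001, 0.0, 0.0),
    "D emulsion": (1.464, 6.299, 0.433, 0.087),
    "E aqueous": (0.350, 67.122, 2.621, 0.567),
}

# Assumed molar masses in g/mol; these are not quoted in the paper.
MOLAR_MASS_DODECYLAMINE = 185.35
MOLAR_MASS_HCL = 36.46


def component_totals(phases):
    """Sum each component over all phases of one state of the system."""
    return tuple(
        sum(row[i] for row in phases.values()) for i in range(len(COMPONENTS))
    )


def phase_totals(phases):
    """Total weight of each phase (column 5 of the table)."""
    return {name: sum(row) for name, row in phases.items()}


def amine_to_acid_mole_ratio(row):
    """Moles of dodecylamine per mole of HCl in one phase."""
    amine, acid = row[2], row[3]
    return (amine / MOLAR_MASS_DODECYLAMINE) / (acid / MOLAR_MASS_HCL)


if __name__ == "__main__":
    before = component_totals(ORIGINAL)
    after = component_totals(FINAL)
    for name, b, a in zip(COMPONENTS, before, after):
        print(f"{name:13s} before {b:7.3f} g  after {a:7.3f} g  change {a - b:+.3f} g")
    ratio = amine_to_acid_mole_ratio(FINAL["D emulsion"])
    print(f"amine/HCl mole ratio in the emulsion: {ratio:.3f}")
```

Xylene, water and HCl balance to the last figure, and dodecylamine to within 0.003 g. Under the assumed molar masses the amine and acid found in the emulsion layer are within a few per cent of equimolar, which is the sense of conclusion (b).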

In the Stanford Research Institute our collaborators, Dr A. P. Brady and 
Mrs H. Huff (nee McHan), find that the temperature coefficient of solubilization of 
a liquid hydrocarbon is strongly negative. We find this to be especially the case in 
our experiments with xylene in dodecylamine hydrochloride ; the slightest warming 
of the saturated solution produces turbidity. 





A new phenomenon was also discovered by Brady and Mrs Huff. If benzene is 
added to a clear aqueous solution of dodecylamine hydrochloride, the solution
remains clear until it is apparently saturated with benzene; then it becomes turbid.
Surprisingly, further addition of benzene clears the solution again until it is finally
saturated. These two zones of clarity exist only in the neighbourhood of room 
temperature and not above about 35° C. The nature of the intermediate turbidity 
remains to be determined. The phenomena are reversible. 

References

Burdon, R. S. 1940 Surface tension and the spreading of liquids, p. 64. Cambridge Physical
Tracts. Cambridge University Press.

Cofman, V. 1929 J. Chem. Educ. 6, 1095.

Gad, J. 1878 Arch. Anat. Physiol., Lpz., p. 181.

Gurwitsch, L. 1913 Wissenschaftliche Grundlagen der Erdölbearbeitung, p. 200; translation
by Moore (1933), p. 430. London: Chapman and Hall, Ltd.

McBain, J. W. & McBain, E. L. 1936 J. Amer. Chem. Soc. 58, 2610.

McBain, J. W. & Woo, Ts-Ming 1937 Proc. Roy. Soc. A, 163, 182.

O’Connor, J. J. & McBain, J. W. 1940 J. Amer. Chem. Soc. 62, 2855.

Quincke, G. 1904 Ann. Phys., Lpz., 15, 1.

Rashevsky, N. 1928 Z. Phys. 46, 568.

Vinograd, J. R., Fong, L. L., Ronay, G. S. & Sawyer, W. M. 1944 Studies in emulsion 
polymerization. Presented before the Division of Colloid Chemistry, 108th meeting of the 
American Chemical Society, New York, 13 September. 


The diffraction of blast. I 

By M. J. Lighthill, Department of Mathematics, University of Manchester
(Communicated by S. Goldstein, F.R.S. — Received 25 November 1948)

The behaviour of a plane shock, of any strength, travelling along a wall, when it reaches a
corner where the wall turns through a small angle $\delta$, is investigated mathematically by use
of a linearized theory of anisentropic flow (§§ 2 to 5). At a convex corner pure diffraction
occurs; at a concave corner Mach reflexion (§ 8). The shape of the shock (§ 6) and the
pressure distribution over the wall (§ 7) are calculated for a variety of shock Mach numbers
from 1 to $\infty$. The connexion, for weak shocks, with acoustic theory, is displayed (§ 9).
The work will be used in later parts to assist in a hypothetical description of the flow when $\delta$
is not small.


1. Introduction

Let a plane shock of any strength move uniformly into still air, which is bounded
by a plane wall perpendicular to the shock. Suppose that at a certain line (parallel
to the shock) this wall joins another plane wall so that the two form a corner convex
to the flow, of angle say $\pi - \delta$. When the shock reaches this line it will be diffracted
and thereafter will not be plane, being weaker near the wall than elsewhere. In using
mathematical physics to predict what will happen, viscous stresses and conduction
of heat can reasonably be neglected, since neither will have time to produce substantial
effects during the extremely rapid passage of the shock. In consequence
the only physical constants defining the problem will be $U$, the original shock velocity,
$p_0$ and $\rho_0$, the pressure and density in still air, and $\delta$. Since these cannot be combined
to produce a fundamental length- or time-scale, it follows that while, in this two-dimensional
problem, each physical quantity is a function of the two space co-ordinates
$X$, $Y$ (with the corner as origin) and the time $t$, these variables can only
occur therein in the combinations $X/t$ and $Y/t$. In this part it will be assumed that
$\delta$, and hence also the variations in velocity and pressure behind the shock, are small.
By taking $\delta$ negative the behaviour of the shock at a concave corner can also be
studied. Previous work has been confined to the case of a weak shock (Sommerfeld
1901).

2. Mathematical formulation 

Let the velocity, pressure, and density behind the shock before it reaches the
corner, be $q_1$, $p_1$ and $\rho_1$; then by conservation of mass, momentum and energy at
the shock,

$$\rho_1(U - q_1) = \rho_0 U, \qquad \rho_0 U q_1 = p_1 - p_0, \qquad p_1 q_1 = \rho_0 U\left[\tfrac{1}{2}q_1^2 + \tfrac{5}{2}\left(\frac{p_1}{\rho_1} - \frac{p_0}{\rho_0}\right)\right], \tag{1}$$

assuming that air is a perfect gas with adiabatic index $\gamma = 1.4$ (so that $1/(\gamma - 1) = \tfrac{5}{2}$),
which is a reasonable approximation in this problem for shock pressure ratios up to
about 30 (for values in excess of which the problem is of much smaller practical
importance).

Equations (1) solved for $q_1$, $p_1$, $\rho_1$ give

$$q_1 = \frac{5}{6}U\left(1 - \frac{a_0^2}{U^2}\right), \qquad p_1 = \frac{5}{6}\rho_0\left(U^2 - \frac{1}{7}a_0^2\right), \qquad \rho_1 = \frac{6\rho_0}{1 + 5a_0^2/U^2}, \tag{2}$$

where $a_0 = (7p_0/5\rho_0)^{1/2}$ is the velocity of sound in still air. Let $M$ be the Mach number
of the shock, $U/a_0$, which must exceed unity, and let $M_1$ be the Mach number
of the uniform flow behind the shock, so that

$$M_1 = q_1(7p_1/5\rho_1)^{-1/2} = \frac{5(M^2 - 1)}{[(7M^2 - 1)(M^2 + 5)]^{1/2}}. \tag{3}$$

Then this flow is supersonic ($M_1 > 1$) when $M > (7 + \sqrt{34})^{1/2}\,3^{-1/2} = 2.068\ldots$, and subsonic
when $M < 2.068\ldots$; these cases will frequently need to be distinguished as the
‘supersonic case’ and the ‘subsonic case’.
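
The shock relations of this section lend themselves to a quick numerical check. The sketch below is an illustration only: it assumes that equations (2) read $q_1 = 5(U^2 - a_0^2)/6U$, $p_1 = \tfrac{5}{6}\rho_0(U^2 - a_0^2/7)$, $\rho_1 = 6\rho_0/(1 + 5a_0^2/U^2)$, and it tests them against the mass and momentum balances together with an energy balance written in one standard form (the work done by $p_1$ equated to the gain of kinetic and internal energy). That form of the energy balance, the function names and the sample values are all assumptions introduced here.

```python
import math

GAMMA = 1.4  # adiabatic index assumed in the text


def behind_shock(U, p0, rho0):
    """Velocity, pressure and density behind a plane shock of speed U (equations (2))."""
    a0_sq = GAMMA * p0 / rho0
    q1 = 5.0 * (U**2 - a0_sq) / (6.0 * U)
    p1 = (5.0 / 6.0) * rho0 * (U**2 - a0_sq / 7.0)
    rho1 = 6.0 * rho0 / (1.0 + 5.0 * a0_sq / U**2)
    return q1, p1, rho1


def conservation_residuals(U, p0, rho0):
    """Residuals of the mass, momentum and energy balances across the shock."""
    q1, p1, rho1 = behind_shock(U, p0, rho0)
    mass = rho1 * (U - q1) - rho0 * U
    momentum = rho0 * U * q1 - (p1 - p0)
    # Assumed form: internal energy per unit mass is p / ((GAMMA - 1) * rho) = 2.5 * p / rho.
    energy = p1 * q1 - rho0 * U * (0.5 * q1**2 + 2.5 * (p1 / rho1 - p0 / rho0))
    return mass, momentum, energy


def flow_mach_number(M):
    """Mach number M1 of the uniform flow behind a shock of Mach number M."""
    return 5.0 * (M**2 - 1.0) / math.sqrt((7.0 * M**2 - 1.0) * (M**2 + 5.0))


# Shock Mach number at which the flow behind the shock is exactly sonic.
M_CRITICAL = math.sqrt((7.0 + math.sqrt(34.0)) / 3.0)


if __name__ == "__main__":
    # Illustrative sea-level values; not taken from the paper.
    U, p0, rho0 = 680.0, 101325.0, 1.225
    print("residuals:", conservation_residuals(U, p0, rho0))
    print("critical shock Mach number:", M_CRITICAL)
    print("M1 at the critical value:", flow_mach_number(M_CRITICAL))
```

All three residuals vanish to rounding error, and `flow_mach_number` passes through unity at the shock Mach number quoted in the text, which separates the subsonic case from the supersonic case.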

After diffraction let the velocity, pressure, density and entropy at any point be
$\mathbf{q}_2$, $p_2$, $\rho_2$, $s_2$. Choose $(X, Y)$ axes with origin at the corner and $X$-axis along the
original wall produced. If $D/Dt = \partial/\partial t + \mathbf{q}_2\cdot\nabla$ signifies time-rate-of-change for a given
fluid element, the equations of conservation of mass and momentum can be written

$$\frac{D\rho_2}{Dt} + \rho_2\,\mathrm{div}\,\mathbf{q}_2 = 0, \qquad \frac{D\mathbf{q}_2}{Dt} + \frac{1}{\rho_2}\,\mathrm{grad}\,p_2 = 0, \tag{4}$$

and if there is no heat transfer between fluid elements by friction, conduction or
radiation the entropy will satisfy $Ds_2/Dt = 0$.





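On the assumption that $\mathbf{q}_2$, $p_2$, $\rho_2$ differ only by small quantities from the values
$(q_1, 0)$, $p_1$, $\rho_1$ which they had before diffraction, the equations of motion can be
approximated as

$$\frac{\partial\rho_2}{\partial t} + q_1\frac{\partial\rho_2}{\partial X} + \rho_1\,\mathrm{div}\,\mathbf{q}_2 = 0, \qquad \frac{\partial\mathbf{q}_2}{\partial t} + q_1\frac{\partial\mathbf{q}_2}{\partial X} + \frac{1}{\rho_1}\,\mathrm{grad}\,p_2 = 0, \qquad \frac{\partial s_2}{\partial t} + q_1\frac{\partial s_2}{\partial X} = 0. \tag{5}$$

The entropy and density variations can now be eliminated from the entire problem,
since by virtue of the last equation $\partial\rho_2/\partial t + q_1\,\partial\rho_2/\partial X$ can be replaced by

$$a_1^{-2}\left(\frac{\partial p_2}{\partial t} + q_1\frac{\partial p_2}{\partial X}\right), \tag{6}$$

where $a_1 = (7p_1/5\rho_1)^{1/2}$ is the velocity of sound in the uniform flow behind the shock,
owing to the thermodynamic principle that density is a function of pressure and of
entropy alone.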

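If now the transformations

$$\frac{X - q_1 t}{a_1 t} = x, \qquad \frac{Y}{a_1 t} = y, \qquad \mathbf{q}_2 = q_1(1 + u,\; v), \qquad \frac{p_2 - p_1}{a_1 q_1 \rho_1} = p \tag{7}$$

be made, and the fact, shown in § 1, that $u$, $v$, $p$ depend only on $x$ and $y$, be used, then
the first two equations of (5) become

$$x\frac{\partial p}{\partial x} + y\frac{\partial p}{\partial y} = \frac{\partial u}{\partial x} + \frac{\partial v}{\partial y}, \qquad x\frac{\partial u}{\partial x} + y\frac{\partial u}{\partial y} = \frac{\partial p}{\partial x}, \qquad x\frac{\partial v}{\partial x} + y\frac{\partial v}{\partial y} = \frac{\partial p}{\partial y}. \tag{8}$$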


A word is necessary on the position of the axes for which the equations of motion
take the simple form (8). The origin is at a point on the original wall produced. The
part of the shock which is still straight lies along a fixed line

$$x = \frac{U - q_1}{a_1} = \left(\frac{M^2 + 5}{7M^2 - 1}\right)^{1/2} = k. \tag{9}$$

The corner is at $(-M_1, 0)$.

The conditions at a point immediately behind the diffracted shock will depend on
the local velocity of the shock normal to itself; they will be given by the right-hand
sides of equation (2) if $U$ be replaced therein by this velocity, the direction of flow
behind the shock being normal to it. Now in the $(X, Y)$ system of co-ordinates each
point on the shock moves with velocity vector $(X/t, Y/t)$, since the whole field suffers
a uniform expansion in time about the corner. Hence the velocity of the shock normal
to itself at any point is $\mathbf{h}$, where $t\mathbf{h}$ is the vector perpendicular drawn from the corner
to the tangent to the shock at that point. In terms of $\mathbf{h}$ the boundary conditions
at the shock are

$$\mathbf{q}_2 = \frac{5}{6}\mathbf{h}\left(1 - \frac{a_0^2}{h^2}\right), \qquad p_2 = \frac{5}{6}\rho_0\left(h^2 - \frac{1}{7}a_0^2\right). \tag{10}$$

Let the equation of the shock in the new co-ordinates be $x = k + f(y)$; then $f(y)$
will presumably be uniformly small if $\delta$ is small. On this assumption of a nearly
straight shock

$$t\mathbf{h} \approx (X - Y\,dX/dY,\; -X\,dX/dY); \tag{11}$$

but $X = [U + a_1 f(y)]\,t$ and $Y = a_1 t y$; hence $\mathbf{h}$ can be taken as

$$\mathbf{h} = (U + a_1 f(y) - a_1 y f'(y),\; -U f'(y)). \tag{12}$$




Hence, combining (7), (10) and (12), the approximate shock boundary condition is
deduced that, on $x = k$,

$$v = -f'(y), \qquad p = \frac{2\rho_0 U^2}{\rho_1(U^2 - a_0^2)}\,[f(y) - y f'(y)], \qquad u = \frac{a_1(U^2 + a_0^2)}{U(U^2 - a_0^2)}\,[f(y) - y f'(y)]. \tag{13}$$

Hence, on $x = k$,

$$u = Ap, \qquad y\,\partial v/\partial y = B\,\partial p/\partial y, \tag{14}$$

where

$$A = \frac{a_1\rho_1(U^2 + a_0^2)}{2\rho_0 U^3} = \frac{M^2 + 1}{2M^2}\left(\frac{7M^2 - 1}{M^2 + 5}\right)^{1/2}, \qquad B = \frac{\rho_1(U^2 - a_0^2)}{2\rho_0 U^2} = \frac{3(M^2 - 1)}{M^2 + 5}. \tag{15}$$

Conversely, if equations (14) hold, there is a function $f(y)$ satisfying (13); hence
equations (14) may be taken as the complete shock boundary condition.

The problem is now reduced to mathematical terms. The equations (8) must be
solved under the boundary conditions that, on $y = 0$, $v = -\delta$ for $x > -M_1$ and $v = 0$
for $x < -M_1$; that on $x = k$ equations (14) hold, and that on the remaining boundary
between the uniform flow and the disturbed flow $u = v = p = 0$.
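
The constants entering the shock boundary condition depend on the shock Mach number alone, and can be tabulated directly. The sketch below is illustrative and rests on stated assumptions: it takes $k = [(M^2 + 5)/(7M^2 - 1)]^{1/2}$, $A = \tfrac{1}{2}M^{-2}(M^2 + 1)[(7M^2 - 1)/(M^2 + 5)]^{1/2}$ and $B = 3(M^2 - 1)/(M^2 + 5)$, and compares them with the same quantities built from the flow behind the shock, in units where $a_0 = \rho_0 = 1$. The function names are hypothetical.

```python
import math

GAMMA = 1.4  # adiabatic index assumed in the text


def shock_constants(M):
    """k, A and B as closed-form functions of the shock Mach number M."""
    k = math.sqrt((M**2 + 5.0) / (7.0 * M**2 - 1.0))
    A = (M**2 + 1.0) / (2.0 * M**2) * math.sqrt((7.0 * M**2 - 1.0) / (M**2 + 5.0))
    B = 3.0 * (M**2 - 1.0) / (M**2 + 5.0)
    return k, A, B


def shock_constants_from_flow(M):
    """The same constants from q1, p1, rho1, a1, in units with a0 = rho0 = 1."""
    U = M  # because a0 = 1
    q1 = 5.0 * (U**2 - 1.0) / (6.0 * U)
    p1 = (5.0 / 6.0) * (U**2 - 1.0 / 7.0)
    rho1 = 6.0 / (1.0 + 5.0 / U**2)
    a1 = math.sqrt(GAMMA * p1 / rho1)
    k = (U - q1) / a1
    A = a1 * rho1 * (U**2 + 1.0) / (2.0 * U**3)
    B = rho1 * (U**2 - 1.0) / (2.0 * U**2)
    return k, A, B


if __name__ == "__main__":
    for M in (1.0, 1.5, 2.0, 3.0, 5.0):
        k, A, B = shock_constants(M)
        print(f"M = {M:3.1f}   k = {k:.4f}   A = {A:.4f}   B = {B:.4f}")
```

In the weak-shock limit $M \to 1$ these give $k = 1$, $A = 1$, $B = 0$, so that the straight part of the shock then lies on the tangent to the unit circle, as acoustic theory would lead one to expect.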


3. Elimination of u and v 

Now equations (8) occur also in a problem which recently has received much attention,
following Busemann (1943). This is that of steady supersonic flow, in which
departures from uniformity are small, entropy variations are neglected, and the
velocities are constant on all straight lines through the origin (the so-called cone-field
problem). If here $u$, $v$ are the disturbance velocities in the $x$, $y$ directions
(perpendicular to the main stream), where $x$, $y$ and also $p$, the pressure, are suitably
altered in scale, then equations (8) hold, together (however) with the condition of
irrotationality $\partial v/\partial x = \partial u/\partial y$ which is absent in the problem of § 2.

An immediate consequence of equations (8) is that

$$\left(x\frac{\partial}{\partial x} + y\frac{\partial}{\partial y} + 1\right)\left(x\frac{\partial p}{\partial x} + y\frac{\partial p}{\partial y}\right) = \frac{\partial^2 p}{\partial x^2} + \frac{\partial^2 p}{\partial y^2}, \tag{16}$$

a second-order equation in $p$. When the condition of irrotationality holds, the same
equation is satisfied by $u$ and by $v$. But in its absence $u$ and $v$ satisfy third-order
equations. This indicates that the present problem should be solved in terms of $p$
alone (which in any case is the most interesting variable), by eliminating $u$ and $v$.
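
The elimination of $u$ and $v$ takes only a few lines. The following is a sketch of one route, assuming equations (8) in the form $Dp = u_x + v_y$, $Du = p_x$, $Dv = p_y$, where the shorthand $D = x\,\partial_x + y\,\partial_y$ is introduced here and is not the author's notation.

```latex
% Shorthand (an assumption of this sketch): D = x \partial_x + y \partial_y,
% so that equations (8) read  D p = u_x + v_y,  D u = p_x,  D v = p_y.
\begin{align*}
\partial_x D &= (D + 1)\,\partial_x, \qquad \partial_y D = (D + 1)\,\partial_y
  && \text{(commutation rule)} \\
(D + 1)\,D p &= (D + 1)\,u_x + (D + 1)\,v_y
  && \text{(apply } D + 1 \text{ to the first of (8))} \\
 &= \partial_x (D u) + \partial_y (D v)
  && \text{(commutation rule)} \\
 &= p_{xx} + p_{yy}
  && \text{(second and third of (8))}
\end{align*}
```

The result is a second-order equation in $p$ alone. In polar co-ordinates $D = r\,\partial/\partial r$, so the left-hand side becomes $(r\,\partial_r + 1)(r\,\partial_r p)$.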

As Busemann observed, equation (16) is hyperbolic for $x^2 + y^2 > 1$ and elliptic for
$x^2 + y^2 < 1$; its characteristics are all the tangents to the circle $x^2 + y^2 = 1$; no other
line, except an arc of the circle itself, can be part of a boundary between a region
where some solution of (16) is constant and one where it is not. It is reasonable to
assume that the region of non-uniform flow (behind the shock) is the smallest region
consistent with this fact and with the fact that the corner is either inside it or on its
boundary. This means that in the subsonic case (when the corner $(-M_1, 0)$ is inside
the unit circle) this boundary is an arc of the circle, and that in the supersonic case
it is the tangent from the corner to the circle plus an arc of the circle. (We shall find
that this hypothesis yields a solution; had it not, it would have been necessary to
allow a larger disturbed region.)

On the $x$-axis $v$ is a step-function, and $\partial v/\partial x = 0$ except at the corner where it
does not exist; also $y = 0$; hence by (8) $\partial p/\partial y = 0$.

On the part of the circle $x^2 + y^2 = 1$ with $y > 0$ and $x < k$, in the subsonic case
(when this constitutes the whole ‘third boundary’) $p = 0$; but in the supersonic case
this is only true beyond the point of contact of the tangent from $(-M_1, 0)$, i.e. for
$x > -M_1^{-1}$. In the region between the tangent and the circle disturbances are propagated
along characteristics; but the only disturbance is a concentrated one
emanating from the corner; hence the flow in this region is a uniform one parallel
to the wall, separated from the previous uniform flow by a discontinuous expansion,
which is of course the approximation given by the linearized theory to a Prandtl-Meyer
expansion. (Steady flow obtains in this region; the only ‘unsteadiness’
associated with the region is the fact that it is constantly growing.) Hence on the
unit circle, for $x < -M_1^{-1}$, the pressure has the value given by a linear approximation
to the Prandtl-Meyer expansion at the corner, i.e. $p_2 = p_1 - \rho_1 q_1^2\,\delta\,(M_1^2 - 1)^{-1/2}$, or

$$p = -M_1 (M_1^2 - 1)^{-1/2}\,\delta. \tag{17}$$

Thus the boundary values of $p$ on the circle are discontinuous in the supersonic case.
The corresponding property in the subsonic case is that, at the corner ($x = -M_1$),
though $(\partial p/\partial y)_{y=0} = 0$ on both sides,

$$\lim_{y\to 0}\int_{-M_1-0}^{-M_1+0}\frac{\partial p}{\partial y}\,dx = -M_1\int_{-M_1-0}^{-M_1+0}\frac{\partial v}{\partial x}\,dx = M_1\delta, \tag{18}$$

using equations (8) and the fact that $v$ increases by $-\delta$ at the corner.

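From the shock boundary conditions (14) and equations (8) only one condition
on $p$ can be deduced, namely, that

$$k\left(k\frac{\partial p}{\partial x} + y\frac{\partial p}{\partial y}\right) = \frac{\partial p}{\partial x} - \left(Ay - \frac{Bk}{y}\right)\frac{\partial p}{\partial y} \tag{19}$$

on $x = k$. (The intermediate step is that both sides are equal to
$\partial p/\partial x - y\,\partial u/\partial y + k\,\partial v/\partial y$.)
No other differential condition is independent of (19) when taken together with (16).
However, it is necessary that $v = -\delta$ at $(k, 0)$, and hence that (if the integral be taken
along the shock from the wall to where it becomes straight)

$$B\int \frac{1}{y}\frac{\partial p}{\partial y}\,dy = \delta. \tag{20}$$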

When $p$ has been obtained, $u$ and $v$ will be deducible from the second and third of
equations (8), which specify their rate of change along radii vectores, starting from
their values on the shock, which are deducible from those of $p$, and their values on
the unit circle, which are known to be zero (except on part of it, in the supersonic
case, when they are given by the linearized theory of the Prandtl-Meyer expansion).
Since $v = -\delta$ at points of the wall on both sides of the origin (by conditions (17), (18)
and (20)) and is constant on the two radii vectores with $y = 0$, this boundary condition
must be satisfied all along the wall. At $x = y = 0$ both $u$ and $v$ (but not $p$) are
discontinuous, being different on the different radii vectores. But this implies no
physical difficulty, since the acceleration of a fluid element,

$$-q_1 t^{-1}\left(x\,\partial u/\partial x + y\,\partial u/\partial y,\; x\,\partial v/\partial x + y\,\partial v/\partial y\right),$$

is finite. This is because the motion of the element in the reduced plane $(x, y)$ is
proportional to the vector $-(x, y)$. More difficult is the fact that if (as will appear for
all values of $M$) $\partial p/\partial x \neq 0$ at $x = y = 0$, $u$ must be logarithmically infinite at this point,
by (8) (though the acceleration will still be finite); this conflicts with the assumption
of small perturbations. However, such a logarithmic infinity occurs in some other
‘cone-field’ problems, and also in simpler problems treated by the linearized theory
where the solution is known to be correct elsewhere and to possess a finite peak
corresponding to the logarithmic infinity. The pressure will be found to be bounded
(except at the corner in the subsonic case, which is also well-authenticated in other
problems), and this is a strong indication of correct approximation of the truth.

The boundary conditions appropriate to equation (16) for $p$ are now complete in
the domain $x^2 + y^2 < 1$, $y > 0$, $x < k$; in this domain it is elliptic, and Busemann showed
how it can be transformed into Laplace's equation.


4. Busemann's transformation


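In polar co-ordinates with $x = r\cos\theta$, $y = r\sin\theta$, equation (16) becomes

$$\frac{\partial^2 p}{\partial r^2} + \frac{1}{r}\frac{\partial p}{\partial r} + \frac{1}{r^2}\frac{\partial^2 p}{\partial\theta^2} = \left(r\frac{\partial}{\partial r} + 1\right)\left(r\frac{\partial p}{\partial r}\right), \qquad (21)$$

and with $\rho = [1 - (1 - r^2)^{1/2}]/r$ this becomes Laplace's equation

$$\frac{\partial^2 p}{\partial\rho^2} + \frac{1}{\rho}\frac{\partial p}{\partial\rho} + \frac{1}{\rho^2}\frac{\partial^2 p}{\partial\theta^2} = 0 \qquad (22)$$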


in $(\rho, \theta)$ as polar co-ordinates; while the circle $r = 1$ becomes the circle $\rho = 1$, since
$\rho$ increases from 0 to 1 monotonically as $r$ does so. Further, $r = 2\rho/(1 + \rho^2)$, so that
the line $x = k$ becomes an arc of the circle $2\rho\cos\theta = k(1 + \rho^2)$, cutting $\rho = 1$
orthogonally at $\cos\theta = k$.
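The change of radial variable described above can be checked numerically. The following is a minimal sketch, not part of the original paper: the function names are hypothetical, and it assumes only the two relations quoted in the text, $\rho = [1 - (1 - r^2)^{1/2}]/r$ and $r = 2\rho/(1 + \rho^2)$, together with the statement that the line $x = k$ maps to the circle $2\rho\cos\theta = k(1 + \rho^2)$.

```python
import math


def rho_from_r(r: float) -> float:
    """Busemann radius rho = [1 - sqrt(1 - r^2)] / r, valid for 0 <= r <= 1."""
    if r == 0.0:
        return 0.0  # limiting value, since rho ~ r/2 for small r
    return (1.0 - math.sqrt(1.0 - r * r)) / r


def r_from_rho(rho: float) -> float:
    """Inverse relation r = 2 rho / (1 + rho^2)."""
    return 2.0 * rho / (1.0 + rho * rho)


def shock_arc_residual(k: float, theta: float) -> float:
    """Residual of 2 rho cos(theta) = k (1 + rho^2) for the point of x = k at angle theta."""
    r = k / math.cos(theta)  # the line x = k in polar form; requires cos(theta) >= k
    rho = rho_from_r(r)
    return 2.0 * rho * math.cos(theta) - k * (1.0 + rho * rho)
```

A residual of zero (to rounding) for any admissible angle confirms that points of the straight shock $x = k$ lie on the stated circular arc in the $(\rho, \theta)$ plane.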

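The shock boundary condition (19) is transformed into the new co-ordinates by
use of the relation

$$\frac{\partial p/\partial x}{\partial p/\partial y} = \frac{\dfrac{\partial p}{\partial r}\cot\theta - \dfrac{1}{r}\dfrac{\partial p}{\partial\theta}}{\dfrac{\partial p}{\partial r} + \dfrac{1}{r}\dfrac{\partial p}{\partial\theta}\cot\theta} = \frac{\left(r\rho^{-1}\dfrac{d\rho}{dr}\right)\dfrac{\partial p}{\partial\rho}\cot\theta - \dfrac{1}{\rho}\dfrac{\partial p}{\partial\theta}}{\left(r\rho^{-1}\dfrac{d\rho}{dr}\right)\dfrac{\partial p}{\partial\rho} + \dfrac{1}{\rho}\dfrac{\partial p}{\partial\theta}\cot\theta}. \qquad (23)$$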

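But if $dn$ and $ds$ are elements normal and tangential to the circular arc

$$2\rho\cos\theta = k(1 + \rho^2),$$

respectively towards its centre and away from the line $\theta = 0$, and $\phi$ is the acute
angle made by the arc with the radius vector, then

$$\frac{\partial p/\partial\rho}{\rho^{-1}\,\partial p/\partial\theta} = \frac{(\partial p/\partial s)\cot\phi + \partial p/\partial n}{\partial p/\partial s - (\partial p/\partial n)\cot\phi}. \qquad (24)$$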

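Hence

$$\frac{\partial p/\partial x}{\partial p/\partial y} = \frac{[(r\rho^{-1}d\rho/dr)\cot\phi\cot\theta - 1]\,(\partial p/\partial s) + [(r\rho^{-1}d\rho/dr)\cot\theta + \cot\phi]\,(\partial p/\partial n)}{[(r\rho^{-1}d\rho/dr)\cot\phi + \cot\theta]\,(\partial p/\partial s) + [(r\rho^{-1}d\rho/dr) - \cot\phi\cot\theta]\,(\partial p/\partial n)}. \qquad (25)$$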

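But $r\rho^{-1}\,d\rho/dr = (1 + \rho^2)/(1 - \rho^2)$, and on the arc this is $\cos\theta\,(\cos^2\theta - k^2)^{-1/2}$. Also the
geometry of the triangle formed by the origin, a point $(\rho, \theta)$ on the arc, and the point
$(k^{-1}, 0)$ which is the centre of the arc, shows that $\cos\phi = (1 - k^2)^{-1/2}\sin\theta$; hence
$r\rho^{-1}\,d\rho/dr = \cot\phi\cot\theta$, and the right-hand side of (25) simplifies to

$$\frac{\cot^2\phi\cot^2\theta - 1 + \csc^2\theta\cot\phi\,\dfrac{\partial p/\partial n}{\partial p/\partial s}}{\cot\theta\,\csc^2\phi}. \qquad (26)$$

The left-hand side being given by (19), (25) now becomes

$$\frac{(A + k)\,y - Bk\,y^{-1}}{1 - k^2} = \frac{k^2}{1 - k^2}\tan\theta + \frac{(1 - k^2\sec^2\theta)^{1/2}}{1 - k^2}\,\frac{\partial p/\partial n}{\partial p/\partial s}, \qquad (27)$$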

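and (with $y$ replaced by its value $k\tan\theta$ on $x = k$) the condition satisfied by $p$ on
the arc $2\rho\cos\theta = k(1 + \rho^2)$ becomes

$$\frac{\partial p/\partial n}{\partial p/\partial s} = \frac{Ak\tan\theta - B\cot\theta}{(1 - k^2\sec^2\theta)^{1/2}}. \qquad (28)$$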
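The trigonometric simplification leading from (25) to (27) can be spot-checked numerically. The sketch below is illustrative only: the function names are hypothetical, `q` stands for the ratio $(\partial p/\partial n)/(\partial p/\partial s)$, and the two formulas coded are the right-hand side of (25) and the simplified form $[k^2\tan\theta + (1 - k^2\sec^2\theta)^{1/2}\,q]/(1 - k^2)$, both read from the surrounding text as an assumption.

```python
import math


def rhs_of_25(k: float, theta: float, q: float) -> float:
    """Right-hand side of (25) on the shock arc, with q = (dp/dn)/(dp/ds)."""
    r = k / math.cos(theta)                       # point of the line x = k
    rho = (1.0 - math.sqrt(1.0 - r * r)) / r      # Busemann radius
    a = (1.0 + rho * rho) / (1.0 - rho * rho)     # r rho^-1 d(rho)/dr
    cos_phi = math.sin(theta) / math.sqrt(1.0 - k * k)
    cot_phi = cos_phi / math.sqrt(1.0 - cos_phi * cos_phi)
    cot_theta = 1.0 / math.tan(theta)
    numerator = (a * cot_phi * cot_theta - 1.0) + (a * cot_theta + cot_phi) * q
    denominator = (a * cot_phi + cot_theta) + (a - cot_phi * cot_theta) * q
    return numerator / denominator


def simplified_form(k: float, theta: float, q: float) -> float:
    """Simplified expression: [k^2 tan(theta) + sqrt(1 - k^2 sec^2(theta)) q] / (1 - k^2)."""
    root = math.sqrt(1.0 - (k / math.cos(theta)) ** 2)
    return (k * k * math.tan(theta) + root * q) / (1.0 - k * k)
```

The two functions should agree for any $0 < \theta < \cos^{-1}k$ and any value of `q`, which is the content of the step from (25) to (27).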

The other boundary conditions are unaltered, except for the discontinuity condition
(18) at the corner in the subsonic case. Putting $\xi = \rho\cos\theta$, $\eta = \rho\sin\theta$, the corner
becomes the point $\xi = -[1 - (1 - M_1^2)^{1/2}]/M_1$, $\eta = 0$, and the condition holding there

is that




i.e. 


lim 

17— 


J 


= [rp-^dpldrl 





(29) 


(l-ilf®)i’ 

(30) 


5. Solution of the potential problem

Now $p$ is given as a harmonic function satisfying certain boundary conditions
in a curvilinear triangle $ABC$ with $AB$ and $BC$ circular arcs and $AC$ a straight
segment, and all its angles right angles. To solve this potential problem conformal
transformation into a simpler domain is necessary.

Now geometrical inversion of this curvilinear triangle with respect to $B$ will
produce a triangle $A'B'C'$ with $B'$ at infinity, $A'B'$ and $B'C'$ straight segments and
$A'C'$ a circular arc, and all its angles (which are unaltered by inversion) right angles.
But if $\zeta = \xi + i\eta$ the transformation

$$Z = (ik - k')\,\frac{\zeta - k + ik'}{\zeta - k - ik'}, \qquad (31)$$

where $B$ is the point $\zeta = k + ik'$, i.e. $k' = (1 - k^2)^{1/2}$, performs this inversion, together
with reflexion, translation and rotation which do not affect the qualitative statement.
Further, when $\zeta$ is on the arc $2\rho\cos\theta = k(1 + \rho^2)$,

$$Z = \frac{(\cos^2\theta - k^2)^{1/2}}{k'\cos\theta - k\sin\theta}, \qquad (32)$$

which is purely real and increases from 1 to $\infty$ as $\theta$ increases from 0 to

$$\tan^{-1}(k'/k) = \cos^{-1}k.$$

Hence the triangle $A'B'C'$ must consist of that part of the first quadrant which is
outside the unit circle.
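The properties claimed for this transformation can be tested on a concrete bilinear map. The sketch below is an assumption-laden illustration rather than the paper's own formula: it takes $Z = (ik - k')(\zeta - k + ik')/(\zeta - k - ik')$ as one explicit map that sends $B$ to infinity, the wall to the unit circle and the shock arc to the real axis, and compares it with the real expression $(\cos^2\theta - k^2)^{1/2}/(k'\cos\theta - k\sin\theta)$ on the arc. All function names are hypothetical.

```python
import math


def mobius_Z(zeta: complex, k: float) -> complex:
    """Assumed bilinear map sending B = k + i k' to infinity and the wall to |Z| = 1."""
    kp = math.sqrt(1.0 - k * k)
    return complex(-kp, k) * (zeta - complex(k, -kp)) / (zeta - complex(k, kp))


def arc_point(theta: float, k: float) -> complex:
    """Point of the shock arc 2 rho cos(theta) = k (1 + rho^2) at polar angle theta."""
    r = k / math.cos(theta)
    rho = (1.0 - math.sqrt(1.0 - r * r)) / r
    return complex(rho * math.cos(theta), rho * math.sin(theta))


def Z_on_arc(theta: float, k: float) -> float:
    """Real value of Z on the shock arc, in the closed form quoted in the text."""
    kp = math.sqrt(1.0 - k * k)
    return math.sqrt(math.cos(theta) ** 2 - k * k) / (kp * math.cos(theta) - k * math.sin(theta))
```

For example, with $k = 0.6$ the arc point at $\theta = \pi/6$ maps to a real $Z$ greater than 1, the wall point $\zeta = 0$ maps to a point of the unit circle in the first quadrant, and $\zeta = -1$ maps to $Z = i$.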

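Equation (31) can be solved for $\theta$ as

$$\tan\theta = \frac{k'}{k}\,\frac{Z^2 - 1}{Z^2 + 1}; \qquad (33)$$

hence in the $Z$-plane the boundary condition (27) becomes

$$\frac{\partial p/\partial n}{\partial p/\partial s} = \left[\frac{Ak'(Z^2 - 1)}{Z^2 + 1} - \frac{Bk(Z^2 + 1)}{k'(Z^2 - 1)}\right]\Big/\left(\frac{2k'Z}{Z^2 + 1}\right). \qquad (34)$$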

The left-hand side is unaltered (since in conformal transformation the local deformation
of plane elements is a pure dilatation) provided that $dn$ is still an outward normal
to the domain and that $ds$ is in the positive tangential direction.

Lastly, with

$$z_1 = \tfrac{1}{2}(Z^2 + Z^{-2}), \qquad Z^2 = z_1 + (z_1^2 - 1)^{1/2}, \qquad (35)$$

the $z_1$-domain corresponding to the triangle $A'B'C'$ is the upper half-plane. The
shock, to which the part of the real axis with $Z > 1$ corresponded, becomes the part
of the real axis with $x_1 > 1$. The wall, to which the first quadrant of the circle $|Z| = 1$
corresponded, becomes the part of the real axis with $-1 < x_1 < 1$. The third boundary
$r = 1$, or $\rho = 1$, to which the part of the imaginary axis with $\Im Z > 1$ corresponded,
becomes the part of the real axis with $x_1 < -1$.
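The correspondence of boundaries under the last transformation is easy to verify directly. The following minimal sketch assumes only the relation $z_1 = \tfrac12(Z^2 + Z^{-2})$ of (35); the function name is hypothetical.

```python
import cmath


def z1_of_Z(Z: complex) -> complex:
    """Map z1 = (Z^2 + Z^-2) / 2 taking the first quadrant outside |Z| = 1 to the upper half-plane."""
    Z2 = Z * Z
    return 0.5 * (Z2 + 1.0 / Z2)


# Sample points on each boundary and one interior point of the Z-domain.
shock_image = z1_of_Z(2.0 + 0.0j)                         # real axis, Z > 1
wall_image = z1_of_Z(cmath.exp(0.4j))                     # first quadrant of |Z| = 1
third_image = z1_of_Z(2.0j)                               # imaginary axis, Im Z > 1
interior_image = z1_of_Z(2.0 * cmath.exp(0.25j * cmath.pi))
```

The three boundary images are real and fall in $x_1 > 1$, $-1 < x_1 < 1$ and $x_1 < -1$ respectively, while the interior sample has positive imaginary part.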

The shock boundary condition (34) becomes (with $z_1 = x_1 + iy_1$) the condition that

dpj^ 

dpjdx^ 

on $x_1 > 1$, $y_1 = 0$.

The wall boundary condition is that $\partial p/\partial y_1 = 0$ when $-1 < x_1 < 1$, $y_1 = 0$. The
discontinuity condition (29) becomes

( 1 - 




(36) 


Hm 




(37) 


and holds at the point corresponding to $\zeta = -[1 - (1 - M_1^2)^{1/2}]/M_1$, which is

$$z_1 = -\frac{(M_1 + k)^2 + (M_1^2 - 1)(1 - k^2)}{(M_1 k + 1)^2} = x_0 > -1. \qquad (38)$$

The condition on the third boundary can be written $\partial p/\partial x_1 = 0$ when $x_1 < -1$,
$y_1 = 0$. But in the supersonic case this must be supplemented with





by (17), an equation which holds at the point in the $z_1$-plane corresponding to
^ point is found to have exactly the same algebraic 

expression as the $x_0$ of (38), only in the supersonic case $x_0 < -1$.
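The position $x_0$ of the corner in the $z_1$-plane can be cross-checked in the subsonic case by composing the successive maps. This is a sketch under stated assumptions, not the paper's computation: the bilinear map used is the assumed form $Z = (ik - k')(\zeta - k + ik')/(\zeta - k - ik')$, the closed form compared against is $x_0 = -[(M_1 + k)^2 + (M_1^2 - 1)(1 - k^2)]/(M_1 k + 1)^2$, and the function names are hypothetical.

```python
import math


def corner_by_mapping(M1: float, k: float) -> complex:
    """Image in the z1-plane of the corner x = -M1 (subsonic case, 0 < M1 < 1)."""
    kp = math.sqrt(1.0 - k * k)
    zeta = complex(-(1.0 - math.sqrt(1.0 - M1 * M1)) / M1, 0.0)   # Busemann image of x = -M1
    Z = complex(-kp, k) * (zeta - complex(k, -kp)) / (zeta - complex(k, kp))
    Z2 = Z * Z
    return 0.5 * (Z2 + 1.0 / Z2)


def corner_closed_form(M1: float, k: float) -> float:
    """Closed-form candidate for x0 in terms of M1 and k."""
    return -((M1 + k) ** 2 + (M1 * M1 - 1.0) * (1.0 - k * k)) / (M1 * k + 1.0) ** 2
```

Agreement of the two functions for several subsonic pairs $(M_1, k)$ supports the closed form; the same expression gives $1 - x_0 = 2(M_1 + k)^2/(M_1 k + 1)^2$.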

The solution is effected by the introduction of a function

$$w(z_1) = \frac{\partial p}{\partial y_1} + i\frac{\partial p}{\partial x_1},$$

which is regular throughout the upper half-plane since $p$ is harmonic. In terms of $w$,
the discontinuity conditions (38) and (39) can be expressed by saying that, near

— Xq 

w~ ^ — i i- (supersome), w ^^(suDsomo). (40) 

% ^0 % ^0 

Further, $w$ is real on $x_1 < -1$, $y_1 = 0$ and purely imaginary on $-1 < x_1 < 1$, $y_1 = 0$.

Lastly, on $x_1 > 1$, $y_1 = 0$,

, [^(^1-^)]* arctan^^^*+arctan^^i^^ 

(41) 


argio(zi) = arc tan • 


where 


aygV2 2Bk V2 




Bh 


a+/? 1-P’ a+/(? "" 

so that, by (9) and (15), 

cc^ = a+^ = 2M^k^2, 

whence (since $Mk > 1$) both $\alpha$ and $\beta$ are positive.

A function satisfying equation (41) is 


(42) 

(43) 


$$[\alpha - i(z_1 - 1)^{1/2}]^{-1}\,[\beta - i(z_1 - 1)^{1/2}]^{-1}, \qquad (44)$$

where $(z_1 - 1)^{1/2}$ has non-negative imaginary part in the upper half-plane. This function
is regular throughout the domain, and even on the boundary for $z_1 \neq 1$; it is real
and positive for $x_1 < 1$, $y_1 = 0$. The condition (41) will still be satisfied if (44) be multiplied
by any function real for $x_1 > 1$, $y_1 = 0$. The functions chosen are, first, $(z_1^2 - 1)^{-1/2}$
(the branch positive for $x_1 > 1$, $y_1 = 0$; it is purely imaginary for $-1 < x_1 < 1$, $y_1 = 0$);
secondly, $-C\delta(z_1 - x_0)^{-1}$, where $C$ is a constant to be determined by conditions (40);
and thirdly, $1 - D(z_1 - x_0)$, where $D$ is to be determined by condition (20). The combined
expression

$$w(z_1) = \frac{C\delta\,[D(z_1 - x_0) - 1]}{(z_1^2 - 1)^{1/2}\,(z_1 - x_0)\,[\alpha - i(z_1 - 1)^{1/2}]\,[\beta - i(z_1 - 1)^{1/2}]} \qquad (45)$$

is the unique function satisfying all the boundary conditions and integrable at every
point including infinity but excluding $x_0$.

The constant $C$, by (40), satisfies


7rll-ilff|i ' ’ 

It has a finite non-zero limit for $M_1 = 1$, $x_0 = -1$. Since, by (33) and (35), on $x_1 > 1$,
$y_1 = 0$, $y$ is given by

$$y = k'\left(\frac{x_1 - 1}{x_1 + 1}\right)^{1/2}, \qquad (47)$$


equation (20) becomes 


s- r^^dx - + CS[D(x^-x,)-l] ( cc+f)(x,-l)i 


l)i (aji-aio) (a^+aii- l)(^+a:i- 

D{x+y^) — l 


1 ) 


dxj^ 


dx 


where 

Since by (46) 


^ BOS{a+fi) 1*“ 

k' Jo x^{x+0L^){x+^){x+y^) 

_ BOS{a+^) (Jy^— l)(K+yg+y)/ajgy+J) 

¥ ^ - (a+y^)(y?+r)(r+a) 

$$\gamma^2 = 1 - x_0 = 2(M_1 + k)^2/(M_1 k + 1)^2.$$


(48) 

(49) 


0 1-xl iM^ 

2(lfi + &)2 2( 1 - if?) ( 1 - P) 

* ifi 

(y?+7)(7+“) ~ l — 

(lfiA:+l)2 [M^k+lf 

TT 


2M^k'(Mi+k) 
7T{M-Jc-\-Vf ’ 

the condition determining D becomes 


(50) 


(ifi^+l)^ p (y+a)(y+yg) a+f+y 
2BMf,M^+k) af afy ' ^ ’ 

whence it is easily calculated using (43). 

The solution is now complete, and only back-transformation and computation
are necessary for the deduction of physical results. These have been computed, and
are set out in §§ 6-8, for the following values of $M$: 1.36277, 1.64751, 2.06809, 2.95200,
$\infty$. (The limit as $M \to 1$ is discussed in § 9.) The corresponding shock pressure-ratios
$p_1/p_0$, Mach numbers $M_1$ behind the shock and positions of the shock (relative to
the radius of the circle of propagation) $k$ are given in table 1; these explain the choice
of values.

Table 1

M         1    1.36277    1.64751    2.06809    2.95200    ∞
p1/p0     1    2          3          4.82315    10         ∞
M1        0    0.47245    0.72739    1          1.34463    1.88982
k         1    0.75593    0.65465    0.56619    0.47809    0.37796
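The entries of table 1 can be reproduced from the standard normal-shock relations for a perfect gas. The sketch below is not taken from the paper's equations (3), (9) and (15), which lie outside this section: it assumes a ratio of specific heats of 1.4, velocities measured in units of the sound speed ahead of the shock, and the reading of $k$ as the shock speed relative to the fluid behind it divided by the sound speed there. The function name is hypothetical.

```python
import math

GAMMA = 1.4  # assumed ratio of specific heats for air


def shock_table_row(M: float) -> tuple:
    """Return (p1/p0, M1, k) for a plane shock of Mach number M moving into still air."""
    pressure_ratio = (2.0 * GAMMA * M * M - (GAMMA - 1.0)) / (GAMMA + 1.0)
    density_ratio = (GAMMA + 1.0) * M * M / ((GAMMA - 1.0) * M * M + 2.0)
    sound_speed_ratio = math.sqrt(pressure_ratio / density_ratio)   # a1 / a0
    flow_speed = 2.0 * (M * M - 1.0) / ((GAMMA + 1.0) * M)          # u1 / a0
    M1 = flow_speed / sound_speed_ratio                              # Mach number behind the shock
    k = (M - flow_speed) / sound_speed_ratio                         # shock position / circle radius
    return pressure_ratio, M1, k
```

For instance, `shock_table_row(2.95200)` gives a pressure ratio close to 10, which is how the tabulated Mach numbers appear to have been chosen (round pressure ratios 2, 3 and 10, and the sonic case $M_1 = 1$).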


As Jkf ^ 00 , by (43), oc 2M^k ^2 and yff 1/i ^J2. Hence by (45) and (50) the limiting 
expression for w{zj) is 

2M,¥{M,+k){j8+y) ^EJ(%-^o)-l] 

■ 7r(lfiA+l)2 

which is finite and in no way more singular than (45). Thus the pressure and velocity
fields reduced in scale (proportionately to the pressure and velocity behind the
straight part of the shock) have a finite limiting shape and distribution as $M \to \infty$;
these are treated below on the same footing as those for the finite values of $M$
considered.



6. The shape of the diffracted shock

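At a point $(k, y)$ of the shock, or $(x_1, 0)$ in the $z_1$-plane, the shock curvature $\kappa$
(taken positive when the shock is convex to the still air) is

$$\kappa = \frac{B}{y}\frac{dx_1}{dy}\frac{\partial p}{\partial x_1} = \frac{B(x_1 + 1)^2}{1 - k^2}\frac{\partial p}{\partial x_1} = \frac{BC\delta(\alpha + \beta)\,(x_1 + 1)^{3/2}\,[D(x_1 - x_0) - 1]}{(1 - k^2)\,(x_1 - x_0)\,(\alpha^2 + x_1 - 1)\,(\beta^2 + x_1 - 1)}, \qquad (53)$$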

by (47) and (45). $\kappa/\delta$ is graphed against $y/k'$ (which runs from 0 to 1 on the curved
part of the shock), for the four finite values of $M$ listed above, in figure 1, where the
limit as $M \to \infty$, which is

$$\frac{\kappa}{\delta} = \frac{2M_1k'(M_1 + k)(\beta + \gamma)}{\pi(M_1 k + 1)^2}\,\frac{B(x_1 + 1)^{3/2}\,[E(x_1 - x_0) - 1]}{(1 - k^2)\,(x_1 - x_0)\,(\beta^2 + x_1 - 1)}, \qquad (54)$$

is also shown. While (53) is zero at $x_1 = \infty$ ($y = k'$), (54) is infinite; the figure shows
that this infinity is the limit of a very steep peak, which occurs just before $y = k'$ for
$M < \infty$. The infinity is only one of curvature, and leads to no great peculiarity of
shape of the shocks themselves.



Figure 1. Curvature of diffracted shock, plotted against $y/k'$. The numbers on the curves are the values of

It is observed that, for the larger values of $M$ (or of $p_1/p_0$), $\kappa$ is negative for small $y$;
so that the shock is concave to the still air near the wall, changing to convex farther
out through a point of inflexion. This is due to the term in square brackets in (53)
and (54); the point of inflexion is at $x_1 = x_0 + D^{-1}$ and appears if $x_0 + D^{-1} > 1$. The
transition occurs when $D = (1 - x_0)^{-1}$, hence by (51) and (49) when




(55) 


whence, by (3), (9) and (15), $M = 2.53111$ and $p_1 = 7.30760\,p_0$. Hence it is for shock
pressure-ratios $> 7.31$ that a point of inflexion should appear in the diffracted shock.
It will be observed that the maximum weakening at the wall for an entirely convex
shock, which would be obtained by a function $\kappa(y)$ always non-negative but entirely
concentrated near $y = k'$ (and still satisfying the condition $\int_0^1 (\kappa/\delta)\,d(y/k') = 1/k'$,
which is a check on each of the curves in figure 1), would be obtained by drawing the
diffracted shock as a straight line normal to the wall: but that greater weakening is
possible once a point of inflexion is allowed.

The shapes of the shocks, and of the regions behind them which are disturbed
from uniform flow, for $\delta = 0.1$ radian, are shown, together with the wall pressure
distributions, in the five cases considered, in figures 2 to 6, whose scales are chosen
to make the distance between the corner and the shock the same in each; the
thick line represents the shock.

7. Pressure distribution along the wall 


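At a point $(x, 0)$ of the wall ($-1 < x < k$), the $x_1$ co-ordinate is

$$x_1 = 1 - \frac{2(k - x)^2}{(1 - kx)^2}, \qquad (56)$$

and satisfies $-1 < x_1 < 1$. Now in this region, by (45),

$$\frac{\partial p}{\partial x_1} = -\frac{C\delta\,[D(x_1 - x_0) - 1]}{(1 - x_1^2)^{1/2}\,(x_1 - x_0)\,[\alpha + (1 - x_1)^{1/2}]\,[\beta + (1 - x_1)^{1/2}]} \qquad (57)$$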

Figure 2. Wall pressure distribution and shape of disturbed region ($\delta = 0.1$ radian, $p_1/p_0 = 2$).

Figures 5, 6. Wall pressure distribution and shape of disturbed region ($\delta = 0.1$ radian; $p_1/p_0 = 10$ in figure 5, $p_1/p_0 = \infty$ in figure 6). The ordinate is $(p_1 - p_2)/\delta(p_1 - p_0)$.


(with a similar expression corresponding to (52) for $M \to \infty$), and while $p$ could
theoretically be integrated from (57) as an elementary function it is a simpler matter
in computation to use numerical integration. The integration is best carried out
over equal intervals of the angle $\cos^{-1}x_1$, a device which eliminates the singularities
at $x_1 = \pm 1$. When $M_1 < 1$, so that $-1 < x_0 < 1$, the known singularity of (57) at
$x_1 = x_0$ must be subtracted before integration, and the integrated singular part
added to the result.
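The device of integrating over equal intervals of the angle $\cos^{-1}x_1$ can be illustrated on a model integrand. This is a minimal sketch, not the paper's computation: the function name is hypothetical, the midpoint rule is one possible choice of quadrature, and the integrand is taken to have the generic form $g(x_1)(1 - x_1^2)^{-1/2}$ with $g$ smooth, so that the substitution $x_1 = \cos\varphi$ removes the end-point singularities.

```python
import math


def integrate_over_angle(g, n: int = 200) -> float:
    """Approximate the integral of g(x) / sqrt(1 - x^2) over -1 < x < 1.

    With x = cos(phi) the integral becomes that of g(cos(phi)) over 0 < phi < pi,
    which is evaluated here by the midpoint rule on n equal intervals of phi.
    """
    h = math.pi / n
    return h * sum(g(math.cos((j + 0.5) * h)) for j in range(n))
```

With `g = lambda x: 1.0` the routine returns $\pi$, and with `g = lambda x: 1.0 / (2.0 + x)` it returns $\pi/\sqrt{3}$, even though the original integrands are infinite at both end-points. An interior singularity such as the one at $x_1 = x_0$ is not removed by this substitution and still has to be subtracted separately, as the text describes.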



The quantity chosen to be graphed against $x$ in figures 2 to 6 is the relative
deficiency of pressure at the wall, divided by the angle $\delta$ in radians, that is,

P1-P2 P\_ P /.ov 

S{Pi-Po) Pi-Po\ S) H* ^ ^ 

The qualitative results which appear could have been simply deduced from the form
of (57): that in the subsonic case the deficiency rises from zero at the boundary of
the disturbed region to a logarithmic infinity at the corner, then falls to a minimum
and rises again to its final value at the shock: that in the sonic case ($M_1 = 1$) the
deficiency falls from its algebraically infinite value (to which at the corner it has
suddenly jumped from zero) to a minimum and then rises: that in the supersonic
case it has a Prandtl-Meyer discontinuity at the corner from which, in $-1 < x < k$, it
falls and then slightly rises again when $p_1/p_0 < 7.31$, but falls monotonically when
$p_1/p_0 > 7.31$. The logarithmic infinity in the subsonic case corresponds to a rapid
flow round the corner. In the sonic case the true $(p_1 - p_2)/(p_1 - p_0)$ immediately
behind the corner is probably equal to the Prandtl-Meyer value with sonic initial
velocity, which is of order $\delta^{2/3}$ for small $\delta$, so that it is not surprising that the present
theory gives $\lim\,(p_1 - p_2)/\delta(p_1 - p_0) = \infty$ at this point.

Quantitatively it is seen that the scale of even the relative pressure deficiency
$(p_1 - p_2)/(p_1 - p_0)$ increases with the shock pressure-ratio $p_1/p_0$. All the pressure
distributions given have been checked satisfactorily by calculating the pressure at
the junction of wall and shock independently by the integration of $\kappa$ as obtained
in § 6, using equation (13).


8. Nature of the third boundary for convex and concave corners 


From equation (45) it is possible to calculate $\partial p/\partial y_1$ on $x_1 < -1$, $y_1 = 0$; and, in
Busemann's $(\rho, \theta)$ plane, the value of the pressure-derivative $\partial p/\partial n$, along the
inward normal on that part of the boundary with $\rho = 1$, is deduced therefrom by
multiplying by a factor $|dz_1/d\zeta|$ which varies but remains positive. Thus, when $\delta > 0$,

$$\frac{\partial p}{\partial n} = -\left|\frac{dz_1}{d\zeta}\right|\frac{C\delta\,[D(x_1 - x_0) - 1]}{(x_1^2 - 1)^{1/2}\,(x_1 - x_0)\,[\alpha + (1 - x_1)^{1/2}]\,[\beta + (1 - x_1)^{1/2}]}, \qquad (59)$$

which is negative when $x_1 < x_0$, but is positive when $x_0 < x_1 < -1$ (provided also
$x_1 < x_0 + D^{-1}$; but actually $x_0 + D^{-1} > -1$ for all $M$).

But in the original plane $(r, \theta)$, $(\partial p/\partial r)_{r=1}$ is infinite, since it is obtained from (59)
by multiplication by $-[d\rho/dr]_{r=1}$, which is infinite. The actual behaviour near $r = 1$ is deduced
from the asymptotic equality $1 - \rho \sim [2(1 - r)]^{1/2}$ as

(60) 


This behaviour is probably not what really occurs but is a singularity which is the
only way that the linearized theory knows of describing some more complicated
phenomenon, such as a shock or a rapid expansion. The same behaviour of the
solution occurs in the cone-field theory of supersonic steady flow, referred to in § 3.
Experience in the latter field points to the tentative conclusion that the true phenomenon
is a shock when $\partial p/\partial n > 0$ and an expansion (rapid but not discontinuous)
when $\partial p/\partial n < 0$.* In the latter case the boundary must be the circle exactly, since it
must be a characteristic: but in the former the shock must be slightly further from
the origin than the circle, in order that its own motion shall be supersonic.

On this assumption, and taking $\delta > 0$, the whole circle represents an expansion in
the subsonic case: it is accordingly dotted in figures 2, 3 and 4. But in the supersonic
case $x_0 < -1$, and the circle is an expansion for $x_1 < x_0$, i.e. for points to the right of
the point of contact of the tangent from the corner; and this part, together with the
tangent representing a Prandtl-Meyer expansion, is shown dotted in figures 5 and 6;
but the remainder of the circle, shown plain, has $\partial p/\partial n > 0$ and probably corresponds
to a weak shock.

But when $\delta < 0$, so that the corner is concave to the flow, the sign of $\partial p/\partial n$ is
changed throughout and the whole circle corresponds to a shock in the subsonic case,
which in the supersonic case is replaced by part of the circle plus the tangent from
the corner. The shock should be drawn slightly away from the circle, except where
it joins the main shock ($x = k$, $x_1 = \infty$), when by (59) $\partial p/\partial n = 0$. (The exact position
of the straight oblique shock in the supersonic case is known from steady flow theory.)
The shock pattern is sketched in two cases ($p_1/p_0 = 3$ and 10) in figure 7. The type
of three shock intersection that occurs is similar to the well-known experimental
phenomenon of 'Mach reflexion'.


Figure 7. Corner concave to the flow: three shock intersections ($\delta = -0.1$ radian, $p_1/p_0 = 3, 10$).

9. Limiting solution as $M \to 1$: connexion with the acoustic theory

As $M \to 1$ the origin of $(x, y)$ co-ordinates approaches the corner, which in the limit
becomes the centre of the circle of disturbance, to which the undisturbed portion of
the shock is a tangent. In the acoustic theory of diffraction round a corner (Sommerfeld
1901; Friedlander 1946), with $\delta$ not small, the diffracted wave front is similarly a
circular arc with the corner as centre and the undisturbed wave front as tangent.
In this section the wall pressure distribution, given by the present theory (which
has treated the limit of the problem as $\delta \to 0$) when $M \to 1$, is compared with that
given by the acoustic theory (which lets $M \to 1$ at the start) when $\delta \to 0$. The inversion
of this double limit is found to be valid, but not uniformly so near the
wall-shock junction.

Let $M = 1 + \varepsilon$: then from (9) $1 - k \sim \varepsilon$ as $\varepsilon \to 0$. By (56), for fixed $x < 1$ as $\varepsilon \to 0$,

$$x_1 + 1 \sim 4\varepsilon\,\frac{1 + x}{1 - x}. \qquad (61)$$

* The author will shortly publish a proof that this is so in supersonic steady flow.





-§-^-2 
40 = • 



But $\alpha$, $\beta$, $\gamma$ all tend to $\sqrt{2}$ as $\varepsilon \to 0$, so by (50) and (51)

0~im^k'l7T and 

Hence in (57) $(x_1 - x_0)^{-1}$ may be neglected (in the limit) in comparison with $D$, if
$x \neq 0$; and

dp dxJ{-im^h'jTT)^{BMfr^ 8e d[-16V(2e)/7r](^i) 
dx~ dx [8e(l+x)l{l-x)}i8 '"(1-xf [8e(l+x)/(l-a;)]i8 

- 

~ (l-a:) 4 (l+a:)i’ ’ 

The reasoning is not uniformly correct near $x = 1$, since (61) fails here. The value of
$p$ at the wall-shock junction (deduced from (57)) as $\varepsilon \to 0$ is in fact

dx^ 


■CD8 


= % 

TT^e 


'f — 

Jo(l- 


4^2 d 

y^)^{i+y)^ Stt ^e' 


(63) 


(But this does not mean an infinite pressure as $\varepsilon \to 0$ since the actual pressure differs
from Pi by Ui^iPiP ~ |ePoP)- 

On the acoustic theory there is no discontinuity at the refracted part of the shock
(Sommerfeld 1901). Probably this is because in fact the discontinuity is $o(\varepsilon)$ as $\varepsilon \to 0$.
Thus, in the notation of the present paper, while $p = 0$ for $r = 1$, $0 < \theta < \pi$, the equation
holding for $-\delta < \theta < 0$ is

(64) 

OlglPl OSiPi 

where equation (1) has been used. Now $p$ satisfies (16) as before, and hence also (22).
The $\zeta$-plane is a sector of angle $\pi + \delta$. The transformations

$$(\zeta e^{i\delta})^{\pi/(\pi + \delta)} = Z_1, \qquad -\tfrac{1}{2}(Z_1 + Z_1^{-1}) = z_2, \qquad (65)$$

transform it into the upper half $z_2$-plane; in which the boundary conditions are
$\partial p/\partial y_2 = 0$ for $y_2 = 0$, $|x_2| > 1$; $p = 0$ for $y_2 = 0$, $-\cos[\pi\delta/(\pi + \delta)] < x_2 < 1$; $p = -1$
for $y_2 = 0$, $-1 < x_2 < -\cos[\pi\delta/(\pi + \delta)]$.
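The solution with $p$ bounded is given by

$$\frac{\partial p}{\partial y_2} + i\frac{\partial p}{\partial x_2} = \frac{-\pi^{-1}\sin[\pi\delta/(\pi + \delta)]}{(z_2 + \cos[\pi\delta/(\pi + \delta)])\,(1 - z_2^2)^{1/2}}, \qquad (66)$$

which yields in $|x_2| > 1$, $y_2 = 0$

$$\frac{\partial p}{\partial x_2} = \frac{-\pi^{-1}\sin[\pi\delta/(\pi + \delta)]\,\operatorname{sgn}x_2}{(x_2 + \cos[\pi\delta/(\pi + \delta)])\,(x_2^2 - 1)^{1/2}} \sim \frac{-\pi^{-1}\delta\,\operatorname{sgn}x_2}{(x_2 + 1)\,(x_2^2 - 1)^{1/2}} \qquad (67)$$

as $\delta \to 0$. But

$$x_2 = -\frac{1}{2}\left\{\frac{1 - (1 - x^2)^{1/2}}{x} + \frac{1 + (1 - x^2)^{1/2}}{x}\right\} = -\frac{1}{x}, \qquad (68)$$

so that

$$\frac{\partial p}{\partial x} \sim \frac{1}{x^2}\,\frac{-\pi^{-1}\delta\,\operatorname{sgn}x}{(x^{-1} - 1)\,(x^{-2} - 1)^{1/2}} = \frac{-\delta/\pi}{(1 - x)(1 - x^2)^{1/2}}, \qquad (69)$$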

as in (62). But the $p$ given by (66) is continuous except at $z_2 = -\cos[\pi\delta/(\pi + \delta)]$, so
that its value at the wall-shock junction is $-1$, in conflict with the other non-uniform
double limit (63).

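If the true value were $-1 - (4\sqrt{2}/3\pi)\,\delta\varepsilon^{-1/2}$ the limit as first $\delta$ and then $\varepsilon \to 0$ would
be $-1$, and the asymptotic form as first $\varepsilon$ and then $\delta \to 0$ would be (63). But there
may be other important terms in $(p)_{x=1}$ for small $\delta$ and $\varepsilon$, which both methods screen.
The function, deduced from (62),

$$\frac{p_1 - p_2}{\delta(p_1 - p_0)} = -\frac{p}{\delta} = \frac{1}{\pi}\left(\frac{1 + x}{1 - x}\right)^{1/2} \qquad (70)$$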


therefore indicates the wall pressure-distribution (for $\delta$ small and $M - 1 = \varepsilon$ small)
correctly, except that the infinite limit as $x \to 1$ conceals a true finite limiting value
containing terms $O(\delta^{-1}) + O(\varepsilon^{-1/2})$. No attempt has been made, however, to correct
the value of (70) near $x = 1$ in figure 8, which is the counterpart of figures 2 to 6 for
$M \to 1$, and is drawn on the same scale.
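As a worked check of how a limiting gradient of the kind quoted in (62) and (69) integrates to a wall distribution, the following lines assume (this is a reading of the damaged source, not a statement taken from it) that $\partial p/\partial x \sim -(\delta/\pi)(1 - x)^{-3/2}(1 + x)^{-1/2}$ on the wall and that $p = 0$ at the edge $x = -1$ of the disturbed region.

```latex
\frac{d}{dx}\left(\frac{1+x}{1-x}\right)^{1/2}
  = \frac{1}{2}\left(\frac{1-x}{1+x}\right)^{1/2}\frac{2}{(1-x)^{2}}
  = \frac{1}{(1-x)^{3/2}(1+x)^{1/2}},
\qquad\text{so}\qquad
-\frac{p}{\delta}
  = \frac{1}{\pi}\int_{-1}^{x}\frac{dt}{(1-t)^{3/2}(1+t)^{1/2}}
  = \frac{1}{\pi}\left(\frac{1+x}{1-x}\right)^{1/2}.
```

Under these assumptions the deficiency vanishes at $x = -1$, equals $1/\pi$ at the corner $x = 0$, and grows without limit as $x \to 1$, which is the non-uniformity near the wall-shock junction discussed in the text.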


Figure 8. Wall pressure and shape of disturbed region ($\delta = 0.1$ radian, $p_1/p_0 = 1$).

A formal conclusion to this paper is omitted, since deductions not already drawn
and presented in figures 1 to 8 are postponed to later parts.


References

Busemann, A. 1943 Luftfahrtforsch. 20, 105.
Friedlander, F. G. 1946 Proc. Roy. Soc. A, 186, 322.
Sommerfeld, A. 1901 Z. Math. Phys. 46, 11.



Kinetic theory of diffusion in gases and liquids.

II. General kinetic theory of liquid mixtures

By L. M. Yang, University of Edinburgh

(Communicated by M. Born, F.R.S. - Received 8 December 1948 -
Revised 11 March 1949)

In this paper the classical kinetic theory of simple liquids developed by Born & Green is
generalized for liquid mixtures. Though the fundamental equations preserve many of the
general features of those relating to simple liquids, emphasis is laid on certain points
characteristic of a liquid mixture. In the expansion procedure, improvement has been made
in the choice of parameters; the potentials of the external force fields are taken as
parameters instead of the external forces in order that force diffusion should follow naturally.
The hydrodynamical equations for a liquid mixture follow after a set of relations between
the observable quantities and those defined with respect to each component in the mixture
have been obtained. These equations when compared with the corresponding ones for gas
mixtures reveal the peculiar complicated features of liquid mixtures.

A practical method of solution for the expanded equation of motion is discussed and
general expressions for the coefficients of ordinary diffusion and thermal diffusion and also
the coefficient of viscosity in a liquid mixture are given in the last section.


1. Introduction

While the kinetic theory of gases and solids has progressed a long way towards 
perfection, little has been done on the kinetic theory of liquids owing to the fact 
that no simple assumptions are available as in the case of gases or solids. There are, 
however, quite a number of fragmentary theories devised for special purposes as 
summarized by Frenkel (1946). Recently in a series of papers by Born & Green 
(1946, 1947 a, b), and independently by Kirkwood (1946, 1947), a new attempt has 
been made towards a rigorous and systematic formulation of the kinetic theory of 
liquids. The essential feature of this theory is the introduction of multiple distribu- 
tion functions which make possible the study of molecular clusters. Though a con- 
venient basis has been set up, there remains much to be done to bring the theory 
nearer to the actual calculation of empirical constants from atomic data. The 
present paper deals with the case of a liquid mixture, proceeding along much the
same line as part III of Born & Green's paper. Fundamental equations of conservation
for a molecular cluster are derived from Liouville's theorem in classical phase
space. These equations are shown to reduce to the macroscopic hydrodynamical
equations with the help of a set of relations between quantities actually observed
in a liquid mixture and those defined with reference to the molecules of a particular
kind in a mixture. Comparisons are made on several occasions with the example of
a gas mixture, thus enabling the peculiar complicated features in a liquid mixture 
to be seen clearly. 

In the expansion procedure, improvement has been made in the choice of physical 
parameters; the potentials of the molecules in the external force field instead of the 
external forces themselves are taken as parameters in order that force diffusion may 
follow naturally from the expanded equation of motion. An exact analogue of the 
theory of gas mixtures is obtained in the expanded equation of motion except that 


the collision integrals in the gas theory are replaced by certain integrals involving
the velocity-distribution functions of pairs of molecules, and that there are additional 
terms involving the potential pressures which are taken to be zero in the gas theory. 
A practical method of solution is discussed in § 6 which consists of modifying the 
assumption of binary encounters in the following manner. As one cannot expect to 
consider interactions of several molecules at a time, the expedient thing to do is to 
consider the motion of one molecule relative to another in an averaged field due to 
the presence of the others. This field is taken to be the same as if the fluid were in 
equilibrium. By this method the pair-distribution functions are expressed in terms 
of single-distribution functions, and a closed integral equation, which may be 
regarded as a generalization of Boltzmann’s equation, is obtained. No numerical 
calculation is attempted in the present paper, as this will necessarily be a consider- 
able task. General expressions for the coefficients of diffusion and viscosity obtained 
from quite general considerations are given in the last section. 

2. Notation and definitions

Consider an assembly of two different kinds of molecules of total number $N$
occupying a total volume $V$. Let $N_1$, $N_2$ be the numbers of molecules of the first and
second kind respectively, and $m_1$, $m_2$ their molecular masses. Of these $N$ ($= N_1 + N_2$)
molecules, one can choose any $h_1$ of the first kind and any $h_2$ of the second kind for
special consideration.

Let the velocity of a molecule of the first kind at the position be denoted by 
To abbreviate, we shall use Xi and to denote all the position and velocity 
vectors of the molecules and and those of the molecules. The volume 

elements JI n dxi^dxf wiU be denoted by and n fl by d%j, ^ . 

1 i=l5=l ^ ® 

Next, multiple-distribution functions symmetrized with respect to molecules of 
the same kind will be defined. The distribution function is 

defined such that is the probability of finding the volume elements 

occupied by any molecules of the first kind and the volume elements 
dx ^^ . . . by any molecules of the second kind simultaneously at time t. Similarly 

Aiftj = is defined such th&t is the probability of 

finding the volume elements x^f, dx^'> {i = 1, 2, occupied by any molecules of 
the first kind with velocities in If, dii® (2 = 1,2, mdx^, dx^{j = 1,2, 
occupied by any molecules of the second kind with velocities in 
smultaneously at time t. It follows fi-om the above definitions that 

( 1 ) 

Since the quotient i — represents the probability of finding a molecule 

of the first kind at xf»+w, dxfi+i> knowing that dx^^^^is occupied by molecules 
of the first kind and of the second kind, one has 


( 2 ) 




Similarly, ~ 

By repeating the above process it can be shown that 








When one of $h_1$ and $h_2$ is zero and the other is one, single-index notation
will be used, e.g. $n_{10}$ and $f_{01}$ will be replaced by $n_1$ and $f_2$ respectively. In accordance
with the usual notation, $n_1$ and $n_2$ are the number densities, and $f_1$ and $f_2$ the velocity
distribution functions of the two kinds of molecules normalized with respect to their
number densities. The partial mass densities are $\rho_1 = m_1 n_1$ and $\rho_2 = m_2 n_2$. The total
number and mass density are $n = n_1 + n_2$ and $\rho = \rho_1 + \rho_2$ respectively.

3. Fundamental conservation equations

All fundamental conservation equations will be derived from Liouville's theorem
in classical phase space of the system of molecules considered in § 2. Before deriving
these equations a number of quantities pertaining to the set of molecules $h_1$, $h_2$ will
be introduced; the Hamiltonian $H(h_1h_2)$ for the set of molecules $h_1$, $h_2$ is given by

$$H(h_1h_2) = K(h_1h_2) + W(h_1h_2) + E(h_1h_2), \qquad (5)$$

where 

w(hk,) = 1 , 11 /® ■ 

- E(hiK) = S Ef>+ S 

i=l j=l 

Am = is the potential energy of two molecules of th^e first kind 

Am = A (ixy)-xi*^|)thatoftwomoleoulesofthesecondkindand - til'll ^ |) 

that of two molecules of different kinds, it being assumed that the intermolecid^ 
forces are central and depend only on the distances between the mass centres of the 
molecules. is the potential energy of a moleciffe of the fet W at 

xi« in the external force field assumed to be conservative and E^z_ = E^{x^ ) that ot 

a molecule of the second kind at x^\ ^ , u 

The drift in phase space is governed by Liouville's theorem

where the square bracket [ ] is the classical Poisson bracket defined by 

1 -VW 0a da 0;g \ 1 ^ (8) 

^ Si 0xf ; ■’’ma Si ' 




By integrating both sides of (7) over the position and velocity co-ordinates of
all the molecules except the set $h_1$, $h_2$, one obtains the equation of motion for $f_{h_1h_2}$

where = S S 

i—l i=l 

k, /H 1 

^ * j=l i=l 

In the following $i$ and $k$ will be reserved for indicating quantities pertaining to the
first kind of molecules, $j$ and $l$ for the second kind of molecules, and no mention will
be made of the kind of molecules concerned.

Let be any molecular property of a molecule at with velocity 

5^*^. (p^i^ may be a scalar, a vector or a tensor. Multiplying (&) by and inte- 

grating over the velocities, we shall show that for cp^ = 1, and the equation 
of continuity, of conservation of momentum and energy of the set will foUow. 
For (p^ == 1, one obtains 

0 r 

0^%i7t2 j i^i^l^2)9fhihzl ^^hihzi (H) 

since E{h-Jh^, yhit^^ and are functions of position co-ordinates only, 

and the integrals involving these quantities can be transformed into vanishing 
surface integrals. If one defines by 

= (12) 

with a similar definition for equation (11) becomes 

= 0, (13) 

which is the equation of continuity for the set Ag, is the average velocity of 
a molecule at x$f^ in the set Aj, Ag whose positions are specified. By introducing the 


convective time derivative djdt defined by 

- 

d 9 ^ /_.v 9 ^ /.X 9 

(14) 

equation (13) becomes 


- «• 

(15) 

For cp^ = one obtains after performing partial integrations 

involving W{\h^, and 

on integrals 


(16) 




where 

It follows from (13) and (14) that 

(18) 

(19) 

( 20 ) 


Combining (18) with (16), one obtains 


where 


with a similar expression for 
An equation similar to (19) can be mitten for the rate of change of 

^ K<'ft.) + — (| ^) • 2 ^) • 


where 




( 21 ) 

( 22 ) 


and is similarly defined. 

One can bring (19) and (20) into still simpler form by introducing 
defined by the equations 


y / J- L<«) U (y f ' 

Equations (19) and (20) then become 

► 


(23) 


dt^ 


where 


|KuW+^{Ej^,Pia+ 24 .PSf 4 -F§>. 

p^l = K^l + >-k;l. etc. 


= etc. 


(24) 


(26) 


The interpretation of these equations is clear. Equation (24) gives the rate of 
change of the average velocity of a molecule in the set $h_1, h_2$ whose positions are 
specified; apart from the external force acting on this molecule, there are the kinetic 





pressure force (the divergence of the kinetic pressure tensors $\mathsf{K}$, etc.), defined in a similar manner as in gas theory, and the 
potential pressure force (the divergence of the potential pressure tensors $\mathsf{L}$, etc.), which consists of forces due to the rest of 
the set $h_1, h_2$ whose positions are specified and the set $(N_1 - h_1)$, $(N_2 - h_2)$ whose 
positions are not specified. 

One can interpret the pressure tensor even more naturally by considering the 
motion of a representative point in the dimensional phase space. Let 

^^ 1^2 ^ vector in the Z(h^ + h 2 ysi^B.Qe whose components are {i = 1^2, 

andx4^*^ {j = 1 , 2 ,...,A 2 ), the momentum vector whose components are 
and external force vector whose components are and 

kinetic pressure tensor whose components are , Kgi and 

(i, h= 1,2, ..., j, 1=1,2, and similarly for The total pressure 

p _K ,| /ofix 

One has then from (24) 


, M ^ P 1-F 


(27) 


which is an exact analogue of the ordinary hydrodynamical equation except for the 
difference in the number of dimensions. 

For one obtains, using (17), 

J Tfl-^ 

( r 

^ (J 3x^'> ■ 

/•3i«i,7is+l) , 1 

+ J -"Ax/i.) . (28) 

Jlj_ 7l2 w 


where 


(29) 


defines the generalized temperature of a molecule at $\mathbf{x}^{(i)}$ in the set $h_1, h_2$. With the 
help of (14) and (19), equation (28) can be transformed to the form 

2^71.+ |:{k0. : ^)<A.+ ■ m^lj + S 


i 


0X^ 


4- 


and is similarly defined. 


(31) 




The external force, which affects only the mass-motion velocity, and the internal 
forces due to the rest of the set $h_1, h_2$ do not enter into (30), as one would expect. By 
strict analogy with the gas theory one sees that the vectors defined in (31) together represent 
the generalized kinetic thermal energy flux connected with a molecule at $\mathbf{x}^{(i)}$ in 
the set $h_1, h_2$. 

An equation similar to (30) can be written for the rate of change of the generalized 
temperature of a molecule of the second kind. To find 
the corresponding equation for the rate of change of the internal potential energy, 
one writes for the internal potential energy of a molecule at $\mathbf{x}^{(i)}$ in the set $h_1, h_2$ 


— - 


^«.fti+Ddx^i+D +i 

(32) 

where the factor $\tfrac12$ signifies that the mutual potential energy is shared by each pair. 
Using (13) and (14), one has for the rate of change of 


^hlh, + S g^fc) • n^l + S g^g) • HaJ 


iCi) 


where 

s ^ 

k 


+|«Ai,Rs+i^^r- • (33) 


1 r 

g 

with a similar expression for the molecules of the second kind. Combining (30) and (33), one has 


Where = imin,+ \ 

= etc.] 


= 0 , 

(35) 

(36) 


The interpretation of (36) can be given in either of the two ways in which equation 
(24) has been interpreted. Each of and consists of two parts, the kinetic 

and the potential part, while the terms etc., represent the 

dissipation of mechanical energy into heat due to internal friction. 





4. Reduction of the molecular conservation equations 
to ordinary hydrodynamical equations 

In an assembly consisting of only one kind of molecules, the generalized definitions 
of the kinetic pressure, the temperature and the thermal flux reduce for $h_1 = 1$ in 
each case to their respective ordinary definitions. This is not so in a mixture, where 
the generalized definitions of the above quantities reduce for $h_1 = 1$, $h_2 = 0$ to 
quantities referring to the first kind of molecules only, which cannot be directly observed. 
What one directly observes in a mixture, instead of the above-mentioned quantities, 
are the average properties contributed by a small group of molecules of various types 
in a small but finite time interval. Nevertheless, it is possible to find simple 
connexions between the two sets of quantities so that either of them can be expressed 
in terms of the other. 

There are two different average velocities in a mixture: the directly observable 
mass velocity u relative to which the peculiar velocities (or thermal velocities) of the 
various types of molecules, and therefore the kinetic pressure, temperature and 
thermal flux in a mixture are defined, and the number velocity used in connexion 
with the transport of molecular properties 


_ Pi^i+P2^2 „ + 

p , - , (^7) 

wnere '^ 2 = (38) 

J ^ J ^2 

fi, and stand for /io(x^, t), t) and respectively. 
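
As a numerical illustration of the two average velocities distinguished above, the following sketch evaluates a mass-weighted and a number-weighted mean of the partial velocities. It assumes the usual definitions, $\mathbf{u} = (\rho_1\mathbf{u}_1 + \rho_2\mathbf{u}_2)/\rho$ with $\rho_i = n_i m_i$ for the mass velocity and $(n_1\mathbf{u}_1 + n_2\mathbf{u}_2)/n$ for the number velocity; the function name and the sample values are hypothetical and serve only as an example.

```python
def mixture_velocities(n1, n2, m1, m2, u1, u2):
    """Return (mass velocity, number velocity) of a binary mixture.

    Assumed definitions (not taken verbatim from the text):
      mass velocity   = (rho1*u1 + rho2*u2) / (rho1 + rho2), rho_i = n_i*m_i
      number velocity = (n1*u1 + n2*u2) / (n1 + n2)
    """
    rho1, rho2 = n1 * m1, n2 * m2  # partial mass densities
    u_mass = tuple((rho1 * a + rho2 * b) / (rho1 + rho2) for a, b in zip(u1, u2))
    u_number = tuple((n1 * a + n2 * b) / (n1 + n2) for a, b in zip(u1, u2))
    return u_mass, u_number


# Hypothetical sample: equal number densities, second species three times heavier.
u_mass, u_number = mixture_velocities(
    n1=1.0, n2=1.0, m1=1.0, m2=3.0,
    u1=(1.0, 0.0, 0.0), u2=(0.0, 0.0, 0.0),
)
print(u_mass)    # mass-weighted mean, dominated by the heavier species
print(u_number)  # number-weighted mean
```

The two means coincide only when the molecular masses are equal or the partial velocities agree, which is why the text keeps them distinct in a mixture.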

Let K, T and m be the kinetic pressure, temperature and thermal flux in a mixture. 
These are defined as 



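where $\mathbf{V}_1 = \boldsymbol{\xi}_1 - \mathbf{u}$, $\mathbf{V}_2 = \boldsymbol{\xi}_2 - \mathbf{u}$. (40) 

$\mathsf{K}_1$ and $\mathbf{m}_1$ are the partial kinetic pressure and the thermal flux of the molecules of 
the first kind, and $T_1$ the partial temperature such that $\tfrac12 kT_1$ is the average heat energy 
for each degree of freedom of a molecule of the first kind. $\mathsf{K}_2$, $T_2$ and $\mathbf{m}_2$ have 
similar meanings for the second kind of molecules. 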




On the other hand, one obtains from (20), (29) and (31) for $h_1 = 1$, $h_2 = 0$ 


_ 'Hi r^L7,2 

2K-L10 ““ 2 

^10 = ^J/l ''^10 ^10^^105 

with similar expressions for Kq^, Tq^ and m.Q^, where 
From the above definitions, it can be shown that 


(41) 


— 1^10+ 1^01 

inkT = P(riiIio+n2^oi) + W^i)^+ip2(«0^ 

m = mio + nioi + K^o . xi[ + Kq^ . + pT(%ui + 

where $\mathbf{u}_1' = \mathbf{u}_1 - \mathbf{u}$, $\mathbf{u}_2' = \mathbf{u}_2 - \mathbf{u}$. 


(42) 

(43) 


With these relations between $\mathsf{K}$, $T$, $\mathbf{m}$ and $\mathsf{K}_{10}$, $T_{10}$, $\mathbf{m}_{10}$, etc., it can be shown that 
the conservation equations (13), (24) and (30) reduce for $h_1 = 1$, $h_2 = 0$ and 
$h_1 = 0$, $h_2 = 1$ to equations similar in form to the corresponding equations in a gas mixture 
(Chapman & Cowling 1939); these are 


where 


d 

dt^~ 

^1' = 
dt 


3 1 3 . 1/ , ^ 


-U.r- + 

9x 


^ 1 , (»,u;+»,i4)-55(k 4"+4-“) 


+ 3^ {(Pl% • +/^2’^2 • ^ 2 ) (^1 + ‘^2)}> 

Pit]i = %Fi-^.Li, Fi = F<i), 

3i = j«2o ^ • (“iS -«i) d-xf'i + |»ii ^ - (Ud -%) 


(44) 


(46) 


%i stands for t) and ii§[^ is the average velocity of a molecule of the first 

kind at x^^^ in the presence of a molecule of the second kind at X 2 

To compare the equations in (44) with the corresponding equations in the theory 
of gas mixtures, one notices that the part played by the divergence of the partial 
pressure corresponds to the rate of change of momentum of a molecule in a gas 
mixture due to collisions with other molecules. The condition that the total 
momentum of the molecules in a small volume element be unaltered by collisions among 



them — a consequence of binary encounter and the hypothesis of molecular chaos — 
is no longer satisfied in the general case, but the rate of change of momentum is 
represented by the quantity , 


^ • Li+ ^ . La = jJ^7iao<ixf + + {J + J , 

which does not vanish owing to the interaction of a large number of molecules other 
than those under immediate consideration. 

Similarly, the part played by the potential transport of thermal energy corresponds 
to the rate of change of the thermal energy of a given molecule due to collisions 
with other molecules. The condition that the total energy of the molecules in a small 
volume element be unaltered by collisions among them (again the consequence of 
binary encounters and the hypothesis of molecular chaos) is no longer satisfied in 
the general case, but the rate of change of thermal energy is represented by the 
quantity ( /d \ / 3 \ i 


which does not vanish owing to the interaction of a large number of molecules other than 
those at the point considered. 


5. Expansion of the equation of motion 


The equation of motion (9) as it stands possesses solutions far too general for our 
purpose. Following Born & Green, we shall in the following expand equation (9) 
in powers of the gradients of certain physical parameters describing the macroscopic 
state of a liquid mixture. Improvement has been made on the choice of parameters; 
the potential energies of the two types of molecules in the external force field ($E_1$, $E_2$), 
instead of the external forces ($\mathbf{F}_1$, $\mathbf{F}_2$), are taken to be on equal footing with the partial 
densities, the temperature and the mass-motion velocity, in order that the force 
diffusion should follow naturally after expansion. 

The normal solution which we require deviates but slightly from the equilibrium 
solution and therefore can be separated into parts 

$f_{h_1h_2} = f^0_{h_1h_2} + f'_{h_1h_2} + f''_{h_1h_2} + \ldots$ 

such that $f^0_{h_1h_2}$, the zeroth approximation to $f_{h_1h_2}$, is that equilibrium solution for 
a homogeneous fluid with properly chosen constant parameters which fits best the 
$f_{h_1h_2}$ in the region considered; $f'_{h_1h_2}$, the difference of the first approximation to 
$f_{h_1h_2}$ from $f^0_{h_1h_2}$, is the correction for a non-uniform fluid with properly chosen constant 
gradients of the parameters to fit the $f_{h_1h_2}$, etc. A natural and convenient basis 
for choosing the values of these parameters and their gradients is to identify them 
with the values at the centre of gravity of the set $h_1, h_2$. Hence we postulate 


fhj,hz — ^h^hz 


g(i7) y(i) yO) ^ ^ 

iri ,r 2 ,s 


dXi dXjf. 

9x 



where the capital $F$ is to denote the new functional dependence. $F^0_{h_1h_2}$ does not 
involve any gradients, $F'_{h_1h_2}$ involves gradients of the first order and first degree, 




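$F''_{h_1h_2}$ involves gradients of the second order and first degree and also gradients of 
the first order to the second degree, etc.; $\lambda_J$, $\partial\lambda_J/\partial\mathbf{x}$, $\partial^2\lambda_J/\partial\mathbf{x}\,\partial\mathbf{x}$, etc. ($J = 1, 2, \ldots$) 
represent the values of $n_1$, $n_2$, $\mathbf{u}$, $T$, $E_1$, $E_2$ and their space gradients of all orders at 
the mass centre $\mathbf{x} = \left(m_1\sum_i \mathbf{x}^{(i)} + m_2\sum_j \mathbf{x}^{(j)}\right)/(h_1 m_1 + h_2 m_2)$ of the set $h_1, h_2$. The 
quantities $\mathbf{r}^{(ik)}$, $\mathbf{r}^{(jl)}$ and $\mathbf{s}^{(ij)}$ are the distances between molecules, defined by $\mathbf{r}^{(ik)} = \mathbf{x}^{(i)} - \mathbf{x}^{(k)}$, 
$\mathbf{r}^{(jl)} = \mathbf{x}^{(j)} - \mathbf{x}^{(l)}$ and $\mathbf{s}^{(ij)} = \mathbf{x}^{(i)} - \mathbf{x}^{(j)}$. That the dependence of $F_{h_1h_2}$ on $\mathbf{x}^{(i)}$, $\mathbf{x}^{(j)}$ appears 
in the combinations $\mathbf{r}^{(ik)}$, $\mathbf{r}^{(jl)}$, $\mathbf{s}^{(ij)}$ is suggested by the fact that intermolecular forces 
can be represented as functions of the distance between the mass centres of the pair 
of molecules (at least for mono-atomic substances). As a consequence, one has 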


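The dependence of $F_{h_1h_2}$ on $\mathbf{x}$ and $t$ is implicit in $\lambda_J$, $\partial\lambda_J/\partial\mathbf{x}$, etc. As the $\lambda_J$ include $\mathbf{u}$, 
the dependence of $F_{h_1h_2}$ on $\boldsymbol{\xi}^{(i)}$ and $\boldsymbol{\xi}^{(j)}$ can be expressed in terms of $\mathbf{v}^{(i)} = \boldsymbol{\xi}^{(i)} - \mathbf{u}$ and 
$\mathbf{v}^{(j)} = \boldsymbol{\xi}^{(j)} - \mathbf{u}$. 

The above considerations hold as well for all quantities derivable from $f_{h_1h_2}$, such 
as $N_{h_1h_2}$, $\mathsf{K}_{h_1h_2}$, $\mathsf{P}_{h_1h_2}$, etc. (A capital letter is used in each case to indicate its 
new functional dependence.) Also the time derivative $\partial F_{h_1h_2}/\partial t$ can be 
expanded in a similar way provided 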


dXj _ /3Aj\° /0AiY /3A^Y , 
0f"\0</ \3«/ \3*/ 


(49) 


is known. It will be shown in the following that $(\partial\lambda_J/\partial t)^0 = 0$ for all $\lambda_J$, and that $(\partial\lambda_J/\partial t)'$ 
can be calculated once the zeroth-order solution is known. 

When $F^0_{h_1h_2}$, $F'_{h_1h_2}$ are found, any of the quantities derivable from $f_{h_1h_2}$ can be 
calculated to the corresponding degree of approximation, for example, 

K.k,-jtLAv 

KiiX - 


I (60) 


XkiT - 
^hih2- 


Xki) . 
'“AiAa ■ 


with similar expressions for 




It may be mentioned here that since $F_{h_1h_2}$, $N_{h_1h_2}$, etc., depend on $\lambda_J$ at the mass 
centre of the set $h_1, h_2$, whenever they are integrated over the co-ordinates of one 
or more of the molecules in the set, they should be expanded inside the integral 
with respect to the new mass centre of the reduced set, e.g. 

[%x+i, j+di • 

(51) 

where dj = ^^^(x5i^+i^-x), M = (62) 

the curly bracket { } is used to indicate that quantities inside it depend on $\lambda_J$ at the 
new mass centre after integration. 

With these remarks about the method of expansion, we can start to expand 
equation (9). The equation of motion for $F^0_{h_1h_2}$ is 

+ JJ[Ffe+« W] = 0, (53) 


J 9s4« J ex<^ ( 


since the (9Aj/3f),’s in (44) involve no terms not containing 9Aj/0x. The term 

[^(^Aa), in (53) reduces to - vi® . ^ + S of (^8). 

(53) is then seen to be satisfied by the homogeneous equilibrium solution known from 
statistical mechanics 



Substituting (54) in (53), one obtains the typical equation after separating equations 
vanishing independently 


^ a4« +J a4« -n^T ' 


84 “ <“> 

which is satisfied by 


where 


(56) 

(57) 


One sees here that $N^0_{h_1h_2}$ can only depend on $T$, $n_1$ and $n_2$, but not on $\mathbf{u}$, 
as a constant motion of the whole assembly can obviously have no effect on it. 
By means of (50), one obtains 


<”a. = u = u|)"s„ v(^"ft. = v®, = 

= m^^=0, etc. 


( 58 ) 




In particular, for $h_1 = 1$, $h_2 = 0$ or $h_1 = 0$, $h_2 = 1$, one has 

nl = %, u5 = u = u|, Tl = T = rij 
= m? = 0 = m§, etc. J 

The time derivatives $(\partial\lambda_J/\partial t)'$ may now be calculated: 

(I”)'— 

po=K'>+Lo, 

K“= Kl+Kl = nkT'[, 

LP =11+11 

Pl^i + P2^2 + 




(59) 


(60) 


{nkT+iT^l) 


(61) 


cmfeTg^.u, <r 

. = iKf-f 

The last three equations in (61) will be proved presently. It has to be pointed out 
that $\mathsf{L}^0_1$ is not the whole of the partial potential pressure to the zeroth order but 
only part of it; the divergence of the rest, which is chiefly responsible for mutual 

diffusion in non-uniform fluids, is This does not vanish because N'^ is 

not symmetric with respect to the different molecules concerned. The sum {N'n + N'n) 
is, however, symmetric with respect to the two molecules concerned, hence 

j{Nk+ mf^ds = 0 

and $\mathsf{L}^0$ does not contain terms of this type. 

With the help of (60), one can investigate 

to the zeroth order by expanding (13), (24) and (30). They are all found to vanish. 





The expansion of leads to (55). The expansion of may be used 

to obtain the last three expressions in (61), thus: 

+„ . l^±.+^ 1\ ^0 

\3x 0«i^3x 8%2^3 x dT/ 

Combining the above two equations, one finds 

For $h_1 = 2$, $h_2 = 0$, (62) becomes 

S-.mcuS'-t®-)} - ri™4.u-^(6j+6;)^, (63) 


where 




Multiplying (63) by and integrating over $\mathbf{r}$, one obtains 

||iii4-(^§o(Ur-Ua>')}dr, 

= -I = - J^^.(Jvo,(Ug>'-u)}cZx?) 


where 


Similarly, it can be shown that 











where 


re 

Adding (70) and (71) together, one obtains 


Hence it follows that 
where 


(di+50 = -crnkT^.u, 


<r = 


VI 




J = /tt)+j(a+/a)+4» 

a 


The last three relations in (61) are thus proved. 

To obtain the equation of motion for $F'_{h_1h_2}$, we first write down the first-order terms 
of (9) after expansion: 

/ 9^2/, ^/^3 \/ T 9-yRl^. , Q 

-\K{\hi),fj^^-h^ (“"^S/’Kax dn^ a»2 


9 ^ ■' ay/ T ^Nhiht , Q 

yin, dT — 




T j 


jpa 

[W{hh) + E{h^h,),h,nJ' = + 

(J Ax+l.A,]^x^+«<i5^+") 

= n.^.x+d2. s 


where 


^ Jf-l-mi' • 


i J 

Q = K^h S ’”•2 S 4^) . dj = 

Jf = A = Ai4-»W2- 




Combining the above results and rearranging terms, one obtains 

, G ( ^ ^ I ^ ^ ATO ^ /^% ^ StIo 9 \ .r 




SMkTj 


-[II'(W.^'i.».l+JJ[nS‘>,(i'w.,ii.+<i,.2^Aj^^j^^jJ^x^+»dgk+i) 

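where $G_1 = \sum_i m_1 v^{(i)2}$, with $G_2$ and $G$ defined correspondingly, and $\mathsf{b}$ is the symmetrical, 
non-divergent part of $\partial\mathbf{u}/\partial\mathbf{x}$. For $h_1 = 1$, $h_2 = 0$, equation (72) becomes 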

pn f 1 3wx 1 3ra 1 /8% 3 Sjio 9 \ _J m 

^MPi9x pax pifeTl3x^i+^^)-^j-”^i'^i+-^2j^ViVi:b 

+ n I -^iKvf 5 p-wmj miaf^oja^r , /^p- x 

n V)3 x-"+t(2^-2+-7 

whJj3^«‘a§Lw{'^2®'^ 2 

equation, after substituting the known zeroth-order solutions 
and performing the integration, becomes 

yf^J_] ro ^1 L\ (miv^ 5\3r 
nikT' yXdxdXj ^ TnjkT\2kT 

_F»-:p-teW'v • b F" i\ 9 

'ih.kTXkTJ 1 ■^^»ife3'\3ife2’ 73x'“' 


( 74 ) 




Combining (73) and (74), one has 

where $\mathbf{a} = \mathbf{F}_2/m_2 - \mathbf{F}_1/m_1$ is the difference of acceleration of the two kinds of molecules 
in the external force field. 

A similar equation can by symmetry be written down for the molecules of the second kind. 
Equation (75) is an exact analogue of the corresponding equation in gas theory (cf. Chapman 
& Cowling 1939). Though (75) is successful in expressing the rate of change of the 
singlet distribution function in terms of the pair distribution functions, corresponding to the 
collision integral in the gas theory, yet in contrast to the latter, where the statistical 
effect of the predominant type of interaction (binary encounters) has been explicitly taken 
into account, it has only a formal value as it stands. However, there are two possible ways of 
solving the problem. The first is to proceed to equations containing more than one molecule 
in the set $h_1, h_2$ and use approximations similar to Kirkwood's approximation to the 
triple distribution function in terms of the pair distribution functions, thus obtaining 
closed equations; but the complications that arise for a mixture are likely to be 
considerable. The second and much more practical way of approach consists in 
modifying the binary encounter in a manner to be explained in the following section. 


6. Discussion of the method of solution 
The assumption of binary encounters, to which the whole theory of gases owes its 
extensive development, loses its validity as the density increases. More refined 
considerations than the binary encounter must then be 
introduced; as the consideration of multiple encounters is impracticable if not 
impossible, we propose here to modify the binary encounter assumption so as to take 
into account the influence of the presence of the other molecules. We assume that it 
is possible to account for the average relative trajectory of a pair of molecules in 
a liquid during a period of intense interaction with each other by means of an 
apparent potential between the pair. This assumption is suggested in the 
equilibrium state by the fact that the equilibrium pair distribution function $f^0_{20}$ can be 
written in the form 

I . — 


/§0-»20(^)i 





which serves to define the apparent potential $\phi^*$. From this equation, one sees that 
the distribution function $n_{20}$ is attributable to an apparent potential between 
the pair which is independent of their velocities. From (55), we have for $h_1 = 2$, $h_2 = 0$ 





M 


from which $\phi^*$ can be found with the help of Kirkwood's approximation (Rodriguez 
1949). 
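
The idea of an apparent potential can be illustrated with the familiar potential of mean force. The sketch below assumes that the defining relation has the form $n_{20}(r) = n_1^2 \exp(-\phi^*/kT)$, so that $\phi^* = -kT \ln[n_{20}(r)/n_1^2]$; this form, the function name and the sample numbers are assumptions made for illustration and are not taken from the paper.

```python
import math

BOLTZMANN = 1.380649e-23  # J/K


def apparent_potential(pair_density, number_density, temperature):
    """phi* = -kT ln(n20 / n1^2) under the assumed defining relation."""
    ratio = pair_density / number_density ** 2
    if ratio <= 0.0:
        return math.inf  # pair never found at this separation: infinite repulsion
    return -BOLTZMANN * temperature * math.log(ratio)


n1 = 2.0e28  # hypothetical number density, m^-3
for g in (0.5, 1.0, 2.0):  # hypothetical values of n20 / n1^2
    print(g, apparent_potential(g * n1 ** 2, n1, temperature=300.0))
```

Under this assumption, separations at which pairs are found more often than in an uncorrelated fluid correspond to a negative (attractive) apparent potential, and depleted separations to a positive one.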

The trajectory thus calculated with the use of $\phi^*$ would represent the average 
relative trajectory of the pair in the presence of the other molecules in the 
equilibrium state for each configuration of approach specified by the impact parameter 
$b$ and the relative velocity $g$. Though in the non-equilibrium state the 
distortion of the distribution function may give rise to modifications of the apparent 
potential, we shall, as a first approximation, use the $\phi^*$ defined above to calculate 
the average trajectory of a pair in a non-equilibrium fluid. 
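
The calculation of a relative trajectory from a central apparent potential can be sketched numerically. The example below is only an illustration: it uses a Lennard-Jones potential in reduced units as a hypothetical stand-in for $\phi^*$ (the paper does not specify its form), unit reduced mass, and a velocity-Verlet integrator; the function names, the starting radius `r0` (playing the role of a radius of influence) and the step size are all assumptions.

```python
import math


def lj_potential(r):
    """Lennard-Jones potential in reduced units (hypothetical stand-in for phi*)."""
    sr6 = r ** -6
    return 4.0 * (sr6 * sr6 - sr6)


def lj_force_over_r(r):
    """Radial force divided by r, so the force vector is this value times (x, y)."""
    sr6 = r ** -6
    return 24.0 * (2.0 * sr6 * sr6 - sr6) / (r * r)


def relative_trajectory(b, g, r0=5.0, mu=1.0, dt=1.0e-3, max_steps=200_000):
    """Integrate the relative motion of a pair with impact parameter b and
    relative speed g from entry into the sphere of radius r0 until exit.
    Returns a list of (x, y, vx, vy) states in the plane of motion."""
    x, y = -math.sqrt(r0 * r0 - b * b), b  # entry point on the sphere
    vx, vy = g, 0.0                        # approach along +x
    f = lj_force_over_r(math.hypot(x, y))
    ax, ay = f * x / mu, f * y / mu
    path = [(x, y, vx, vy)]
    for _ in range(max_steps):
        vx += 0.5 * dt * ax                # first half kick
        vy += 0.5 * dt * ay
        x += dt * vx                       # drift
        y += dt * vy
        r = math.hypot(x, y)
        f = lj_force_over_r(r)
        ax, ay = f * x / mu, f * y / mu
        vx += 0.5 * dt * ax                # second half kick
        vy += 0.5 * dt * ay
        path.append((x, y, vx, vy))
        if r > r0:                         # the pair has left the sphere
            break
    return path


def energy(state, mu=1.0):
    x, y, vx, vy = state
    return 0.5 * mu * (vx * vx + vy * vy) + lj_potential(math.hypot(x, y))


def angular_momentum(state, mu=1.0):
    x, y, vx, vy = state
    return mu * (x * vy - y * vx)


path = relative_trajectory(b=1.0, g=1.0)
print(len(path), energy(path[0]), energy(path[-1]))
```

Comparing the energy and angular momentum at entry and exit is a simple check on the integration; the velocities at the two ends of the path are the quantities an encounter calculation of this kind would pass on.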

The use of $\phi^*$ enables one to reduce a many-body problem to a two-body problem 
in an approximate manner. The right-hand side of (75) can now be transformed, 
the typical integral being of the form 



JJ 




The integration over $\mathbf{r}$ needs only to extend over a sphere of influence of radius 
$R_0$ within which $\phi^*$ is appreciable. We shall first express $f_{20}$ in terms of a product of 
two $f_1$'s for the two molecules concerned. At time $t$ let the velocity of a molecule at 
$\mathbf{x}^{(1)}$ be $\boldsymbol{\xi}^{(1)}$ and that of a molecule at $\mathbf{x}^{(2)}$ be $\boldsymbol{\xi}^{(2)}$. Using $\phi^*$, one can calculate the 
trajectory of a molecule at $\mathbf{x}^{(2)}$ within the sphere of influence $R_0$ relative to the one at 
$\mathbf{x}^{(1)}$ for every given relative velocity $g$. In particular, one can calculate 
the position and velocity co-ordinates of the pair at time $t'$, 
when they last approached within distance $R_0$ of each other, as functions of their 
position and velocity co-ordinates at time $t$. (In fact, only the velocities at $t'$ 
are required, as will be seen presently.) 

One can then write as the limiting form of $f_{20}$ 




The first equality holds because, by using $\phi^*$, etc., Liouville's theorem may be 
assumed to hold for the pair, and the second equality follows from an application 
of the law of probability governing independent events. Each of the $f_1$'s on the 
right-hand side of the above equation can be expanded in the manner 

$f_1 = f_1^0 + f_1' + \ldots$ 

where $f_1'$ represents the deviation from a Maxwellian distribution. We have then 
for $f_{20}$ after expansion 








The first term of ) which equals 

already been taken into account in (74) in a different manner. For the factor 

in (75), one has then 

{FU = FS(v<«\A,)Fi(vf + 


(76) 


where = §i«-u(x?5), V®' = 5p-u(x?>) and the Aj’s and 0Aj/ax’s always refer to 
the value at xi® as indicated by the curly bracket. 

The right-hand side of (75) now becomes 



where Tf>' - 5f’'-n{3^’) and vf - ??>■-»(*?’) ^ s, and 5, 

calculated from an apparent potential between a pair of different molecules in 
the same manner in which the corresponding velocities for a pair of like molecules are obtained. 

A closed integral equation for $F_1'$ is thus obtained. Instead of solving for $F_1'$, one 
needs only to find its first and second moments in the velocity for the purpose of 
calculating the coefficients of diffusion and viscosity. The problem now consists in 
first finding the apparent potentials $\phi^*$, etc., then calculating the trajectory and 
expressing the velocities at $t'$ as functions of the positions and velocities at $t$, 
and finally solving the integral equation (75) with its right-hand side replaced by 
(75'). In this way, one can regard (75) as a generalization of Boltzmann's equation 
in gas theory. The solution of this integral equation remains to be completed. 


7. General expressions for the coefficients of ordinary and thermal 
diffusion and of viscosity in liquid mixtures 

The quantities required for calculating the coefficients of ordinary and thermal 
diffusion are $F_1'$ and $F_2'$; those for calculating the coefficient of viscosity are $F_1'$, $F_2'$, 
$N'_{20}$, $N'_{11}$ and $N'_{02}$. From quite general considerations the form of the above quantities 
can be determined. Consider first 
$F_1'$ and $F_2'$: they are linear in $\partial\lambda_J/\partial\mathbf{x}$ and satisfy the conditions 

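$$n_1' = \int F_1'\,d\mathbf{v}_1 = 0, \qquad n_2' = \int F_2'\,d\mathbf{v}_2 = 0,$$

$$\rho\mathbf{u}' = \rho_1\mathbf{u}_1' + \rho_2\mathbf{u}_2' = m_1\int F_1'\mathbf{v}_1\,d\mathbf{v}_1 + m_2\int F_2'\mathbf{v}_2\,d\mathbf{v}_2 = 0,$$

$$\tfrac12\left(m_1\int F_1' v_1^2\,d\mathbf{v}_1 + m_2\int F_2' v_2^2\,d\mathbf{v}_2\right) = 0,$$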





which require that any further correction to the velocity distributions $F_1^0$ and $F_2^0$ 
shall not alter the values of the local parameters $\lambda_J$. (For the present purpose, the 
presence of an external force field will not be considered.) Hence $F_1'$ and $F_2'$ must be 
of the form 


-Fi = b, 


0x ■ 

^+a,(2)v ^ 


0X 

dT 


iVi: b, 


(78) 


where the q>iS are functions of Vi, %, ^2 functions of Vg, n^, and T, 

and 

m 

Next consider $N'_{20}$: it is linear in $\partial\lambda_J/\partial\mathbf{x}$, even in $\mathbf{r}$ and satisfies $\int N'_{20}\,d\mathbf{r} = 0$. 
Hence it must be of the form 


(pfv\dv2. = 0 (i = 1, 2, 3). 


(79) 




( 80 ) 


where is a function of ^2 Similarly, one can write down for 

(81) 

The forms of $N'_{11}$ and $\bar{N}'_{11}$ are, however, not so simple, since neither of them is 
even in the relative position vector or symmetrical with respect to the two molecules 
concerned; but the sum $(N'_{11} + \bar{N}'_{11})$ is an even function of $\mathbf{s}$ and the 
difference $(N'_{11} - \bar{N}'_{11})$ an odd function of $\mathbf{s}$. Hence 

iV^n+iV^li = 24 >SxSx:b, 




It follows from the above equations that 


•^11 — ^ + + SiSi:b, 


ax ' '““"'ax 

Nix = i'u^®2-^+''u’®2-^+J^u?®2-^+r$¥s2S2; b, 


(82) 


where $\mathbf{s}_2 = -\mathbf{s}_1$. 

With these general expressions in hand, it is easy to find the coefficients of diffusion 
and viscosity in terms of them. Consider diffusion first: the mutual diffusion velocity 

“ , , CFi , fj' ^ 


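As the coefficients of diffusion are measured under constant pressure, it is convenient 
first to transform the gradients $\partial n_1/\partial\mathbf{x}$, $\partial n_2/\partial\mathbf{x}$ and $\partial T/\partial\mathbf{x}$ appearing in the formal 
expressions of $F_1'$ and $F_2'$ to $\partial c/\partial\mathbf{x}$, $\partial p/\partial\mathbf{x}$ and $\partial T/\partial\mathbf{x}$, according to (19) in part I 
of the present paper with $p$ given by $P^0$ in (61). Thus 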


F[ = Vi . #ViVi : 




C83) 





where 






■1 ^pO 


9%/’ 


0(3) _ 0,(3) _ , 

-9’i 321 


(84) 




A similar transformation is applied to $F_2'$. The mutual diffusion velocity $(\mathbf{u}_1 - \mathbf{u}_2)$ 
can be expressed as 

where the term involving $\partial p/\partial\mathbf{x}$ can be dropped under the assumption of constant 
pressure. Upon comparing (85) with the usual definition of the coefficient of ordinary 
diffusion $D_0$ and thermal diffusion $D_T$ 

, , if) 

onefli* 1 

Next consider viscosity: the pressure tensor in the presence of linear gradients 
$\partial\lambda_J/\partial\mathbf{x}$ is $\mathsf{p} = \mathsf{p}^0 + \mathsf{p}'$, where $\mathsf{p}^0$ is given by (61) and $\mathsf{p}'$ by 

p' = K' + L', 

K' = mijDiViVidVi+majj’^VoVadVa 
= ■^|mij<)4*^ufdvi+maj93i*^^)|dv2| b, 

where the relations (50), (46), (83), (80) and (82)* have been used. 


( 87 ) 




According to the definition of the coefficient of viscosity $\mu$ 

$\mathsf{p}' = -2\mu\mathsf{b}$, (88) 

one has therefore 




(89) 


where the first part is the potential viscosity and the second part the kinetic 
viscosity, the former playing a predominant role in liquid mixtures. As mentioned in 
§ 2 of part I, diffusion is certainly the simplest transport process of all, as it involves 
only the kinetic transfer of number densities. The present calculation shows that 
only $F_1'$ and $F_2'$ are required for obtaining the coefficients of diffusion. The practical 
method of solution for (75) discussed in § 6 indicates a possible way of solution to the 
problem of diffusion in liquid mixtures which has hitherto been regarded as almost 
impossible. 
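
The defining relation (88) can be made concrete with a short numerical sketch. It assumes that $\mathsf{b}$ is the symmetric, traceless ("non-divergent") part of the velocity gradient, as stated in § 5, and that the gradient is stored as `grad_u[i][j]` $= \partial u_j/\partial x_i$; the function names and the simple-shear sample are hypothetical.

```python
def symmetric_traceless(grad_u):
    """Symmetric, traceless part of a 3x3 velocity-gradient tensor."""
    trace = sum(grad_u[i][i] for i in range(3))
    return [
        [
            0.5 * (grad_u[i][j] + grad_u[j][i]) - (trace / 3.0 if i == j else 0.0)
            for j in range(3)
        ]
        for i in range(3)
    ]


def viscous_pressure(grad_u, mu):
    """First-order pressure tensor p' = -2 * mu * b, following relation (88)."""
    b = symmetric_traceless(grad_u)
    return [[-2.0 * mu * b[i][j] for j in range(3)] for i in range(3)]


# Hypothetical simple shear: u_y varies linearly with x, all other gradients zero.
shear = [[0.0, 2.0, 0.0],
         [0.0, 0.0, 0.0],
         [0.0, 0.0, 0.0]]
p_prime = viscous_pressure(shear, mu=0.5)
print(p_prime[0][1], p_prime[1][0])  # equal off-diagonal shear stresses
```

Because $\mathsf{b}$ is traceless, the resulting $\mathsf{p}'$ carries no isotropic part; any change in the scalar pressure has to come from the zeroth-order tensor.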


The author wishes to express his thanks to Professor M. Born for his interest and 
encouragement and to Dr H. S. Green for many helpful suggestions and discussions 
and for reading the manuscript. 


References 

Born, M. & Green, H. S. 1946 Proc. Roy. Soc. A, 188, 10. 

Born, M. & Green, H. S. 1947a Proc. Roy. Soc. A, 190, 455. 

Born, M. & Green, H. S. 1947b Proc. Roy. Soc. A, 191, 168. 

Chapman, S. & Cowling, T. G. 1939 The mathematical theory of non-uniform gases. Cambridge 
University Press. 

Frenkel, J. 1946 Kinetic theory of liquids. Oxford University Press. 

Kirkwood, J. G. 1946 J. Chem. Phys. 14, 180. 

Kirkwood, J. G. 1947 J. Chem. Phys. 15, 72. 

Rodriguez, E. A. 1949 Proc. Roy. Soc. A, 196, 73-92. 



The behaviour of continuous stanchions 


By J. F. Baker, M. R. Horne and J. W. Roderick 

Department of Engineering, University of Cambridge 

(Communicated by Sir Geoffrey Taylor, F.R.S. Received 17 December 1948. 
Revised 14 March 1949) 


A description is given of tests carried out on small-scale steel stanchions of rectangular and 
I-section subjected to the arrangements of load encountered in building frames. Two main 
types of loading are distinguished according to whether the stanchion is bent in single or 
double curvature. A theoretical explanation of the results observed is sought by reference to 
the simple plastic theory in which it is assumed that sections plane before bending remain 
plane after bending. The theory of members subjected to combined bending and axial load 
in the partially plastic range is developed, and is applied to the case of single curvature 
stanchions. The growth of the plastic zones is traced up to the stage at which complete 
plasticity occurs at three sections in the stanchion, and satisfactory agreement is obtained 
between the theoretical and observed collapse load. When the simple plastic theory is applied 
to double-curvature bending, inaccuracies arise in certain cases due to strain reduction in 
the plastic zones, and the simple theory is therefore elaborated to take account of the 
irreversible nature of plastic strains. The improved theory is then applied to appropriate cases of 
double-curvature bending. 

1. Introduction 

Despite the fact that pin-ended struts are rarely used in practical structures, in- 
vestigators have confined their attention almost entirely to the behaviour of this 
simple form of compression member. Various attempts have been made to apply 
the knowledge thus gained to the design of the more usual member which is con- 
tinuous through a number of stories or panels. In Great Britain it is usual to treat 
each length between floors or panel points as equivalent to a pin-ended strut of 
some effective length depending on the rigidity of the members connected to the 
continuous compression member. This has been shown to be irrational (Baker 
1934). A full description of the loading conditions to which a continuous stanchion 
in a building frame can be subjected, and a rational method of design, have been 
given in the Final Report of the Steel Structures Research Committee (Baker & Holder 
1936; Baker & Williams 1936). That investigation was, however, confined to the 
elastic range; little or nothing is known of the conditions under which collapse occurs. 

An account is given here of an experimental investigation on small-scale steel 
frames with rigidly connected members to study the collapse of stanchions under 
various conditions of loading. A description is also given of an analysis which enables 
the behaviour of the stanchion to be followed, not only in the elastic range, but also 
in the plastic range right up to collapse. 

2. Tests on stanchions bent in single and double curvature 

In general, a stanchion length in a building frame will be subjected to axial load 
and end-bending moments. In the majority of cases the end-moments will be of the 
same sense, so bending the stanchion in double curvature, but in some cases the 
end-moments can be of opposite sense, so bending the length in single curvature. In the 
tests carried out these conditions have been simulated in the frames shown in figure 1. 



Figure 1. Small-scale test frames for stanchions bent in 
(a) single and (b) double curvature. 

When the beams are loaded the stanchion at a is bent in single curvature and that 
at b in double curvature. These frames were tested in special loading gear, in which 
a given load was first applied to each beam. The test was completed by applying to 
the stanchion an increasing axial load until collapse occurred. To ensure that failure 
was confined to the stanchion the beams of each test frame were of heavy rectangular 
section, IJin. deep x f in. wide, and were of high tensile steel. The stanchions them- 
selves were of normalized mild steel, the cross-sectional dimensions of each of the 
four types tested being shown in figure 2. 



Figure 2. Stanchion sections. 


The test results for rectangular and I-section stanchions were similar in character. 
Typical load-deflexion curves are shown at a and b, figure 3, for the heavier stanchions 
of rectangular section tested in single and in double curvature respectively. At a the 
central deflexion of the stanchion relative to its ends has been plotted against axial 
load, and for the double-curvature case b, the deflexions are those at the quarter 
points. In both cases the range OA represents the deformation produced by beam 
loading only, and that beyond by the additional axial loading. In figure 4 values of 




the axial load to cause collapse (curve 1) and those to develop an extreme fibre 
stress equal to the lower yield value for the material (curve 2) have been plotted 
against beam loads for those frames having stanchions of heavier I-section. The 
curves at a refer to the single-curvature tests and those at b to the double-curvature 
tests. The horizontal line (e.g. at 8.03 tons in a) represents the load which would 



Figure 3. Typical load-deflexion curves for stanchions of IJ x f in. rectangular section 
subjected to (a) single- and (b) double-curvature bending. 



Figure 4. Relationship between axial load in stanchion and total load on each beam for 
stanchions of heavy I-section, subjected to (a) single- and (b) double-curvature bending. 
Curve 1, axial load at collapse. Curve 2, axial load at beginning of yield. 

produce failure of the stanchion in pure compression, the corresponding lower yield 
stress being marked on curve 2. In both single and double curvature, particularly 
in the latter case, as the frames were subjected to heavier beam loads, the axial load 
to develop lower yield stress fell away much more rapidly than the collapse load. 
This is of particular significance in the design of rigidly jointed structures, since 
curve 2 represents the axial load which is usually regarded as the safe limit for 


a continuous stanchion. For instance, from the curves at b, for a frame carrying 
a beam load of 2.5 tons this limiting axial load is 0.44 ton, whereas the stanchion 
actually withstood a load approximately 17 times greater. 

3. The equations of bending for a rectangular stanchion 
stressed beyond the yield point 

Before the collapse load of a continuous stanchion can be determined analytically 
it is necessary to know the distribution of stress across a section produced by bending 
moment and axial load of sufficient magnitude to cause yielding (Roderick 1948). Let



Figure 5. Idealized stress-strain relationships of mild steel.



load for the elastic state (figure 6) and various degrees of plasticity (figures 7 to 9). 




the stress-strain relationship for the material be that given in figure 5a, where $f_U$ and
$f_L$ represent the upper and lower yield stresses respectively. Referring to figure 6,
consider the section a of a member subjected to an axial compressive force $P$ and
bending moment $M$; then for elastic conditions,

$$P = pA \tag{1}$$

and

$$M = fZ, \tag{2}$$

where $A$ is the area, and $Z$ the modulus of section. The resultant stress distribution
b is therefore made up of a uniform compressive distribution c and one of varying
bending stress d having an extreme value $f$. It is evident from the resultant distribution
that yield will occur first at the extreme fibre where the stress is $f + p$. It is
assumed that, on increasing the straining action beyond the point where yield occurs,
plane sections remain plane and that each fibre, as it reaches the upper yield stress
$f_U$, drops back to the lower yield value $f_L$ and thereafter remains constant, the
contention being that the fibres are never sufficiently extended to cause strain hardening.
Accordingly, as in figure 7, after yield, a resultant stress distribution b is obtained
where the material at the one extreme fibre is plastic while at the other the stress,
having decreased, eventually changes sign. If now the axial stress is deducted
from the total distribution, the bending stress distribution d is obtained. The
condition represented by the diagrams of figure 7 ceases to apply when the resultant
stress $f_t - p$ eventually reaches the upper yield value for the material, so giving rise
to the distributions shown in figure 8. This is the last stage in the development of the
moment of resistance, and in the course of it more and more fibres on both the tension
and compression sides become plastic
until finally the whole section is in a state of plasticity (figure 9).

In expressing the stress distributions of figures 7 to 9 analytically, various
conditions have to be satisfied. The total axial load $P$ acting at the section is balanced
by the uniform stress distribution shown in figures c, and three equations similar to
equation (1) are obtained. The stress distributions in figures d therefore represent
bending moment only, and equating the total axial force to zero,

$$d_e\{(f_L+p)A_t-(f_L-p)A_c\}=(f_U-p)\{A_{ec}d_c-A_{et}d_t\}, \tag{3}$$

where $p$ = mean axial stress,

$A_t$, $A_{et}$ = area of plastic and elastic regions respectively on the tension side,

$A_c$, $A_{ec}$ = area of plastic and elastic regions respectively on the compression side,

$d_t$ = distance of centre of gravity of tensile elastic region from the neutral axis,

and $d_c$ = distance of centre of gravity of compressive elastic region from the neutral axis.


The above equation refers strictly to figure 8d only, but corresponding equations
for figures 7d and 9d can be obtained by omitting appropriate terms.

If the bending moment acting on the section is denoted by $M$, then it follows by
taking moments about the neutral axis in figure 8d that

$$M=(f_U-p)Z_e+(f_L-p)Z_{pc}+(f_L+p)Z_{pt}, \tag{4}$$




where $Z_e = I_e/d_e$, $I_e$ being the second moment of area of the elastic region,

$Z_{pt}$ = the first moment of area of the plastic region on the tension side,
and $Z_{pc}$ = the first moment of area of the plastic region on the compression side.

Again, corresponding equations for figures 7d and 9d may be obtained by the
omission of the appropriate terms.

All the equations given above refer to a member of any section which has an axis
of symmetry in the plane of bending. Attention is hereafter confined to members of
rectangular cross-section. The width will be denoted by $b$ and the depth by $2d$.
Equation (1) becomes

$$P = 2bdp, \tag{5}$$

and is applicable to all stress distributions. For the stress distributions in figure 6,
the expression for the bending moment as given by equation (2) becomes

$$M=\tfrac{2}{3}fbd^2. \tag{6}$$


The stress distribution in figure 7d is adequately defined by the values of $p$ and
$f_t$. Expressed in terms of these variables, equation (3) leads to

$$\frac{d_e}{2d}=\frac{1}{\{\alpha v^2+v+(1-\alpha)\}}, \tag{7}$$

where

$$v=\frac{f_t}{(f_U-p)}$$

and

$$\alpha=\frac{(f_U-p)}{2(f_L-p)}.$$

It is found convenient to introduce a parameter $\phi$ such that

$$\sec\phi=\frac{2\alpha v+1}{2\alpha-1}.$$

It then follows from equation (7) that

$$d_e=\frac{8\alpha d}{(2\alpha-1)^2}\cot^2\phi. \tag{8}$$

By substituting in equation (4) it is ultimately found that

$$M=\frac{(f_U-p)\,bd^2}{3\alpha(2\alpha-1)}\{3(2\alpha-1)(\sec^2\phi-2)\sec^2\phi-4(\sec^2\phi-3)\sec\phi+(6\alpha+5)\}\cot^4\phi. \tag{9}$$
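The closed forms for the primary plastic stage can be checked numerically. The sketch below is an illustration only: it assumes that equations (7) to (9) have the forms $d_e/2d = 1/\{\alpha v^2+v+(1-\alpha)\}$, $d_e = 8\alpha d\cot^2\phi/(2\alpha-1)^2$ and $M = (f_U-p)bd^2\{\dots\}\cot^4\phi/[3\alpha(2\alpha-1)]$, and compares them with a direct summation over the bending stress block of figure 7d. The function names, the default values of `b` and `d`, and the sample stresses are hypothetical choices, not taken from the paper.

```python
import math


def primary_plastic_closed_form(f_U, f_L, p, v, b=1.0, d=1.0):
    """Return (d_e, M) from the assumed forms of equations (8) and (9).

    Requires f_U > f_L > p so that 2*alpha - 1 is positive.
    """
    alpha = (f_U - p) / (2.0 * (f_L - p))
    k = 2.0 * alpha - 1.0
    sec = (2.0 * alpha * v + 1.0) / k          # sec(phi)
    cot2 = 1.0 / (sec * sec - 1.0)             # cot^2(phi)
    d_e = 8.0 * alpha * d * cot2 / k ** 2      # equation (8)
    braces = (3.0 * k * (sec ** 2 - 2.0) * sec ** 2
              - 4.0 * (sec ** 2 - 3.0) * sec
              + (6.0 * alpha + 5.0))
    moment = (f_U - p) * b * d ** 2 / (3.0 * alpha * k) * braces * cot2 ** 2
    return d_e, moment


def primary_plastic_by_summation(f_U, f_L, p, v, b=1.0, d=1.0, n=20000):
    """Sum the bending stresses of figure 7d strip by strip (midpoint rule).

    y is measured from the neutral axis of bending, positive towards the
    compression face. Returns (d_e, net force, moment about neutral axis).
    """
    alpha = (f_U - p) / (2.0 * (f_L - p))
    d_e = 2.0 * d / (alpha * v * v + v + (1.0 - alpha))   # equation (7)
    c = alpha * d_e * (v * v - 1.0)    # depth of compressive plastic zone
    y_lo, y_hi = -v * d_e, d_e + c     # tension face, compression face
    h = (y_hi - y_lo) / n
    force = 0.0
    moment = 0.0
    for i in range(n):
        y = y_lo + (i + 0.5) * h
        # linear elastic stress up to y = d_e, lower yield stress beyond it
        stress = (f_U - p) * y / d_e if y <= d_e else (f_L - p)
        force += stress * b * h
        moment += stress * y * b * h
    return d_e, force, moment


if __name__ == "__main__":
    # Yield stresses quoted in section 4 (tons/sq.in.); p and v are arbitrary.
    print(primary_plastic_closed_form(22.88, 20.32, 5.0, 1.5))
    print(primary_plastic_by_summation(22.88, 20.32, 5.0, 1.5))
```

The net force returned by the summation should vanish to within the strip width, which is the condition expressed by equation (3) for this stage.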

The stress distribution in figure 8 d is adequately defined by the values ofp,dg and 
If equation (3) is expressed in terms of these variables, it is found that 


JL 


P ^ 
ifu-P)'^' 


( 10 ) 


The bending moment given by equation (4) may be expressed in terms of p, dg 
and dgg, and after substitution for dg from equation (10) it is found that 




$$M=\frac{(f_L^2-p^2)}{f_L}bd^2-\frac{f_U^2(3f_L-2f_U)}{3(f_U-p)^2}bd_e^2. \tag{11}$$




The stress distribution in figure 9d corresponds to the condition of full plasticity,
and is the limiting case of figure 8d when $d_e = 0$. The value of the bending moment
at full plasticity is therefore given by substituting $d_e = 0$ in equation (11).

It will be seen from figures 6 to 9 that when a member is subjected to bending
about one principal axis combined with axial load, plastic zones can be formed as
shown in figure 10. In the length AB, to be referred to as the elastic length, the stress



distributions are of the type shown in figure 6b. At section YY yield stress is just
reached in the extreme compression fibres, and hence from figure 6, $f = (f_U - p)$.
The bending moment then has the particular value $M_y$ as given by substituting this
value of $f$ in equation (6). Hence

$$M_y=\tfrac{2}{3}(f_U-p)bd^2. \tag{12}$$

In the length BC, to be referred to as the primary plastic length, the stress
distributions are of the type shown in figure 7b. At section TT, yield stress is just reached in
the extreme tension fibres, and hence from figure 7, $f_t = (f_U + p)$. The bending moment
has the particular value $M_T$ as given by substituting this value of $f_t$ in the expressions
leading to equation (9). It is thus found that

$$M_T=\frac{2}{3}\frac{(f_L-p)}{f_L^2}\{f_U(f_L-p)+3f_Lp\}bd^2. \tag{13}$$

Finally in the length CD, to be referred to as the secondary plastic length, the stress
distributions are of the type shown in figure 8b, until complete plasticity is reached
at section PP. The value of the bending moment at this last section is readily
obtained by substituting $d_e = 0$ in equation (11) and hence

$$M_P=\frac{(f_L^2-p^2)}{f_L}bd^2. \tag{14}$$
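The three limiting moments separate the elastic, primary plastic and secondary plastic lengths, and it can help to evaluate them side by side. The sketch below assumes that equations (12) to (14) take the forms $M_y=\tfrac{2}{3}(f_U-p)bd^2$, $M_T=\tfrac{2}{3}(f_L-p)\{f_U(f_L-p)+3f_Lp\}bd^2/f_L^2$ and $M_P=(f_L^2-p^2)bd^2/f_L$. The function names and the sample values of `p`, `b` and `d` are hypothetical.

```python
def yield_moment(f_U, p, b, d):
    """Moment at first yield in compression, assumed form of equation (12)."""
    return (2.0 / 3.0) * (f_U - p) * b * d ** 2


def tension_yield_moment(f_U, f_L, p, b, d):
    """Moment at first yield in tension, assumed form of equation (13)."""
    return ((2.0 / 3.0) * (f_L - p) / f_L ** 2
            * (f_U * (f_L - p) + 3.0 * f_L * p) * b * d ** 2)


def full_plastic_moment(f_L, p, b, d):
    """Moment at complete plasticity, assumed form of equation (14)."""
    return (f_L ** 2 - p ** 2) / f_L * b * d ** 2


if __name__ == "__main__":
    # Yield stresses from section 4 (tons/sq.in.); unit width and half-depth.
    f_U, f_L, b, d = 22.88, 20.32, 1.0, 1.0
    for p in (0.0, 5.0, 10.0):
        print(p,
              yield_moment(f_U, p, b, d),
              tension_yield_moment(f_U, f_L, p, b, d),
              full_plastic_moment(f_L, p, b, d))
```

With no axial load the two yield moments coincide, since both extreme fibres then reach the upper yield stress together; with $f_U = f_L$ and $p = 0$ the fully plastic moment is 1.5 times the yield moment, the familiar shape factor of a rectangle.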





Having obtained expressions for the moment of resistance at any section, it is
possible to derive slope-deflexion equations, separate sets of equations being
necessary for each of the elastic and plastic lengths. It is convenient to take rectangular
axes such that the Ox axis is coincident with the line of action of the resultant forces
acting on the member. Thus Ox in figure 10 is coincident with the line of action of
those equal and opposite forces which are the resultants of $P$, $M_A$ at A and $P$, $M_D$
at D. The position of the Oy axis is arbitrary. It will be assumed that the shear forces
in the member are everywhere small, so that the angle between Ox and the
longitudinal axis of the member is small. Hence if the radius of curvature $R$ of the centre
line is large compared with the length of the member,

$$\frac{d^2y}{dx^2}=-\frac{1}{R}, \tag{15}$$

where $(x, y)$ are the co-ordinates of a point on the central axis. If $M$ is the bending
moment at any section, then

$$M = Py. \tag{16}$$


Considering first the elastic length, it follows from the stress distribution, figure 6d,
that

$$\frac{1}{R}=\frac{f}{Ed}, \tag{17}$$

where $E$ is the modulus of elasticity. It therefore follows from equations (6), (15),
(16) and (17) that

$$\frac{d^2y}{dx^2}=-\frac{P}{EI}y, \tag{18}$$

where $I$ is the moment of inertia of the cross-section of the beam about the axis of
bending and has the value $\tfrac{2}{3}bd^3$. Let $x$ and $dy/dx$ have the known values $x_1$ and $\theta_1$
at any section $S_1$ in the elastic length, and let the bending moment there be $M_1$.
If equation (18) is integrated and the constants of integration evaluated by reference
to point $S_1$, it is readily shown that at any section,

$$\frac{dy}{dx}=\theta_1\cos\alpha(x-x_1)-\alpha\frac{M_1}{P}\sin\alpha(x-x_1), \tag{19}$$

$$y=\frac{\theta_1}{\alpha}\sin\alpha(x-x_1)+\frac{M_1}{P}\cos\alpha(x-x_1), \tag{20}$$

where

$$\alpha^2=\frac{P}{EI}.$$
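The elastic slope-deflexion relations are easy to evaluate directly. The sketch below assumes that equations (19) and (20) have the forms $dy/dx=\theta_1\cos\alpha(x-x_1)-\alpha(M_1/P)\sin\alpha(x-x_1)$ and $y=(\theta_1/\alpha)\sin\alpha(x-x_1)+(M_1/P)\cos\alpha(x-x_1)$ with $\alpha^2=P/EI$. The function name, the value of $E$ (about 13 400 tons/sq.in. for steel) and the section dimensions are assumptions made for illustration, not data from the paper.

```python
import math


def elastic_length_state(x, x1, theta1, M1, P, E, I):
    """Slope and deflexion in the elastic length, assumed equations (19), (20)."""
    alpha = math.sqrt(P / (E * I))
    u = alpha * (x - x1)
    slope = theta1 * math.cos(u) - alpha * (M1 / P) * math.sin(u)
    y = (theta1 / alpha) * math.sin(u) + (M1 / P) * math.cos(u)
    return slope, y


if __name__ == "__main__":
    # Hypothetical rectangular section of width b and depth 2d (inches).
    b, d = 1.0, 0.25
    E = 13400.0                      # tons/sq.in., assumed
    I = (2.0 / 3.0) * b * d ** 3     # second moment of area
    P, M1, theta1, x1 = 4.0, 0.5, 0.01, 0.0
    for x in (0.0, 1.0, 2.0, 3.0):
        print(x, elastic_length_state(x, x1, theta1, M1, P, E, I))
```

At $x = x_1$ the functions return $\theta_1$ and $M_1/P$, which is the boundary condition used to fix the constants of integration; the deflexion also satisfies $d^2y/dx^2 = -(P/EI)y$.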


In the case of the primary plastic length, it follows from the geometry of the
stress-strain diagram in figure 7d that

$$\frac{1}{R}=\frac{(f_U-p)}{Ed_e}. \tag{21}$$


The combination of equations (9), (15), (16) and (21) leads to a differential equation
which may be integrated once. Let $x$ and $dy/dx$ have the known values of $x_2$ and $\theta_2$
at any section $S_2$ in the primary plastic length, and let the parameter $\phi$ have the
value $\phi_2$. Then the constant of integration is readily evaluated, and it is found that

$$\frac{dy}{dx}=\sqrt{[\theta_2^2-\eta(\epsilon-\epsilon_2)]}, \tag{22}$$

where

$$\eta=\frac{2}{3}\frac{(f_U-f_L)(f_L-p)}{Ep},$$

$$\epsilon=4\cot\phi\cot\tfrac{1}{2}\phi+\sec\phi,$$

$$\epsilon_2=4\cot\phi_2\cot\tfrac{1}{2}\phi_2+\sec\phi_2.$$

Equation (22) cannot be integrated directly, and it is necessary to resort to a graphical
solution. If, however, no account is taken of the upper yield stress (figure 5b), the
various expressions will be simplified by putting $f_U = f_L$. In these circumstances,
equation (9) reduces to

$$M=M_y\frac{3v-1}{v+1}, \tag{23}$$

where $M_y$ has the value given in equation (12). The differential equation obtained
from equations (15), (16), (21) and (23) can be integrated twice without difficulty.
If the value of $v$ at $S_2$ is denoted by $v_2$, then

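$$x-x_2=-\frac{2\mu Ed}{c^2(f_L-p)}\left\{\frac{\theta}{v+1}-\frac{\theta_2}{v_2+1}-\frac{\mu}{2c}\log_e\frac{(c-\theta)(c+\theta_2)}{(c+\theta)(c-\theta_2)}\right\}, \tag{24}$$

where

$$\mu=\frac{2}{3}\frac{(f_L-p)^2}{Ep},$$

$$c=[\theta_2^2+\mu(1+v_2)]^{\frac{1}{2}},$$

$\theta$ being the slope $dy/dx$ at the section considered.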
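The double integration for the case $f_U = f_L = f$ can be checked by quadrature. The sketch below is an illustration under stated assumptions: it takes $M = M_y(3v-1)/(v+1)$, $M = Py$, $P = 2bdp$ and $d^2y/dx^2 = -(f-p)(v+1)^2/(4Ed)$, from which the slope satisfies $\theta^2 = \theta_2^2 - \mu(v-v_2)$ with $\mu = 2(f-p)^2/(3Ep)$. The closed form coded in `x_closed_form` is one way of writing the resulting integral and its grouping of constants may differ from the printed equation (24). All names and numerical values are hypothetical.

```python
import math


def _constants(f, p, E, d, v2, theta2):
    s = f - p
    mu = 2.0 * s * s / (3.0 * E * p)
    lever = s * d / (3.0 * p)                 # M_y / P for a rectangle
    c = math.sqrt(theta2 ** 2 + mu * (1.0 + v2))
    return mu, lever, c


def x_closed_form(v, v2, theta2, f, p, E, d):
    """Distance x - x2 along the primary plastic length (closed form)."""
    mu, lever, c = _constants(f, p, E, d, v2, theta2)
    theta = math.sqrt(theta2 ** 2 - mu * (v - v2))
    log_term = math.log((c - theta) * (c + theta2)
                        / ((c + theta) * (c - theta2)))
    return -(4.0 * lever / c ** 2) * (theta / (v + 1.0)
                                      - theta2 / (v2 + 1.0)
                                      - mu / (2.0 * c) * log_term)


def x_by_quadrature(v, v2, theta2, f, p, E, d, n=20000):
    """Same distance from x - x2 = integral of (dy/dv) / theta dv."""
    mu, lever, _ = _constants(f, p, E, d, v2, theta2)
    h = (v - v2) / n
    total = 0.0
    for i in range(n):
        vm = v2 + (i + 0.5) * h
        theta = math.sqrt(theta2 ** 2 - mu * (vm - v2))
        dy_dv = lever * 4.0 / (vm + 1.0) ** 2   # derivative of y = M/P
        total += dy_dv / theta * h
    return total


if __name__ == "__main__":
    args = dict(v2=1.0, theta2=0.02, f=20.32, p=5.0, E=13400.0, d=0.25)
    print(x_closed_form(1.1, **args), x_by_quadrature(1.1, **args))
```

The quadrature only works while $\theta_2^2 - \mu(v - v_2)$ stays positive, that is, before the slope falls to zero.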


Considering the secondary plastic length, equation (21) still applies, and leads,
in combination with equations (11), (15) and (16), to a differential equation which
may be integrated directly. If the values of $x$ and $dy/dx$ are $x_3$ and $\theta_3$ at some point
$S_3$ in the secondary plastic length where the value of $d_e$ is given by $d_{e3}$, then it is found




3^ J \jn${fxj — p). 


-J {(2 — sd^) f (sdg + 1 ) — (2 — sdfg) c / {sd^ + 1 )} > (21 ) 


where 


'fU^h-^fuY 

• 4(/p- —p) 

{Emdl - 4(/o— p) ds) * 


and 




4. The theoretical behaviour of a stanchion

BENT IN SINGLE CURVATURE 

To show the use of these equations a single-curvature frame will be examined 
(figure 1 a) having a stanchion of the section denoted by A in figure 2 and made from 
a steel with an upper-yield stress of 22.88 tons/sq.in. and a lower-yield value of
20.32 tons/sq.in. Each beam carried a total load of 1.99 tons, and collapse occurred
when the axial load in the stanchion was 6.67 tons. From ordinary elastic theory it
was found that yield must have occurred in the extreme fibre of the section at the



Figure 11. The deflected form of a stanchion bent in single curvature at
various stages of loading, showing the extent of the plastic zones. Key: after yield in
compression; after yield in tension; overstrained in compression and subsequently
stressed below the yield value.

centre of length of the stanchion when the axial load was 3.98 tons. The calculated
deflected form of the stanchion at this stage is shown in figure 11a, the deflexions
being drawn to a scale 12.5 times the linear scale. The corresponding deflexion at
the centre C (figure 11) is denoted by N in figure 12, where this quantity is plotted
against the axial load in the stanchion. 

After yielding has taken place further calculation is only possible with the aid of
equations (19) to (27). A process of trial and error has to be introduced. For a given 
axial load a value for the bending moment transmitted from beam to stanchion is 


assumed. This makes it possible to calculate the change of slope at the end of the
beam, and with this as a starting point the lengths of the elastic portion E'B (figure
11b) and of the primary plastic length BC of the stanchion can be determined. When
the value of the moment at the connexion has been chosen correctly the sum of the
lengths E'B and BC will be equal to the known half-length (4.375 in.) of the stanchion.
The diagrams c and d in figure 11 were obtained in a similar manner. 



Figure 12. Comparison of observed and theoretical relationships between total axial 
load and central deflexion for a stanchion bent in single curvature. 

It will be seen from the information given above each diagram (figure 11) that the 
end-moment ($M_E$) decreased steadily with increase in axial load. In the test this
had the effect of preventing the growth of the plastic zone along the length of the 
stanchion, despite the fact that its depth was increasing continuously at the centre. 
The effect was so marked for an axial load of 6.55 tons (diagram d) that some part of
the yielded material at the extremities of what had been the plastic zone must have 
been subjected to stresses lower than the yield value. These regions are indicated 
according to the key in figure 11. 

From these calculations it was found that yielding occurred on the tension face 
of the stanchion at the centre of its length at an axial load of 6.75 tons. This was
followed shortly afterwards, at an axial load of 6.78 tons, by yielding at the ends of
the stanchion on the compression face. In diagram e of figure 11 it will be seen that,
at an axial load of 6.82 tons, there is a primary plastic length E'B', an elastic length
B'B, another primary plastic length BC', and finally a secondary plastic length C'C.

Under these conditions the calculated central deflexion of the stanchion has the 
value indicated by S in figure 12. 

Any attempt to consider a more advanced condition of plasticity fails to give
a solution, unless in making the calculation the axial load P is assumed to have
a value less than 6.82 tons. Proceeding in this manner it is possible to show that the
load-deflexion diagram beyond the point S is given by ST in figure 12. In this range
the depths of the plastic zones at C (figure 11e) increase until the tensile and
compressive zones meet and the section becomes completely plastic. Thereafter it is necessary
to assume that what has been called a 'plastic hinge' occurs at this section so that
further straining results in rotation. At the same time the degree of plasticity
increases at the ends until yield also occurs on the tension face, the corresponding value
of the axial load being 6.37 tons, and complete plasticity occurs at the ends under
an axial load of 5.91 tons (figure 11f).

In comparing these theoretical results with the observed values (figure 12) it 
must be remembered that, since dead loading was used in the test it was not possible 
to reduce the axial load so that the range ST could be followed. Collapse therefore 
took place at an axial load (observed to be 6.67 tons) corresponding to the point S.
The observed deflexion points are seen to be in fairly good agreement with the 
theoretical curve up to the collapse load. 

A subsequent calculation assuming $f_U = f_L$ showed that this approximation
introduces little error in deflexion or collapse load. 

This simple theory, however, is not satisfactory for double-curvature bending 
when plastic zones are developed in the course of beam loading. When axial load is 
applied the resultant decrease in end-moment reduces the strain in the plastic 
zones, the corresponding stress-strain relationship being represented approximately 
by CM in figure 5b, while neglect of the effects of overstrain is equivalent to assuming
that plastic strain is reversible and represented by OJS'O. In order to develop a more 
exact analysis the plastic stress distributions at every section have to be modified 
at each stage to take account of their strain history. The principles underlying this 
process are demonstrated in § 5 below. 


5. The equations of bending for a rectangular section when

STRESS REVERSAL OCCURS IN THE PLASTIC RANGE

The process of following the changes in stress of all the fibres in the cross-section 
of a stanchion stressed beyond the yield point is complicated, since a large variety 
of cases arise. It is therefore proposed to consider one example only in demonstrating 
the method which has been used. 

Consider a member of rectangular section (figure 13a) of width $b$ and depth $2d$
subjected to the forces shown in figure 10. If it is assumed that $f_U = f_L$ the stress
distribution in the secondary plastic length will be that represented by GHCDEF in
figure 13b. As long as the straining actions are increasing after the yield point has
been reached in any fibre, all subsequent changes of strain are of the same sign and
no decrease of stress occurs. The relationship between $M$, the bending moment, $P$,
the axial load, and $R$, the radius of curvature in the plane of bending, may therefore
be obtained from equations (5), (11) and (21) above. The line CD (figure 13b)
represents the stress distribution in the elastic part of the cross-section, and hence the
same line cd produced to a and b as in figure 13c represents, to some scale, the
longitudinal strain over the whole section. Let the extreme fibre strains in compression
and tension, represented by points a and b, be denoted by $e_c$ and $e_t$ respectively. The
line CD in the stress diagram b may also be extended to A and B, giving intercepts
of length $g_c$ and $g_t$ respectively; then

$$g_c = Ee_c,\quad g_t = Ee_t \quad\text{and}\quad f_L = Ee_f.$$

Now consider what happens when the applied bending moment changes to $M_1$
and the axial load to $P_1$ in such a way that the longitudinal strain in all fibres of the
cross-section is represented by the line $a_1b_1$ in figure 13c. Let $a_1b_1$ intersect ab at
a point O between a and c. Let the extreme fibre strains in compression and tension,




Figure 13. Stress-strain diagram for a member subjected to stress 
reduction in the partially plastic range. 

as indicated by points $a_1$ and $b_1$, be denoted by $e_{c1}$ and $e_{t1}$ respectively, where
$e_{c1} < e_c$ and $e_{t1} < e_t$. The line $A_1B_1$ may also be drawn in figure 13b, the intercepts $g_{c1}$ and $g_{t1}$
being given by $g_{c1} = Ee_{c1}$ and $g_{t1} = Ee_{t1}$. The material between A and O is now
subjected to a positive tensile change of strain, and that between O and B to a negative
compressive change of strain. Since the material between A and O is initially stressed
beyond the yield point in compression, this positive strain produces a decrease in
stress which, since it takes place elastically, is represented by $AOA_1$ in figure 13b.
Between O and C the strain consists of plastic deformations and there is no change
of stress. Between C and $C_1$ material initially elastic is stressed beyond the yield
point in compression, while between $C_1$ and D the material remains wholly elastic.
Between D and B material initially stressed beyond the yield point in tension is
subjected to a negative strain, and the stress is therefore decreased elastically. The
change of stress in all fibres between C and B is given by $CC_1B_1B$ in figure 13b, and
the final stress distribution takes the form $GH_1OC_1E_1F$ in figure 13d.

Owing to the change of bending moment and axial load, the radius of curvature
in the plane of bending changes from $R$ to $R_1$. In order to determine the relationship



between $M_1$, $P_1$ and $R_1$ it is necessary to consider the change of stress distribution
in greater detail. Let


B 


2d 


e = ■ 




2d ’ 

2 ’ 


and <T=ief—e)B, = (ef- e^) Bi- 

Also, it is found from the geometry of figure 13c that

y^ = d+(r, 
y<^ = d+(ri, 

o-iPi-o-/?! 


yo = <i+- 


(28) 


Pi-P J 

By considering the change in stress distribution given by the areas $OAA_1$ and $BCC_1B_1$
in figure 13b, it follows that

$$P_1-P=b\{BCC_1B_1-OAA_1\},$$

and if - Jfi = ^{y^igt +fL) {d - ^c) - ye^iSh +/i) (d - i2/ci) 


+ (2d - y^) {g^ -g^^){d-\U- y^)}. 

Using the values of y^, and y^ given by equations (28) it is found that 

= ^^^^-4d{<r,p^-crp), 


(29) 

(30) 


(31) 

(32) 


where 


Pi-p 
2bdE ’ 


^ oi.jnj' > y 


Z{M-^-M) 
2bd^E ■ 


If $M$, $M_1$, $P$, $P_1$, $\rho$ and $\sigma$ are known, these equations determine the values of $\rho_1$
and $\sigma_1$. Unfortunately, they can only be solved by trial and error, but if $(M_1 - M)$
and $(P_1 - P)$ are small compared with $M$ and $P$ respectively, a direct solution is
possible. Let $M_1 = M + \delta M$ and $P_1 = P + \delta P$. Then equations (31) and (32) become


S SP _ is<7\ ,, 

-Ms)”""’ 

(33) 

6 8M JSa-Y o JSa-Y 

bE Sp \5p) (dp) ' 

(34) 


By eliminating $\delta\rho$ from the left-hand side of equations (33) and (34), it is found that

The value of $\delta\sigma/\delta\rho$ can be obtained from equation (35), and substituted in equation
(33) to give the value of $\delta\rho$. Hence the change of curvature can be calculated directly


from the changes in bending moment $\delta M$ and axial load $\delta P$. These equations only
apply as long as O, the point of intersection of ab and $a_1b_1$ (figure 13c), lies between
a and c.


Hence 


$$\sigma<\sigma_1 \quad\text{and}\quad \rho(d-\sigma)<\rho_1(d-\sigma_1).$$


It is found that altogether it is necessary to consider ten sets of equations covering
all the possible cases of change of stress. Thus the point of intersection O of the strain
lines ab and $a_1b_1$ in figure 13c may fall on ba produced, within ac, within cd, within
db or on ab produced, giving five possibilities. For each of these positions of
intersection, the inclination of $a_1b_1$ to the line of zero strain may be greater or smaller
than that of ab, resulting in ten cases, each of which gives a different type of stress
distribution. In the example quoted above, it has been assumed that the initial
stress distribution is one in which overstrain has not already occurred, but since only
changes of stress are being considered, it is usually, though not always, possible to
apply the same equations to more complicated initial stress distributions.


6. The theoretical behaviour of a stanchion 

BENT IN DOUBLE CURVATURE 

Having obtained a relationship between the changes in curvature, axial load and 
bending moment for any initial stress distribution, it is possible to carry out a 
step-by-step process of integration along a stanchion, after assuming some definite
increase in axial load. This process has been applied to a stanchion bent in double 
curvature, and the results are illustrated in figure 14. 

Diagram a of figure 14 corresponds to the beam load at which the lower yield stress 
is first reached in the stanchion. Up to the full beam loading, figure 14b, all strain
increments in any fibre are of the same sign, and consequently no overstrain effects
are introduced. However, as soon as direct axial load is applied, there is a decrease
in stress in practically all the tension fibres, and the greater part of the material in
the tensile plastic zone in diagram b immediately begins to show overstrain effects.
The state of the stanchion when the total axial load is 3.00 tons is represented by
diagram c, the area abc being that initially strained beyond the yield point in tension 
and subsequently stressed below the yield value. 

According to the analysis, as the total axial load increases, the compressive plastic 
zone extends towards the centre of length of the stanchion. At the same time the 
terminal bending moments decrease, so causing a decrease in stress in some of the 
extreme compressive fibres near the ends. This begins to occur first when the total 
axial load is somewhat under 6.42 tons, and consequently there is a compressive
overstrain zone at this load denoted by def in diagram d. The maximum bending
moment no longer occurs at the ends of the stanchion but at sections K and K' at
a distance of 0.62 in. from E' and E'' (diagram a) respectively. The remaining
diagrams e and f correspond to axial loads of 7.50 and 7.90 tons respectively. It will be
seen that as the axial load increases the compressive plastic zones become deeper and
continue to approach nearer to the centre of length of the stanchion. The compressive
overstrain zones also increase in length. The end bending moments decrease
to zero and then increase with reversed sign, being 0.076 ton-in. when the axial load
is 7.50 tons and -0.132 ton-in. at 7.90 tons.




Figure 14. The deflected form of a stanchion bent in double curvature at various
stages of loading, showing the extent of the plastic zones. Diagrams d, e and f
correspond to axial loads of 6.42, 7.50 and 7.90 tons. Key: overstrained in compression and
subsequently stressed below the yield value; overstrained in tension and subsequently
stressed below the yield value.


The relationship between axial load and the rotation of the central section is shown
by curve 1, figure 15. As the load approaches 7.90 tons the central rotation is
increasing rapidly, in fact to such an extent that this load could fairly be said to
represent the theoretical collapse load. When an analysis was carried out neglecting
overstrain, i.e. assuming that the stress-strain relationship shown in figure 5b was
reversible, curve 2 was obtained, showing a collapse load of 6.43 tons. These
theoretical curves may be compared with the observed points, the observed collapse load
being 7.33 tons. Thus the theoretical collapse load was 12.3 % too low when
overstrain was neglected, and 7.9 % too high when overstrain was allowed for. The
agreement obtained when overstrain is allowed for is therefore better than when it
is neglected, and the remaining discrepancy is possibly due to the fact that the
stress-strain relation has been over-simplified. It is possible that in the 'unloading'
range the relation may be more accurately represented by some curve such as CDE
(figure 5b) rather than by the straight line CM (Howard & Smith 1925). However,
despite this, the work carried out does show that the effect of overstrain is
appreciable, but that to ignore it gives a calculated collapse load on the safe side.
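The percentage discrepancies follow directly from the quoted collapse loads, and the arithmetic can be repeated as a check. The sketch below uses only the three loads given in the text; the function name is a hypothetical label. Computed from the loads as printed, the second figure comes out nearer 7.8 % than the 7.9 % quoted, a difference that may simply reflect rounding of the loads.

```python
def percentage_error(theoretical, observed):
    """Signed error of a theoretical load relative to the observed load."""
    return 100.0 * (theoretical - observed) / observed


OBSERVED = 7.33          # tons, observed collapse load
NEGLECTING = 6.43        # tons, theory with overstrain neglected (curve 2)
ALLOWING = 7.90          # tons, theory with overstrain allowed for (curve 1)

if __name__ == "__main__":
    print(round(percentage_error(NEGLECTING, OBSERVED), 1))
    print(round(percentage_error(ALLOWING, OBSERVED), 1))
```

A negative result means the theoretical load is below the observed one, that is, on the safe side.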



Figure 15. Comparison of observed and theoretical relationships between total axial
load and rotation of the central section for a stanchion bent in double curvature. 


References 

Baker, J. F. 1934 Second Report, Steel Structures Research Committee, H.M.S.O. 13.

Baker, J. F. & Holder, P. D. 1936 Final Report, Steel Structures Research Committee, H.M.S.O.
436.

Baker, J. F. & Williams, E. L. 1936 Final Report, Steel Structures Research Committee,
H.M.S.O. 611.

Howard, J. V. & Smith, S. L. 1925 Proc. Roy. Soc. A, 107, 113.

Roderick, J. W. 1948 Phil. Mag. 39, 529.




Studies in polymerization 
V. The polymerization of vinyl acetate 

By G. Dixon-Lewis, Courtaulds Limited, Maidenhead, Berks.
(Communicated by A. H. Wilson, F.R.S. Received 21 December 1948)


The velocity constants of propagation, transfer and termination in the polymerization of 
vinyl acetate at — 15 and 0° C have been determined by the viscosity method. A comparison of 
the results with those of other authors has shown that the values for the propagation and 
termination constants given in table 6 are probably correct to within a factor of 2. 

The quantum yield of the initiating reaction has been determined, and was found to be
quite small, about 0.04.


INTRODUCTION

In recent years a number of investigations have been carried out on the polymeriza- 
tion of liquid vinyl acetate, but in no case have reliable values for all the velocity 
constants been determined. Bamford & Dewar (1948 a), working with styrene, have 
shown that four types of reaction are important in vinyl polymerizations, which are 
chain processes: initiation, propagation, transfer and termination. They have been
able, by a new viscometric method, to determine all four velocity constants. In the 
case of vinyl acetate, Burnett & Melville (1947) and Swain & Bartlett (1946) have 
determined the propagation and termination constants by the rotating sector 
method of Briers, Chapman & Walters (1926), combined with independent deter- 
minations of the rate of chain starting. However, their values for the termination 
constant differed by a factor of nearly 40, and it seemed that further determinations
were desirable. Earlier work by Bagdassarian (1944) has also shown the importance 
of chain transfer in the polymerization of vinyl acetate, the average molecular weight 
of the polymer formed by irradiating the liquid at low light intensities being deter-
mined primarily by the chain-propagating and chain transfer reactions. 

In the present paper, all the velocity constants for the reactions involved in the 
polymerization of vinyl acetate have been determined by the method of Bamford 
& Dewar (1948 a). The quantum yield of the initiating reaction has also been 
measured. 

Purification of vinyl acetate 

Commercial vinyl acetate was dried over calcium chloride, and then fractionated 
through a 20-plate column in an atmosphere of nitrogen. The sample was thoroughly 
degassed, and all further purification was carried out in vacuo (p< IQ-’^mm.). An 
all-glass apparatus was used, and at no stage in the purification was the liquid or 
its vapour allowed to come into contact with any greased joints or taps. The last 
traces of water were removed from the liquid by standing for 2 hr. over a few pieces 
of freshly cut lithium, after which the vinyl acetate was distilled and sealed off in 
a quartz tube fitted with a vacuum breaker. 



The final stage of the purification consisted in the removal of the last traces of 
acetaldehyde, which is present in commercial vinyl acetate, and of the last traces
of inhibitor. The removal of the former substance is important, since, not only is
it a photosensitizer for the polymerization, but also it is an active chain transfer
agent. Its removal was eventually accomplished, together with the removal of
inhibitor, by prolonged irradiation with a hot mercury arc, using a Pyrex glass filter,
followed by heating the liquid to 60° C for 2 or 3 days. Under these conditions the
acetaldehyde was almost entirely responsible for the initiation of the polymerization
reaction, and was itself removed in the initiating process. Preliminary experiments
in a Pyrex vacuum viscometer of the usual type had shown that vinyl acetate
containing traces of acetaldehyde underwent appreciable polymerization at 25° C in
the dark. The final product did not do this.

The vinyl acetate was finally distilled in vacuo from the quartz tube into the quartz 
vacuum viscometers in which the rate measurements were carried out. The visco- 
meters were of the type already described by Bamford & Dewar (1949 a). 



Figure 1. The absorption spectrum of vinyl acetate as measured by
(1) Burnett & Melville, (2) the author.

The absorption spectrum of vinyl acetate 

The absorption spectrum of a sample of vinyl acetate purified as above, and 
distilled in vacuo into a quartz cell 5 cm. long, was determined in the region of
wave-lengths 2900 to 3100 Å by means of a Hilger medium quartz spectrograph and
Spekker photometer. The sample had a molar decadic extinction coefficient ε = 0.014
at λ = 3000 Å, whereas at this wave-length acetaldehyde has ε = 14 (Schou 1929).
The upper limit for the concentration of this substance in the sample is therefore
$10^{-2}$ moles/l.
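The upper limit can be estimated by supposing that all the absorption at 3000 Å is due to acetaldehyde. The sketch below is a rough order-of-magnitude check only: the density of liquid vinyl acetate (about 0.93 g/ml) and its molar mass (86.09 g/mole) are assumed values not given in the text, and the function name is hypothetical.

```python
def acetaldehyde_upper_limit(eps_sample, eps_acetaldehyde, monomer_molarity):
    """Largest acetaldehyde concentration (moles/l.) consistent with the
    measured absorption, if acetaldehyde alone absorbed the light."""
    absorbance_per_cm = eps_sample * monomer_molarity
    return absorbance_per_cm / eps_acetaldehyde


# Assumed physical data for liquid vinyl acetate (not from the text).
DENSITY_G_PER_ML = 0.93
MOLAR_MASS_G = 86.09
MONOMER_MOLARITY = 1000.0 * DENSITY_G_PER_ML / MOLAR_MASS_G   # moles/l.

if __name__ == "__main__":
    print(MONOMER_MOLARITY)
    print(acetaldehyde_upper_limit(0.014, 14.0, MONOMER_MOLARITY))
```

On these assumptions the limit is of the order of one hundredth of a mole per litre, about one part in a thousand of the monomer.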

The absorption spectrum of vinyl acetate in hexane was also determined in a 1 cm. 
cell using the above instruments. The vinyl acetate had been dried, and then dis- 
tilled and fractionated in an atmosphere of nitrogen. The spectrum is shown in 
figure 1, curve 2. It will be noted that the light absorption by the monomer is con- 
siderably less than that obtained by Burnett & Melville (1947, figure 1, curve 1). 

Vol. 198. A. 




G. Dixon-Lewis 


Summary of the method 

The method used to determine the velocity constants in the polymerization is
that described by Bamford & Dewar (1948a), in which viscosity measurements are
used to follow the course of the polymerization. It has already been mentioned that
pure vinyl acetate does not polymerize at 25° C in the dark, i.e. $k_1 = 0$, and this fact
has made it necessary to modify the method very slightly. The rate of chain starting
in the photochemical experiments is simply proportional to the light intensity. The
complete reaction scheme is therefore as follows:

\[
\begin{array}{lll}
\text{initiation} & 2M \rightarrow 2X \text{ or } D_2 & \text{rate} = AI[M]^2 \\
\text{propagation} & X + M \rightarrow X & (k_2) \\
\text{transfer} & D_2 + M \rightarrow D_1 + B & (2k_3) \\
 & D_1 + M \rightarrow P_i + B & (k_3) \\
 & B + M \rightarrow P_t + B & (k_3) \\
\text{termination} & D_2 + X \rightarrow D_1 + Q & (2k_4) \\
 & D_1 + X \rightarrow P_i + Q & (k_4) \\
 & B + X \rightarrow P_t + Q & (k_4) \\
 & 2X \rightarrow 2Q & (k_4)
\end{array}
\]

where X represents any active centre and Q any dead centre, $D_2$ is the initial polymer
growing at both ends, $D_1$ is a polymer growing at one end only, B is the growing
transfer polymer, and $P_t$ and $P_i$ are dead transfer and initial polymers respectively.

In order to interpret the results of the viscosity measurements, it is also necessary
to know the relation between viscosity, concentration and mean degree of poly-
merization. The relations are now known to be as follows:

(a) For homogeneous polymer (Houwink 1940)

\[ [\eta] = K(M_0 P)^{\alpha}, \qquad (2) \]

where $[\eta]$ is the intrinsic viscosity, and is equal to the ideal specific viscosity $\eta$
divided by the concentration of polymer in base moles/l., $M_0$ is the molecular weight
of the monomer, $P$ is the degree of polymerization and $K$ and $\alpha$ are constants.
From (2), the following relations can be derived (Bamford & Dewar 1948b):

(b) For heterogeneous polymer produced by the growth of a single radical centre,
e.g. for the transfer polymer above,

\[ \eta = K'(M_0 \bar{P})^{\alpha} c, \qquad (3) \]

where $\bar{P}$ is the number average degree of polymerization, and

\[ K' = K\,\Gamma(2+\alpha). \]

(c) For a polymer prepared by the growth of a double radical,

\[ \eta = K''(M_0 \bar{P})^{\alpha} c, \qquad (4) \]

where

\[ K'' = K\,\frac{\Gamma(3+\alpha)}{2^{1+\alpha}} = K'\,\frac{2+\alpha}{2^{1+\alpha}}. \]


Polymerization of vinyl acetate 513 

Equations (3) and (4) have been used in the present work. For polyvinyl acetate,
the results of Wagner (1947) lead to the following values, when c is expressed in the
above units, and the solvent is acetone at 25° C:

\[ K' = 2.28 \times 10^{-3}, \qquad \alpha = 0.68. \]
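The relations between K, K' and K'' quoted above lend themselves to a quick numerical check. The following is a minimal sketch only: the function names are illustrative and not taken from the paper, and it assumes the relations K' = K Γ(2+α) and K'' = K Γ(3+α)/2^(1+α), evaluated with Wagner's K' and the rational value α = 2/3.

```python
import math


def k_homogeneous(k_prime: float, alpha: float) -> float:
    """K for homogeneous polymer, from K' = K * Gamma(2 + alpha)."""
    return k_prime / math.gamma(2.0 + alpha)


def k_double_radical(k_prime: float, alpha: float) -> float:
    """K'' for double-radical growth: K'' = K' (2 + alpha) / 2**(1 + alpha)."""
    return k_prime * (2.0 + alpha) / 2.0 ** (1.0 + alpha)


K_PRIME = 2.28e-3   # Wagner (1947), acetone at 25 C
ALPHA = 2.0 / 3.0   # rational value assumed in place of the experimental 0.68

K = k_homogeneous(K_PRIME, ALPHA)
K_DOUBLE = k_double_radical(K_PRIME, ALPHA)
# Cross-check through the gamma-function form K Gamma(3 + alpha) / 2**(1 + alpha)
K_DOUBLE_VIA_GAMMA = K * math.gamma(3.0 + ALPHA) / 2.0 ** (1.0 + ALPHA)

print(K, K_DOUBLE, K_DOUBLE_VIA_GAMMA)
```

The two routes to K'' agree because Γ(3+α) = (2+α) Γ(2+α).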


It should be noted that $\eta$ in the above equations represents the ideal specific
viscosity (i.e. $c[\eta]$, where c is the concentration of polymer). The viscosity values
given later are all ideal values. They have been determined from the observed
specific viscosities $(\eta_{sp})_c$ at concentration c by the application of the equation of Schulz
& Sing (1943):

\[ \eta = \frac{(\eta_{sp})_c}{1 + 0.28\,(\eta_{sp})_c}, \qquad (5) \]

which these authors claim to hold for dilute solutions of all polymers. In the present
experiments, no measurements were carried out on solutions having a specific
viscosity greater than 1. This corresponds to a conversion of less than 1 %.
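As an illustration of the correction just described, the sketch below converts an observed specific viscosity into the ideal value. It assumes that equation (5) has the Schulz-Blaschke form, ideal value = observed / (1 + 0.28 × observed); the function name is hypothetical.

```python
def ideal_specific_viscosity(eta_sp_observed: float, k_schulz: float = 0.28) -> float:
    """Ideal specific viscosity c[eta] from the observed specific viscosity.

    Assumes the form eta = eta_sp / (1 + k * eta_sp) with k = 0.28.
    """
    if eta_sp_observed < 0.0:
        raise ValueError("specific viscosity cannot be negative")
    return eta_sp_observed / (1.0 + k_schulz * eta_sp_observed)


# At the largest observed specific viscosity used in the experiments (1.0)
# the correction is already about 22 per cent.
print(ideal_specific_viscosity(1.0))
```

For very dilute solutions the observed and ideal values coincide, which is why the correction matters only towards the upper end of the range studied.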

If it is assumed that the chains are long and that the velocity constants are independent of
the length of the growing polymer chain, the following two expressions can now be
derived:

(a) The number average degree of polymerization is given by

\[ \bar{P} = \frac{2k_2}{2k_3 + (k_4 A I)^{1/2}}. \]

Thus, if the polymer is formed by irradiating the monomer by light of very low
intensity,

\[ \bar{P} = k_2/k_3. \qquad (6) \]
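The dependence of the degree of polymerization on light intensity can be sketched numerically. The snippet assumes the general expression reads P = 2k2 / (2k3 + (k4 A I)^(1/2)), which reduces to equation (6) as I tends to zero; the value of A and the intensities are hypothetical, and the velocity constants are merely of the order reported later in the paper.

```python
import math


def number_average_dp(k2: float, k3: float, k4: float,
                      a_const: float, intensity: float) -> float:
    """Number average degree of polymerization under illumination.

    Assumed form: P = 2 k2 / (2 k3 + sqrt(k4 * A * I)).
    """
    if intensity < 0.0:
        raise ValueError("intensity cannot be negative")
    return 2.0 * k2 / (2.0 * k3 + math.sqrt(k4 * a_const * intensity))


K2, K3, K4 = 2.8e3, 0.14, 2.2e8   # illustrative magnitudes only
A_HYPOTHETICAL = 1.0e-14          # arbitrary constant, assumed for the sketch

for intensity in (0.0, 10.0, 1000.0):
    print(intensity, number_average_dp(K2, K3, K4, A_HYPOTHETICAL, intensity))
```

At zero intensity the function returns k2/k3, and it falls steadily as the intensity rises, since termination then competes with transfer in ending the chains.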

(b) The rate of change of the ideal specific viscosity in the photochemical poly-
merization is given by

$d\eta/dt$ = [right-hand side illegible in the source]   (7)


With the exception of $k_3/k_4^{1/2}$ and A, all the terms in equation (7) are known. Rate
measurements at a number of light intensities enable the above two quantities to 
be calculated by trial and error. The value of A depends, of course, on the units used 
for the light intensities. It should be noted that only relative light intensities need 
be known at this stage; in a later section the absolute light intensities have been 
determined in order to calculate the quantum yield for the initiating reaction. 

In order to determine the three velocity constants absolutely, one further relation 
between them is required. This is provided by measurement of the photochemical 
after-effect, or, more strictly, the difference between the after-effect and the pre-effect. 
This quantity, which will be denoted as $\delta\eta_I$, arises from the fact that when the light
intensity is altered, e.g. from I to zero, the rate of polymerization does not im-
mediately fall to its dark value, but approaches this latter value gradually. Similar
considerations apply when the intensity is varied in the reverse direction. $\delta\eta_I$ is
defined as the difference between the observed ideal specific viscosity at time t




and that calculated on the assumption that the rate changes instantaneously to its
final value on turning on or cutting off the light. Experimentally, it can be deter-
mined by means of two irradiations at the same intensity, but for different times, or
as will be shown later.

The method of calculating $\delta\eta_I$ has already been outlined by Bamford & Dewar
(1948a); in the present case ($k_1 = 0$), it can be shown that, when $\alpha = \tfrac{2}{3}$,


$\delta\eta_I$ = [expression illegible in the source]   (8)


Hence, $k_2/k_3$, $k_3/k_4^{1/2}$ and A being known, it is possible to determine $k_4$ from a measure-
ment of $\delta\eta_I$ at a known light intensity.


Experimental

(a) Measurement of $\bar{P}$

Small amounts of polyvinyl acetate were prepared at 0 and at 25° C by irradiating 
the pure monomer in a quartz tube at low light intensities where, to a sufficient 
approximation, equation (6) holds. 

The light source used was a hot mercury arc combined with, at 0° C, a hot (100° C)
50 % acetic acid filter (see below), and at 25° C a Pyrex glass filter.

Solutions of the polymers so prepared were made up in acetone, and their specific
viscosities determined at 25° C with an Ostwald viscometer. The number average
degrees of polymerization could then be determined by means of equation (3), which,
with substitution of Wagner’s values for K' and $\alpha$, becomes*

\[ \eta = 2.28 \times 10^{-3} (M_0 \bar{P})^{2/3} c. \]

* It is necessary to use rational values of $\alpha$ to allow calculation of $\delta\eta_I$. Compare Bamford &
Dewar (1948a). The value 2/3 has therefore been used instead of the experimental 0.68.




Similar experiments were also carried out on the 0° C polymer using vinyl acetate 
monomer as solvent, with almost identical results. The values for P are given in 
table 1. 


Table 1

temperature of          $\bar{P} \times 10^{-4}$     $\bar{P} \times 10^{-4}$ calc.
polymerization (°C)     exp.                         with vinyl acetate as solvent
0                       2.045                        2.15
25                      1.300                        -


The last column of table 1 shows that the same value of K' can be used when the 
solvent is vinyl acetate. 

(b) Measurements of rates

The photochemical rates were measured at 0 and -15° C for a number of different
light intensities. At 0° C the viscometer was enclosed in a quartz tube through which
water from an ice-bath was circulated. For the measurements at -15° C a specially
designed thermostat tank was constructed with double quartz windows. The visco-
meter was also clamped in this tank in such a way that the viscosity of the polymer
solution could be observed without difficulty. The light source used in both cases
was a hot mercury arc combined with the hot 50 % acetic acid filter. Under these
conditions very little light of wave-length less than 2650 A reached the vinyl acetate,
and although the 2650 A line is absorbed by the vinyl acetate rather more strongly
than is absolutely desirable to ensure an approximately uniform rate of chain
initiation throughout the liquid, it has been necessary to use light of this wave-
length in order to produce sufficiently high light intensities for the application of
the method.* This procedure does not appear to have introduced any very serious
error, but it is possible that the value determined for the termination constant may
be somewhat too large. Any error has been removed as far as possible by carrying
out the measurements at 0 and at -15° C, where the light absorption would be
expected to be smaller than at temperatures above 0° C. Although it was found to
be impossible at -15° C to vary the light intensity sufficiently to produce an abso-
lutely unambiguous intensity-rate curve (figure 3), the results obtained are regarded
as sufficiently accurate to allow the determination of the constants to within quite
narrow limits.

The occurrence of the photochemical after-effect excludes the possibility of 
measuring the rate under illumination by means of a single exposure. The technique 
has therefore been to carry out two irradiations at the same light intensity, for $t_1$
and $t_2$ sec. respectively. If the viscosity is then measured at such a time after each
irradiation that the after-effect is complete, then the difference between the total
viscosity increases in the two cases is given by

\[ \Delta\eta_2 - \Delta\eta_1 = a(t_2 - t_1), \]


* Consideration of equation (7) shows that at low light intensities the photochemical rate is
proportional to the square root of the intensity, whereas at high light intensities, and when
$\alpha = \tfrac{2}{3}$, it is proportional to $I^{1/6}$. For the application of the method, it is necessary to study the
intermediate region of the intensity-rate curve.





where a is the photochemical rate. Theoretically, in the case where there is no dark
reaction, the after-effect should be infinite (equation (8) with $[X]_1 = 0$), but in prac-
tice it was found that the viscosity became sensibly constant in less than 20 min.
after the end of an exposure. The viscosities were therefore always determined after
this time had elapsed.
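The two-exposure technique can be sketched as follows. Since each exposure carries the same completed after-effect, that contribution cancels when the two total viscosity increases are subtracted. The function name and the numerical readings are hypothetical and serve only to show the arithmetic.

```python
def photochemical_rate(delta_eta_1: float, t1_sec: float,
                       delta_eta_2: float, t2_sec: float) -> float:
    """Photochemical rate a from two irradiations at the same intensity.

    delta_eta_1, delta_eta_2 are the total ideal specific viscosity increases,
    read after the after-effect of each exposure is complete.
    """
    if t1_sec == t2_sec:
        raise ValueError("the two exposure times must differ")
    return (delta_eta_2 - delta_eta_1) / (t2_sec - t1_sec)


# Hypothetical readings: 0.030 after a 10 sec. exposure, 0.072 after 20 sec.
print(photochemical_rate(0.030, 10.0, 0.072, 20.0))
```

Any constant offset common to both readings drops out of the difference, which is the point of using two exposures rather than one.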

The relative incident intensities were measured by means of a photocell-galvano- 
meter system. 



Figure 2. The intensity-rate curve for the polymerization of vinyl acetate at 0° C
(experimental points and calculated curve).


The results were fitted by trial and error to equation (7), and values of $k_3/k_4^{1/2}$ and
A determined, the values of $k_2/k_3$ being already known (table 1). The values used
for K' and $\alpha$ were $2.28 \times 10^{-3}$ and 2/3 respectively. The experimentally determined
photochemical rates at 0 and -15° C are given in table 2, and the values of the
constants giving the best fit in table 3. The experimental points are shown together
with the intensity-rate curves calculated from these constants in figures 2 and 3;
the agreement is good. The value of $k_2/k_3$ at -15° C was obtained by extrapolating
the curve of log $\bar{P}$ against 1/T (° K).
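The extrapolation of log P against 1/T can be reproduced approximately from the two values in table 1. The sketch below assumes a strictly linear relation through the 0 and 25° C points (the paper's graphical extrapolation may differ slightly) and takes 0° C as 273.15° K; the function name is illustrative.

```python
import math

R_CAL = 1.987  # gas constant, cal / (mole deg.)


def extrapolate_log_linear(t1_c: float, p1: float,
                           t2_c: float, p2: float,
                           target_c: float) -> tuple:
    """Extrapolate ln P linearly in 1/T; return (P at target, slope in deg. K)."""
    inv_t1 = 1.0 / (t1_c + 273.15)
    inv_t2 = 1.0 / (t2_c + 273.15)
    inv_target = 1.0 / (target_c + 273.15)
    slope = (math.log(p2) - math.log(p1)) / (inv_t2 - inv_t1)
    ln_p = math.log(p1) + slope * (inv_target - inv_t1)
    return math.exp(ln_p), slope


P_MINUS_15, SLOPE = extrapolate_log_linear(0.0, 2.045e4, 25.0, 1.300e4, -15.0)
# Since P = k2/k3, the slope of ln P against 1/T equals -(E2 - E3)/R.
E2_MINUS_E3_KCAL = -R_CAL * SLOPE / 1000.0

print(P_MINUS_15, E2_MINUS_E3_KCAL)
```

The extrapolated value lies within about 1 % of the figure of 2.786 × 10^4 adopted in the paper, and the slope corresponds to a difference of activation energies close to -2.9 kcal.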

Table 2. The photochemical rates of polymerization of vinyl acetate

temperature 0° C                                    temperature -15° C
I                   $10^{4}\,d\eta/dt$ (sec.$^{-1}$)    I                   $10^{4}\,d\eta/dt$ (sec.$^{-1}$)
(arbitrary units)   (obs.)                          (arbitrary units)   (obs.)
43.5                8.4                             1.1                 3.3
80                  10.8                            4.1                 7.1
812                 36.7                            26                  16.9
1632                42.6                            83                  24.0
3182                53.5
3223                54.5
5753                64.0





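Table 3. The ratios of the velocity constants in the photopolymerization of vinyl acetate

temperature (°C)   $k_2/k_3$               $k_3/k_4^{1/2}$         A (arbitrary constant)
0                  $2.045 \times 10^{4}$   $9.3 \times 10^{-6}$    $1.66 \times 10^{-7}$
-15                $2.786 \times 10^{4}$   $4.9 \times 10^{-6}$    $5.0 \times 10^{-7}$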

(c) Measurement of the photochemical after-effect 

It has already been stated that when there is no dark reaction, the photochemical 
after-effect should be infinite. That the observed after-effect is not infinite must 
therefore imply that some side reactions terminate the kinetic chains when the 
concentration of radicals has become very small. Although the error so introduced 
was unimportant when determining the photochemical rates, it becomes exceedingly
important when measuring the after-effect itself. An alternative method has there-
fore been adopted for measuring the latter quantity. Two irradiations were carried
out at the same light intensity, and for the same total time; the first irradiation
being continuous, and the second intermittent, e.g. the first exposure was of 20 sec.
duration, and the second was divided into four parts, each of 5 sec. duration, with
15 sec. intervals. The difference between the two viscosity increases is then equal to
$\delta\eta_I$ in equation (8).



Figure 3. The intensity-rate curve for the polymerization of vinyl acetate at -15° C
(experimental points and calculated curve).

The light intensities were measured as before, and in the same arbitrary units. 
The exposure times were accurately controlled by a shutter operated electro-
magnetically by a clock motor and commutator.

The values of the after-effects so determined are given in table 4. The smaller after-
effects at -15° C were considerably less accurately measurable than those at 0° C,
and the value of the mean experimental after-effect at the former temperature is
such that the termination constant would possess a small negative activation energy.


$E_4$ has therefore been adjusted to zero, and the value of the mean experimental
after-effect at -15° C calculated using the value of $k_4$ ($2.2 \times 10^{8}$) found at 0° C. The
calculated value is seen to be within the limits of experimental error.

Table 4. The photochemical after-effect in the polymerization of vinyl acetate

temperature   I (units as in      $\delta\eta_I \times 10^{3}$ (mean        $\delta\eta_I \times 10^{3}$
(°C)          tables 2 and 3)     experimental value)                   (calc.)
0             5200                12.7                                  12.8
-15           30                  6.0                                   5.8
                                  (individual experiments
                                  6.1, 5.4, 5.1)

Results (velocity constants, activation energies and frequency factors)

The velocity constants (in l. mole$^{-1}$ sec.$^{-1}$) at 0 and -15° C for the polymerization
of vinyl acetate are given in table 5. Table 6 gives the energies of activation and the
frequency factors. The latter are probably correct to within a factor of 2, and the
former to ± 1 kcal.

Table 5. The velocity constants in the polymerization of vinyl acetate

temperature (°C)   $k_2$                 $k_3$                  $k_4$
0                  $2.8 \times 10^{3}$   $1.4 \times 10^{-1}$   $2.2 \times 10^{8}$
-15                $2.0 \times 10^{3}$   $7.3 \times 10^{-2}$   $2.2 \times 10^{8}$

At these temperatures the value of $k_1$ (thermal initiation) is so small as to be
indistinguishable from zero.

Table 6. The activation energies and frequency factors in the polymerization of vinyl acetate

$E_2$ = 3.2 kcal.   $A_2 = 9.8 \times 10^{5}$
$E_3$ = 6.1 kcal.   $A_3 = 9.9 \times 10^{3}$
$E_4$ = 0           $A_4 = 2.2 \times 10^{8}$
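The way an activation energy and frequency factor follow from velocity constants at two temperatures can be sketched for the transfer constant, using the values 1.4 × 10^-1 at 0° C and 7.3 × 10^-2 at -15° C from table 5. This is a plain two-point Arrhenius calculation, not necessarily the author's procedure; the function name is illustrative and 0° C is taken as 273.15° K.

```python
import math

R_CAL = 1.987  # gas constant, cal / (mole deg.)


def arrhenius_from_two_points(k_a: float, t_a_c: float,
                              k_b: float, t_b_c: float) -> tuple:
    """Return (E in kcal, frequency factor A) assuming k = A exp(-E/RT)."""
    t_a = t_a_c + 273.15
    t_b = t_b_c + 273.15
    # ln(k_a/k_b) = (E/R) (1/T_b - 1/T_a)
    e_cal = R_CAL * math.log(k_a / k_b) / (1.0 / t_b - 1.0 / t_a)
    a_factor = k_a * math.exp(e_cal / (R_CAL * t_a))
    return e_cal / 1000.0, a_factor


E3_KCAL, A3 = arrhenius_from_two_points(1.4e-1, 0.0, 7.3e-2, -15.0)
print(E3_KCAL, A3)
```

The result, about 6.1 kcal with a frequency factor near 10^4, is consistent with the tabulated values within the rounding of the velocity constants.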

The quantum yield of the initiating reaction

The determination of the velocity constants described above requires only a 
knowledge of the relative light intensities. From equation (7) it can be seen that, if 
the photochemical rate corresponding to a known absolute light absorption is 
measured, then the quantum yield of the initiating reaction can be obtained. At 
very low light intensities, where the square root law is obeyed, equation (7) becomes 






The experiments were carried out in this intensity range, at a temperature of
25° C. The vacuum viscometer used had a quartz cell of 5 cm. diameter, and 5 mm.
thick, fused to one end. Light from a hot mercury arc was rendered parallel by a
series of quartz lenses, and was passed through a series of filter solutions as follows:

(a) chlorine gas at 1 atm. pressure, 3 cm.

(b) solution of 50 g. NiSO4.6H2O in 100 ml. water, 2 cm.

(c) mixture of 40 % CCl4 + 60 % EtOH by volume, 5 mm.

The transmitted light consisted almost entirely of the 2650 A line. The amounts
of the neighbouring lines transmitted (2537 and 2700 to 2970 A) were sufficiently
small to be neglected in the experiments.

For measuring the incident light intensity, a uranyl oxalate actinometer was used
(Leighton & Forbes 1930). It was impossible by means of a single experiment to
measure both the amount of light absorbed and the photochemical rate, since the
changes produced in the vinyl acetate and in the actinometer by the same amount of
light energy were of quite different orders of magnitude. The light absorption by
the vinyl acetate was therefore calculated from the absorption spectrum (figure 1),
and the photochemical rate was measured at a given light intensity by the method
previously described, with irradiation times of less than 10 min. The viscometer
was then replaced by the uranyl oxalate actinometer, which was left in the light beam
for 6 days. At the end of this time the change produced in the actinometer was
sufficient to allow a reasonable estimate of the absolute light intensity. The actino-
meter cell had a thickness of 3 cm., so that the absorption of the wave-lengths falling
on it could be regarded as complete.

The results of the experiments were as shown in table 7. 


Table 7. Quantum yield experiments

photochemical rate                  $d\eta/dt = 2.56 \times 10^{-4}$
rate of chain starting              $AI[M]^2 = 5 \times 10^{10}$/sec.
total light energy absorbed by      (a) $1.35 \times 10^{12}\,h\nu$, (b) $1.15 \times 10^{12}\,h\nu$;
vinyl acetate/sec.                  $1.26 \times 10^{12}\,h\nu$
hence the quantum yield is          0.04 (±0.01)
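The final step of the table is a simple ratio, sketched below. The two inputs are the figures as read from table 7 (chains started per second and quanta absorbed per second); the exponents are an assumption in so far as the printed values are hard to read, and the function name is hypothetical.

```python
def quantum_yield(chains_started_per_sec: float,
                  quanta_absorbed_per_sec: float) -> float:
    """Quantum yield of initiation: chains started per quantum absorbed."""
    if quanta_absorbed_per_sec <= 0.0:
        raise ValueError("absorbed light must be positive")
    return chains_started_per_sec / quanta_absorbed_per_sec


# Values as read from table 7 (exponents assumed): 5e10 chains/sec.,
# 1.26e12 quanta/sec. absorbed by the vinyl acetate.
PHI = quantum_yield(5e10, 1.26e12)
print(round(PHI, 3))
```

The ratio comes out close to the quoted quantum yield of 0.04.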


Discussion 

(i) It is of interest first to compare the values of the frequency factors and activa- 
tion energies of the constituent reactions with those already known for styrene. 
The values for both styrene and vinyl acetate are given in table 8, the styrene figures 
being those obtained by Bamford & Dewar (1948 a). 


Table 8. Comparison of frequency factors and activation energies
in the polymerizations of styrene and vinyl acetate

        frequency factors                                  activation energies (kcal.)
        styrene                vinyl acetate               styrene    vinyl acetate
$A_2$   $1.02 \times 10^{6}$   $9.8 \times 10^{5}$   $E_2$   6.5        3.2
$A_3$   $1.50 \times 10^{7}$   $9.9 \times 10^{3}$   $E_3$   14.2       6.1
$A_4$   $3.07 \times 10^{9}$   $2.2 \times 10^{8}$   $E_4$   2.8        0





The frequency factor $A_2$ is the same for both substances within the limits
of experimental error, but the values of $E_2$ and $E_3$ for vinyl acetate are considerably
lower than the corresponding values for styrene. This, of course, would be expected,
since the resonance energy of the intermediate α-acetoxyalkyl radicals in the former
polymerization is likely to be considerably lower than the resonance energy of the
benzyl radicals in the latter. Some of the excess resonance energy of the benzyl
radical compared with the α-acetoxyalkyl radical would undoubtedly be lost in the
transition state.

The values of $E_4$ and $A_4$ for vinyl acetate are of considerable interest, since they
are appreciably lower than those for styrene.

(ii) A certain amount of discussion has recently arisen regarding the values of
the velocity constants in the polymerization of vinyl acetate, and particularly regard-
ing the frequency factor for termination. Indirect support for the values given in
table 6 is given by the results of Bagdassarian (1944), who measured, by a dilato-
metric method, the rates of polymerization at 50° C of vinyl acetate at known light
intensities. The equation representing the rate of disappearance of monomer is

\[ -\frac{d[M]}{dt} = k_2\left(\frac{AI}{k_4}\right)^{1/2}[M]^2. \qquad (9) \]

Substitution in equation (9) of the values for $k_2$ and $k_4$ at 50° C calculated from table 6
allows the rate of chain starting $AI[M]^2$ to be determined. The quantum yield
calculated on this basis was found to be in good agreement with
the present value of 0.04.
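The inversion described here, from a measured rate of monomer disappearance to the rate of chain starting, can be sketched under a simple stationary-state assumption: active centres are formed at the rate AI[M]^2 and removed only by mutual termination at the rate k4[X]^2, so that the monomer loss rate is k2[X][M]. These assumptions, the function name and the numbers used in the round-trip check are illustrative and are not taken from Bagdassarian's data.

```python
import math


def chain_starting_rate(monomer_loss_rate: float, k2: float,
                        k4: float, monomer_conc: float) -> float:
    """Rate of chain starting R_i = A I [M]^2 from the rate of monomer loss.

    Assumes -d[M]/dt = k2 [X] [M] with stationary [X] = sqrt(R_i / k4).
    """
    radical_conc = monomer_loss_rate / (k2 * monomer_conc)
    return k4 * radical_conc ** 2


# Round trip with hypothetical numbers: start from an assumed R_i,
# compute the rate it would give, then recover R_i again.
K2, K4, M_CONC = 4.6e3, 2.2e8, 10.0
R_I_ASSUMED = 1.0e-10
RATE = K2 * math.sqrt(R_I_ASSUMED / K4) * M_CONC
R_I_RECOVERED = chain_starting_rate(RATE, K2, K4, M_CONC)
print(RATE, R_I_RECOVERED)
```

Because the rate enters as its square, an error of a factor of 2 in the measured rate becomes a factor of 4 in the inferred rate of chain starting.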

Bagdassarian has also shown that the molecular weight of polyvinyl acetate 
prepared at low light intensities is independent of the actual value of the intensity, 
and that therefore the degree of polymerization is determined primarily by the chain 
propagation and transfer reactions. It is possible to calculate from his results an 
upper limit for the quantum yield assuming that such transfer does not occur, the 
rate of chain initiation now being obtained by means of the equation

\[ -\frac{d[M]}{dt} = \bar{P}\,AI[M]^2. \]

The figure obtained was 0.28. Since the light intensities he employed were quite low,
and in view of what has been said about chain transfer, it is evident that the actual 
quantum yield must be considerably lower than this. 

In connexion with the difference $E_2 - E_3$ between the propagation and transfer
activation energies, it is also interesting to observe that Bagdassarian, from his
molecular weight determinations, has obtained a value of about -2 kcal. for this
quantity, compared with the present value of -2.9 kcal. The low value of $E_3$ is
therefore confirmed.

The values calculated for the propagation and termination constants at 25° C
from the results of the more recent publications are given in table 9, together with the
frequency factors and activation energies.

Burnett & Melville, and Swain & Bartlett both used rotating sector experiments to
determine the mean life of the kinetic chains, and combined these with independent
measurements of the rate of chain starting. The latter quantity was determined by
Swain & Bartlett by means of experiments in which benzoyl peroxide was used to
initiate the chains. In effect, they found a concentration of benzoyl peroxide which
produced a rate of polymerization equal to that observed in their photochemical
experiments, and then assumed that each molecule of peroxide decomposed initiates
one kinetic chain. The rates of decomposition of the peroxide at 25° C were known
from previous experiments by Nozaki & Bartlett (1946). Swain & Bartlett claimed
their values of $k_2$ and $k_4$ to be accurate to within a factor of 5. A simple calculation
shows that, if their rate of chain initiation is reduced by a factor of 4, then the results
are in excellent agreement with the present ones. In this connexion, it is interesting
to note that Bamford & Dewar (1949b) have recently found that, in the autoxidation
of tetralin initiated by benzoyl peroxide, only one chain is initiated for about ten
molecules of peroxide decomposed.
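The "simple calculation" can be sketched on the following assumptions, which are not spelled out in the text: the sector experiment fixes the ratio k2/k4 through the mean life of the chains, while the rate of polymerization together with the rate of chain starting fixes k2^2/k4. Holding the observed quantities fixed, both constants then vary inversely with the assumed rate of chain starting. The starting values are those of Swain & Bartlett as read from table 9, and the function name is hypothetical.

```python
def rescale_for_initiation_rate(k2: float, k4: float, reduction: float) -> tuple:
    """Constants after dividing the assumed rate of chain starting by `reduction`.

    With k2/k4 fixed by the chain lifetime and k2**2/k4 inversely proportional
    to the rate of chain starting, both k2 and k4 scale by `reduction`.
    """
    if reduction <= 0.0:
        raise ValueError("reduction factor must be positive")
    return k2 * reduction, k4 * reduction


K2_NEW, K4_NEW = rescale_for_initiation_rate(1.1e3, 8e7, 4.0)
print(K2_NEW, K4_NEW)
```

On this reading the adjusted constants come to about 4.4 × 10^3 and 3.2 × 10^8, to be compared with the values of the present paper.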

Table 9. Values of constants in the polymerization of vinyl acetate

                 Swain &                Burnett &               present
                 Bartlett (1946)        Melville (1947)         paper
$k_2$ (25° C)    $1.1 \times 10^{3}$    $1.0 \times 10^{3}$     $4.6 \times 10^{3}$
$A_2$            -                      $1.65 \times 10^{6}$    $9.8 \times 10^{5}$
$E_2$            -                      4.4 kcal.               3.2 kcal.
$k_4$ (25° C)    $8 \times 10^{7}$      $3 \times 10^{8}$       $2.2 \times 10^{8}$
$A_4$            -                      $3 \times 10^{8}$       $2.2 \times 10^{8}$
$E_4$            -                      0                       0



Burnett & Melville, on the other hand, determined the rate of chain initiation by
measuring the rate of disappearance of inhibitor (p-benzoquinone) at a known light
intensity. It was assumed that one molecule of the quinone was removed per kinetic
chain, and that all chain initiation was due to light absorption by the vinyl acetate.
However, as has already been pointed out by the author (1947), the absorption
spectrum of p-benzoquinone is such that there is no spectral region where the
inhibitor technique using this substance can be used with pure vinyl acetate, due to
its internal filter effect. The argument that the duration of the induction period of
the polymerization is directly proportional to the amount of inhibitor is not con-
vincing, since so little is in fact known about the details of the action of quinone
inhibition. Burnett & Melville’s value of unity for the quantum yield of the initiating
reaction must therefore be regarded as unreliable, particularly as it is considerably
higher than the maximum possible value calculated earlier.

Even assuming a more reliable value for the quantum yield it is impossible to
bring Burnett & Melville’s results into agreement with those of Swain & Bartlett or
those of the present paper. In fact, on reducing the value of the quantum yield from
unity, the discrepancy in the values of $k_4$ becomes still larger. However, it is difficult
to discover an exact reason for the disagreement since Burnett & Melville failed to
state their exact experimental conditions. Professor Melville, in a private com-
munication to Swain & Bartlett (see Swain & Bartlett 1946, p. 2386) has stated that
light was used in the sector experiments of wave-length greater than 2500 A, and
this would agree with the statement that a cold acetic acid filter was employed.
Swain & Bartlett have therefore suggested that the discrepancy is due to almost
the whole of the light being absorbed by the vinyl acetate in an extremely small
portion of the reaction cell; and approximate calculations show that this certainly
brings the results into much better agreement. The high concentration of growing
chains in such a portion of the cell would also explain why the chain transfer reaction
could not be detected in their experiments. However, the explanation would appear
to be contradicted by the statement of Melville & Burnett (1947) that, under the
conditions which they originally used, a 40 % solution of the monomer in n-hexane
gives practically the same values for the propagation and termination constants as
does the pure monomer.

It is also possible that Burnett & Melville’s method of purification of the vinyl 
acetate was inadequate, and this would, in addition, explain the differences between 
the absorption spectra in figure 1. 

In conclusion, it may be said that the bulk of the experimental evidence seems to 
support the values for the velocity constants given in the present paper. 


References 

Bagdassarian, Ch. 1944 Acta Physicochim. U.R.S.S. 19, 266.
Bamford, C. H. & Dewar, M. J. S. 1948a Proc. Roy. Soc. A, 192, 309.
Bamford, C. H. & Dewar, M. J. S. 1948b Proc. Roy. Soc. A, 192, 329.
Bamford, C. H. & Dewar, M. J. S. 1949a Proc. Roy. Soc. A, 197, 356.
Bamford, C. H. & Dewar, M. J. S. 1949b Proc. Roy. Soc. A, 198, 262.
Briers, B., Chapman, D. L. & Walters, E. 1936 J. Chem. Soc. p. 662.
Burnett, G. M. & Melville, H. W. 1947 Proc. Roy. Soc. A, 189, 456.
Dixon-Lewis, G. 1947 Disc. Faraday Soc. 2, 319.
Houwink, R. 1940 J. prakt. Chem. 157, 16.
Leighton, W. G. & Forbes, G. S. 1930 J. Amer. Chem. Soc. 52, 3139.
Melville, H. W. & Burnett, G. M. 1947 Disc. Faraday Soc. 2, 370.
Nozaki, K. & Bartlett, P. D. 1946 J. Amer. Chem. Soc. 68, 2377.
Schou, S. A. 1929 J. Chim. phys. 26, 77.
Schulz, G. V. & Sing, G. 1943 J. prakt. Chem. 161, 161.
Swain, C. G. & Bartlett, P. D. 1946 J. Amer. Chem. Soc. 68, 2381.
Wagner, R. H. 1947 J. Polymer Sci. 2, 21.



The ignition of solid explosive media by hot wires 

By E. Jones, I.C.I. Limited, Nobel Division, Stevenston, Ayrshire

(Communicated by F. P. Bowden, F.R.S. Received 29 January 1949,
revised 7 April 1949)


The present paper describes an investigation of the physical factors affecting the ignition 
of solid explosives by heated filaments embedded in the medium. The filaments were com- 
posed of fine resistance wire and were heated electrically, the critical thermal energy required 
to cause ignition being measured for wires of different geometrical, thermal and electrical 
characteristics and for different times of heating. Systematic variation of these factors 
enabled the energy equation for the ignition process to be formulated and its terms analyzed, 
the technique involving extrapolation to zero time of heating as a means of eliminating
heat losses from the ignition system, and extrapolation to zero diameter of wire in order to 
eliminate terms involving the heating element; the former simulates the ideal case of a 
heat-insulated ignition system and the latter that of a line source of heat. 

The energy equation for ignition in these circumstances takes a simple form which implies 
that, at the moment of ignition, the heat supplied to the ignition system always equals the 
heat gained by the system plus the heat lost, the absence of any term representing heat 
generated by chemical action being very significant. For a given ignition system, the amount 
of heat absorbed up to the moment of ignition is shown to be independent of time, so that 
the increase in ignition energy with increasing time of ignition is wholly attributable to the heat 
losses sustained by the ignition system during the heating process. Further analysis shows 
that the critical factor governing ignition in systems of the type considered is the temperature,
and that the geometry of the heating element probably determines the amount of explosive 
which must be raised to the critical temperature to ensure ignition. 


Introduction 

A thermal theory of ignition and flame propagation is attractive, especially to 
physicists and engineers, because it involves temperature and heat, the essential 
elements for the application of thermodynamics to explosion problems. Briefly, 
the thermal theory postulates that an explosive medium ignites spontaneously when 
its temperature reaches a certain critical value, known as the ‘ignition temperature’,
and that a flame, once started, is capable of self-sustained propagation if the heat
communicated from the burning layer to the adjacent unburnt layer is sufficient to
raise the latter to the ignition temperature. One method of testing this theory would 
be to generate heat in the explosive medium and deduce the critical temperature 
from the critical quantity of heat required for ignition. Care should be taken, 
however, to choose an unambiguous source of heat, since the generation of heat, 
especially in a gas, is liable to cause expansion and, if this occurs very rapidly, much 
of the energy of the source may be wasted in doing mechanical work. For example, 
an electric spark passed through a loose heap of even sensitive explosives like gun- 
cotton or mercury fulminate may succeed only in scattering the material, whereas 
a smaller spark will cause ignition if the explosive is constrained. A similar effect 
may be observed when a detonator is fired in a heap of gunpowder or when a charge 
of certain high explosives is fired in an explosive firedamp atmosphere. It is not 
sufficient merely to measure the energy dissipated by the igniting source; the pro- 
portion of this energy appearing as sensible heat in the medium must be determined. 



Again, if the process of raising the explosive to its ignition state occupies a finite
time, some loss of heat to the surroundings is inevitable and such losses should, of
course, be taken into account.

Bearing these observations in mind, it will be seen that the use of a solid medium,
with its negligibly small coefficient of expansion in comparison with that of a gas,
offers a means of reducing to insignificance the energy wasted in doing work against
atmospheric pressure, no matter how rapid the heating process. Further, the use of
a rigid medium inhibits relative movement, thus preventing loss of heat by convexion
or by creating kinetic energy. These advantages appear to be afforded by the method
described by Morgan (1925), wherein use was made of the small electric igniters, or
‘low-tension fuseheads’, which form a part of the ordinary commercial electric
detonator.

The low-tension electric fusehead, as used by Morgan, consists essentially of a
match-head formed round a small resistance element, the heating of which on passing
a suitable current causes the match-head to ignite. The match-stalk consists of
a narrow strip of cardboard with metal foil on either side, the foils being ‘bridged’
at the tip by a fine resistance wire attached by soldering. The tip is ‘stepped’ so that
when the match-head is formed by the usual process of ‘dipping’, the sensitive
composition fills the step and thus completely surrounds the bridge-wire for most of
its length. The construction of the device is shown diagrammatically in figure 1.


Figure 1. Diagrammatic sketch of low-tension fusehead, showing the match-head
composition, bridge wire, solder, pressboard and brass foil.


In Morgan’s experiment ‘the object was to find the current required for ignition
when the duration of current was limited to a definite interval of time’, and the
conclusion was that ‘experiments on the ignition of highly inflammable solids by
means of very short hot wires show that the energy required for ignition increases
with diminution of the rate of heating and that, over a wide range, a linear relation
exists between the energy and the time during which it is supplied’.

If the assumption is now made that the ignition energy has a specific value for 
a given type of fusehead, it follows that its apparent increase with diminution of the 
rate of heating must be due to increasing energy losses with time; in which case the 
losses can conveniently be eliminated by extrapolating to zero time of heating. This 
critical value can be derived from Morgan’s results, but its full significance cannot be 
ascertained because of the uncertainty regarding the way in which this energy is 



Hot-wire ignition 

divided between the bridge-wire and the explosive medium. If by a suitable extension 
of Morgan’s experiment this difficulty can be resolved, it should then be possible to 
measure the critical energy which must be given to the explosive in order to effect 
ignition. The present experiments were undertaken with this object in view. 

It is significant to note that the above argument is based on the assumption that, 
in ignition systems of the type under consideration, the critical energy for ignition 
is independent of time and that it is the loss of energy in the ignition process which 
varies with time. This, therefore, is the hypothesis underlying the present investiga- 
tion and one object of the work will be to test this hypothesis. It is also worth noting 
that this hypothesis is implicit in our statement of the thermal theory of ignition so 
that this test is, in fact, a test of the thermal theory. 


Experimental 

For commercial use, fuseheads of the type described require the use of very sensi- 
tive explosives, but, so that the present investigation should not be unduly restricted 
in this respect, three match-head compositions of varying sensitivity were examined. 
The compositions were chosen to include both single explosive compounds and 
heterogeneous explosive mixtures, one a mechanical mixture of non-explosive 
ingredients. By thus introducing a wide range of chemical variables, it was hoped to 
subject our purely physical hypothesis to a more stringent test. The compositions 
actually selected were: (1) copper acetylide, (2) a mixture of four parts by weight of 
lead mononitroresorcinate (LMNR) to one of potassium chlorate, and (3) a mixture 
of five parts by weight of potassium chlorate to one of birchwood charcoal. The 
powders were bonded together to form a rigid matrix by the use of a small amount of 
nitrocellulose in each case. 

The bridge-wire can vary in material, length and diameter, all of which are capable 
of more or less rigid control. The bridges actually employed in the experiments 
comprised a series of nichrome wires varying in length from 0.9 to 2.9 mm. and in 
diameter from 1.3 to 7.8 × 10⁻³ cm., together with some miscellaneous wires of 
copper, platinum, tin and lead. 

To fire the fuseheads, direct current was applied through a pendulum time-switch 
which could be adjusted to give time intervals from 1 to 60 × 10⁻³ sec. The 
instrument was calibrated at each setting by measuring with a fluxmeter the 
quantity of electricity passing through the switch on applying a known small 
current. 

In each experiment carried out at a particular setting of the time-switch, some 
preliminary trials were made to ascertain roughly the value of the mean firing current 
corresponding to the time of application concerned and then twenty tests, each with 
a fresh fusehead, were made at each of two currents, one slightly above and the other 
just below the expected mean, the proportion of ignitions occurring at each current 
being noted. The true value of the mean firing current for that particular time of 
current application was obtained by interpolating the point at which 50 % ignitions 
would occur. 
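
The interpolation step may be sketched as follows. This is only a minimal illustration, assuming simple linear interpolation between the two test currents; the function name and the sample figures are hypothetical and are not taken from the experiments.

```python
def mean_firing_current(i_low, frac_low, i_high, frac_high, target=0.5):
    """Current at which the fraction `target` of fuseheads would ignite.

    Assumes the ignition fraction varies linearly with current between the
    two test currents (an assumption made for this sketch only).
    """
    if frac_high == frac_low:
        raise ValueError("both test currents gave the same ignition fraction")
    slope = (i_high - i_low) / (frac_high - frac_low)
    return i_low + slope * (target - frac_low)


# Hypothetical trial: 6 of 20 ignitions at 0.50 A and 16 of 20 at 0.54 A.
example_current = mean_firing_current(0.50, 6 / 20, 0.54, 16 / 20)
```

The two test currents are chosen to bracket the expected mean so that the interpolation never has to reach far beyond the observed proportions.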



E. Jones 


It may be noted that the ‘mean firing current’ is associated with a unique value 
of the application time and, to avoid any ambiguity, this will be called the ‘excitation 
time’. In other words, the excitation time is the time for which a specified current 
must be applied in order to fire, on the average, 50 % of the fuseheads. Similarly, 
the ‘mean firing current’ is the value of the firing current which, when applied for 
a stipulated time, causes 50 % of the fuseheads to ignite. 

Time of current application 

Using the experimental procedure outlined above, the mean firing currents corre- 
sponding to various times of current application were determined for a batch of 
fuseheads of the same kind. In this case, the bridge-wire had a mean length of 
1.51 mm. and a mean resistance of 1.18 Ω, the original wire being nichrome of dia- 
meter 0.0042 cm. and linear resistance 7.8 Ω/cm. The fusehead composition consisted 
of four parts of LMNR to one of potassium chlorate, the materials being bonded 
together with nitrocellulose. Four periods of current application were selected, viz. 
50, 25, 12 and 5 × 10⁻³ sec., and the results obtained are represented graphically in 
figure 2. 



Figure 2. Relation between ignition energy and excitation time. 
(Horizontal axis: excitation time, × 10⁻³ sec.) 

It will be seen that, whereas the mean firing current falls with increasing application 
time, the critical energy rises and, moreover, that a linear relationship exists 
between the critical energy for ignition and the excitation time, confirming Morgan’s 
earlier conclusion. Thus, the results, expressed in terms of energy, can be represented 
by the following equation: 

E = A + Bt, (1) 

where E is the critical energy for ignition corresponding to an excitation time t, 
A is the intercept on the energy axis and B is the slope of the line. So far, then, the 
results confirm our hypothesis in that the critical energy for ignition under given 
conditions comprises two parts, one independent of time and the other a function 
of time. The results further show that the latter term is directly proportional to 
the excitation time. 
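
The reduction of firing currents to equation (1) may be illustrated by the following sketch. It assumes that the energy supplied is the electrical energy I²Rt dissipated in the bridge and that the line is fitted by ordinary least squares; neither detail is stated in the text, and the currents listed are hypothetical values chosen only to show the procedure.

```python
def critical_energy(current_amp, resistance_ohm, time_sec):
    """Electrical energy (joule) delivered to the bridge, taken as I^2 * R * t."""
    return current_amp ** 2 * resistance_ohm * time_sec


def fit_line(xs, ys):
    """Ordinary least-squares fit y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


# Hypothetical mean firing currents (amp) at the four excitation times (sec).
times = [0.005, 0.012, 0.025, 0.050]
currents = [0.95, 0.68, 0.54, 0.46]
energies = [critical_energy(i, 1.18, t) for i, t in zip(currents, times)]

# A is the intercept at zero excitation time, B the slope (rate of heat loss).
A, B = fit_line(times, energies)
```

With figures of this kind the current falls as the time increases while the energy rises, which is the behaviour described above.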

Equation (1) is the energy equation for the ignition process and, in the general 
case, will contain thermochemical and thermodynamic as well as sensible heat terms. 
Since, however, the means of ignition in these experiments was purposely chosen to 
avoid thermodynamic complications, this equation may be taken to express a heat 
balance, in which case it may be interpreted as meaning that, of the total heat, E, 
supplied to the ignition system, a part, A, is retained in the system and the remainder, 
Bt, escapes. Accepting this interpretation, it follows that B represents the rate of 
loss of heat by the system, and the interesting conclusion is reached that this is 
a constant with respect to excitation time. 

It is desirable at this point to form some picture of the ignition system, and it 
seems reasonable to suppose that the ignition system comprises the bridge-wire 
and that portion of the surrounding explosive medium which must be brought to 
some critical condition in order that a self-sustained reaction becomes possible. As 
yet, the amount of explosive involved in an ignition is unknown, but if, as a first 
approximation, it is assumed that all the heat retained by the system is contained 
in the wire, the temperature of the wire can be calculated, and this, at least, gives 
a figure which the temperature of the system as a whole cannot exceed. The value of 
A obtained from figure 2 is approximately 4.1 × 10⁻³ joule, or 1 × 10⁻³ cal., and, as 
the thermal capacity of the wire was about 2 × 10⁻⁶ cal./° C, the maximum wire 
temperature is approximately 500° C, which, as we have seen, must be an over- 
estimate. On the other hand, the rate of loss of heat, B, is approximately 25 × 10⁻³ 
cal./sec., so that, if the excitation time (or, as it might be called, the ‘lag on ignition’) 
were 1 sec., the quantity of heat required to effect ignition would be 26 × 10⁻³ cal., 
i.e. sufficient to raise the wire to a temperature of about 13,000° C. It will be seen, 
therefore, that when the ignition process occupies a finite time and due allowance 
is not made for heat losses, the apparent ‘ignition temperature’ is liable to be 
grossly exaggerated. 
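
The arithmetic of this paragraph can be checked with a short sketch. It uses the rounded figures quoted above and the first-approximation assumption that all the heat is held in the wire; the function and variable names are illustrative only.

```python
def temperature_rise(heat_cal, thermal_capacity_cal_per_degc):
    """Temperature rise (deg C) if all the heat is retained by the wire."""
    return heat_cal / thermal_capacity_cal_per_degc


A_CAL = 1e-3            # heat retained at zero excitation time, cal.
B_CAL_PER_SEC = 25e-3   # rate of loss of heat, cal./sec.
WIRE_CAPACITY = 2e-6    # thermal capacity of the wire, cal. per deg C

# Upper limit to the temperature of the ignition system.
upper_limit = temperature_rise(A_CAL, WIRE_CAPACITY)

# Heat needed if ignition took 1 sec., from E = A + Bt, and the wire
# temperature that would be inferred if the losses were ignored.
heat_for_one_second = A_CAL + B_CAL_PER_SEC * 1.0
apparent_temperature = temperature_rise(heat_for_one_second, WIRE_CAPACITY)
```

The figure of 26 × 10⁻³ cal. is thus the sum of the retained heat and one second of losses at the rate B.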

As the derivation of the quantity A involves extrapolation to zero excitation time, 
the physical significance of this procedure requires examination. The times measured 
are, of course, the times of application of the heating current and, in extrapolating 
to zero excitation time, we are simulating conditions in which sufficient heat is 
supplied instantaneously to make ignition inevitable but not necessarily immediate. 
The instantaneous supply of a finite quantity of heat to a real system is, of course, 
a physical impossibility, and, even if such a thing were possible, it is not likely that 
uniform distribution of the heat throughout the system would result immediately. 
On the other hand, if these processes occupy a time which is short relative to the 
measured excitation times, they can be assumed to occur instantaneously, relatively 
speaking. A clearer picture is probably afforded if the extrapolation process is 
regarded as a convenient method of approaching the ideal state in which no heat is 
lost from the ignition system, i.e. the ideal case of a perfectly insulated system. The 
rate of loss of heat from the system is given by the slope of the line, and the constancy 
of this slope throughout the experimental range justifies extrapolation to the 
hypothetical point where the time interval is too short to permit any such heat to 
escape. The heat contained in the system at this point may be expressed in calories 
or, alternatively, as a function of the temperature of either the wire itself or the 
ignition system as a whole. 

Bridge-wire characteristics 

Since the ignition system comprises only two parts, it follows that, if the total 
energy retained in the system is known, the amounts claimed by both components 
are known once the portion contained in any one component is isolated. It has 
already been seen that extrapolation to zero time simulates the case of a heat- 
insulated system, which satisfies our first condition. With a wire of given material, 
e.g. nichrome, the bridge-wire is completely characterized by its length and diameter, 
so that, if these two factors are successively eliminated, the wire vanishes and we are 
left with a one-component system, thus satisfying our second condition. 

This process, however, gives no direct information regarding the effect of the 
characteristics of the wire material and, although the relevant properties must be 
self-evident once we postulate a purely thermal energy equation, the correctness, or 
otherwise, of this assumption can be subjected to the test of experiment by using 
wires of different thermal properties. Hence, it was decided to determine the effects 
on the critical energy for ignition of (a) length of bridge-wire, (b) diameter of 
bridge-wire, and (c) material of bridge-wire. 

(a) Bridge-wire length 

To examine the effect of bridge-wire length on the critical energy for ignition, five 
batches of fuseheads were made in which the ‘step’ lengths were varied so as to alter 
the lengths of the bridge-wires. The fuseheads were bridged, as before, with nichrome 
wire of diameter 0.0042 cm. and linear resistance 7.8 Ω/cm. As it was necessary to 
test the electrical resistance of each fusehead, the most convenient method of deter- 
mining the lengths of the bridge-wires was to calculate them from the fusehead 
resistances and the known linear resistance of the wire. The mean lengths of the 
bridge-wires in the five lots of fuseheads were 0.87, 1.03, 1.61, 2.03 and 2.85 mm. 
respectively. The same match-head composition was used throughout, namely, 
80 % of LMNR and 20 % of potassium chlorate. As before, the mean firing currents 
corresponding to four different application times, ranging from 5 to 50 × 10⁻³ sec., 
were determined for each of the five batches, the results of this experiment being 
represented graphically in figure 3. 
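
The estimate of bridge-wire length from resistance amounts to a single division, sketched below. The sketch assumes that the whole of the measured fusehead resistance lies in the bridge-wire, the foils and soldered joints being neglected; the function name is illustrative.

```python
def bridge_length_mm(fusehead_resistance_ohm, linear_resistance_ohm_per_cm):
    """Bridge-wire length (mm.) from fusehead resistance and ohms per cm."""
    length_cm = fusehead_resistance_ohm / linear_resistance_ohm_per_cm
    return 10.0 * length_cm


# Figures quoted earlier for the first batch: 1.18 ohm on wire of 7.8 ohm/cm.
first_batch_length = bridge_length_mm(1.18, 7.8)
```

Applied to the earlier batch, this reproduces the mean length of about 1.51 mm. given in the text.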

As before, the results are represented by straight lines, the critical energy 
decreasing with decreasing excitation time but remaining finite when extrapolated 
to zero excitation time. Both the intercepts on the energy axis and the slopes of the 
lines vary with the length of the bridge-wire, so that, in the light of this new evidence, 
our original energy equation must be qualified, since it is true only for a particular 
length of bridge-wire. Consequently, we must write 

E = A + Bt, when l is constant, (2) 

where l is the length of the bridge-wire. 

Figure 3. Effect of time of current application on critical energy for ignition. 
(Bridge-wires of different lengths shown in mm. Axes: critical energy against 
excitation time, × 10⁻³ sec.) 

Figure 4. Effect of bridge-wire length on critical energy for ignition 
(varying excitation time shown in sec. × 10⁻³). (Axes: critical energy against 
length of bridge-wire, mm.) 

In figure 4, the critical energies for ignition have been plotted against length of 
bridge-wire for different excitation times, and, again, the results can be represented 
by a series of straight lines cutting the energy axis at positive values and having 
positive gradients. Both the intercepts on the energy axis and the slopes of the lines 
vary depending on the excitation time, so that the relationship between critical energy 
for ignition and length of bridge-wire may be represented by 

E = C + Dl, when t is constant, (3) 

and, on combining (2) and (3) algebraically, we have 

E = F + Gl + Ht + Jlt, when both l and t vary. (4) 

The results, however, show that when the intercepts, E₀, on the energy axis in 
figure 3 are plotted against length of bridge-wire, as in figure 5, the line passes through 
the origin, which signifies that, when t = 0, equation (4) reduces to E = Gl, and, 
therefore, F = 0. Our energy equation thus becomes 

E = Gl + (H + Jl)t. (5) 

The physical interpretation of equation (5) is still that the heat supplied to the 
ignition system is equal to the heat absorbed plus the heat lost, but now we find that 
the heat absorbed is directly proportional to the length of the bridge-wire, whereas 
the heat losses are of two kinds, one independent of the length of the bridge-wire and 
the other directly proportional to it. As the only variable concerned is the linear 
dimension of the bridge-wire, the fact that the heat losses are of two kinds 
immediately suggests end-effects and lateral effects. In other words, there is loss of 
heat from the ends to the soldered joints and the metal foils, which is independent of 
the length of the bridge-wire, and there is also lateral loss of heat through the 
explosive medium, this being directly proportional to the length of the bridge-wire. 
The results are, therefore, consistent with our original supposition that the term Bt 
in equation (1) does, in fact, represent the heat losses. Further, we have now arrived 
at the interesting conclusion that the heat transmitted to the explosive medium per 
unit length of wire is the same at all points along the wire. 
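
The separation of terms in the energy equation for varying length and time can be made explicit in a small sketch. The constants G, H and J are those of that equation, but the numerical values used here are hypothetical and serve only to show how each term scales with length.

```python
def energy_budget(length_cm, time_sec, G, H, J):
    """Split E = G*l + (H + J*l)*t into its three physical parts."""
    retained = G * length_cm                   # heat absorbed by the system
    end_loss = H * time_sec                    # loss at the ends, independent of l
    lateral_loss = J * length_cm * time_sec    # loss through the explosive
    return {
        "retained": retained,
        "end_loss": end_loss,
        "lateral_loss": lateral_loss,
        "total": retained + end_loss + lateral_loss,
    }


# Hypothetical constants (cal./cm., cal./sec., cal./cm./sec.), illustration only.
short_bridge = energy_budget(0.1, 0.02, G=6e-3, H=1e-2, J=1e-1)
long_bridge = energy_budget(0.2, 0.02, G=6e-3, H=1e-2, J=1e-1)
```

Doubling the length doubles the retained heat and the lateral loss but leaves the end loss unchanged, which is the distinction drawn in the text.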

Since the first term on the right-hand side of equation (5) represents the heat 
retained in the system, we can calculate, as before, the temperature of the wire on 
the assumption that the whole of this heat remains in the wire. This was done earlier 
for one length of bridge-wire, i.e. 1.51 mm., and, since E/l is constant when t = 0, 
the temperature will be the same for all lengths of bridge-wire, namely, approxi- 
mately 500° C. In other words, the temperature is constant all along the length of 
the bridge-wire, and it is not surprising that both the rate of loss of heat at the ends 
and the rate of loss of heat into the composition per unit length of wire are constant. 

(b) Bridge-wire diameter 

The first series of experiments made with a view to examining the effect of bridge- 
wire diameter on the critical energy for ignition was carried out with a range of 
nichrome wires varying in diameter from 0.00126 to 0.00422 cm. The step length was 
kept constant with the object of preserving a constant bridge-wire length throughout 
the series, but, owing to difficulties in the manipulation of such fine and fragile wires, 
control of this and other experimental factors was not so effective as in the earlier 
experiments. One batch of fuseheads was made with each diameter of wire, the mean 
length of the bridge-wire in each case being estimated, as before, from the mean 
fusehead resistance and the known linear resistance of the original wire. The match- 
head composition was the same as before, i.e. 80 % of LMNR and 20 % of potassium 
chlorate, and the fuseheads were tested along similar lines. 

It will be seen from figure 6, where the critical energy per cm. length of bridge-wire 
has been plotted against excitation time for each diameter of bridge-wire, that the 
characteristic linear relationship is again evident, but the family of lines is not so 
well ordered as in the previous experiment, presumably due to uncontrolled experi- 
mental variables. The inconsistency appears to be associated with the slopes of the 
lines, with the consequence that the discrepancies are magnified at the higher excita- 
tion times. On extrapolating to zero excitation time, the critical energies are found 
to be arranged in a logical order, increasing progressively with increasing diameter 
of bridge-wire. These features accord with our hypothesis, which predicts that 
experimental conditions may well affect the leakage of energy, but the basic criteria 
for ignition by a standard method, or model, are characteristic of the explosive 
medium, which, of course, was constant throughout the series. However, the analysis 
of the quantity G requires a knowledge only of the critical energies at zero excitation 
time, and the appropriate values for the different diameters of bridge-wire, obtained 
from figure 6, are plotted against the area of cross-section of the bridge-wire in 
figure 7. It will be seen that the results can be represented by a straight line, the 
equation of which may be written 

G = E₀/l = K + La, (6) 

where E₀ is the critical energy corresponding to zero excitation time, a is the cross- 
sectional area of the bridge-wire and K and L are constants with respect to a. 

Figure 6. Relation between excitation time and critical energy for ignition. 
(Bridge-wires of different diameters shown in cm. Horizontal axis: excitation 
time, × 10⁻³ sec.) 

Figure 7. Effect of area of cross-section of bridge-wire on … (Horizontal axis: 
cross-sectional area, × 10⁻⁶ cm.²) 

Thus the term G is shown to consist of two parts, one independent of the geometry 
of the heating element and the other directly proportional to the cross-sectional area 
of the bridge-wire. The inference is that we have at last separated the two com- 
ponents of our ignition system, the term La referring to the bridge-wire and K to the 
portion of the explosive medium actively involved in an ignition. Accepting this 
view and bringing equation (6) back into energy units, we have 

E₀ = Kl + Lal = Kl + Lv, (7) 

where v is the volume of the bridge-wire. This, of course, is the energy equation for 
a completely insulated ignition system, and we now see that the energy actually used 
in igniting the explosive medium is Kl which, as might have been expected, is directly 
proportional to the length of the ignition system. The heat contained in the bridge- 
wire at ignition, and so not effectively participating in the ignition of the explosive 
medium, is, on the contrary, directly proportional to the volume of the bridge-wire. 
Moreover, the constant L has the dimensions of heat per unit volume and, with wires 
of the same material, i.e. of the same thermal capacity per unit volume, this means 
that the temperature of the wire at ignition is constant irrespective of the length or 
diameter of the bridge-wire. In other words, we have not yet succeeded in making 
any change in the bridge-wire which has had any effect on the temperature obtaining 
at ignition. We are thus led to conclude that this temperature is not in any way 
affected by the characteristics of the bridge-wire and must, therefore, be a character- 
istic of the explosive. In other words, the ‘ignition temperature’, as postulated by 
the thermal theory of ignition, is a physical reality. 

Having apparently isolated the heat contained in the bridge-wire, it is now possible 
to make a closer estimate of its temperature, i.e. the temperature at which it ignites 
a mixture of four parts of LMNR to one part of potassium chlorate bonded together 
with nitrocellulose. We obtain L from the slope of the line in figure 7, and this is 
approximately 1450 joule/cm.³, or about 350 cal./cm.³. The thermal capacity of 
nichrome being approximately 0.92 cal./cm.³/°C, the temperature of the wire at 
ignition would appear to be about 380° C. This is only a rough estimate, which ignores 
such factors as the temperature coefficients of resistance and specific heat, but as 
the present discussion is primarily concerned with the principles of ignition, numer- 
ical figures are of secondary importance. The significant point is that, whereas our 
first approximation gave a value of 500° C, our second gives 380° C, and we may 
further anticipate that, when the energy equation is fully established, this too may 
have to be amended. 
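
The second estimate of temperature can be reproduced in a few lines. The sketch takes the slope L and the thermal capacity of nichrome as quoted, and assumes a conversion of 4.186 joule per calorie; the variable names are illustrative.

```python
JOULE_PER_CAL = 4.186  # assumed conversion factor

L_JOULE_PER_CM3 = 1450.0      # slope of the line in figure 7
NICHROME_CAPACITY = 0.92      # cal. per cm.^3 per deg C

# Heat per unit volume of wire at ignition, in calories.
L_cal_per_cm3 = L_JOULE_PER_CM3 / JOULE_PER_CAL

# Temperature rise of the wire, if this heat is all retained in the wire.
wire_temperature_rise = L_cal_per_cm3 / NICHROME_CAPACITY
```

The unrounded quotient is a little below the rounded figures of 350 cal./cm.³ and 380° C given above, which is within the accuracy claimed for the estimate.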

(c) Bridge-wire material 

The results have now led to the following formulation of the energy equation for 
ignition of a solid explosive medium by hot wires: 

E = (Kl + Lv) + (H + Jl)t = E₀ + kt, (8) 

where E₀ = Kl + Lv (7) 

and k = H + Jl. (9) 

Equation (9) gives the rate of dissipation of heat outside the ignition system. No 
attempt will be made here to analyze this factor, largely because it plays a secondary 
role in the ignition process and its consideration in detail would complicate the 
discussion unnecessarily. It may play a more prominent role in the propagation 
of ignition, that is, if, as is commonly supposed, the unignited material is brought 
to its ignition point by heat conduction. 

Taking equation (7), we have deduced that the term Kl refers to the explosive 
component of our ignition system and, for a given length of bridge, may therefore 
be expected to be indifferent to variations in the material of the bridge-wire. Hence, 
dividing throughout by l, we have 

E₀/l = K + Lv/l, 

where K is now a constant with respect to bridge-wire material. Also, according to 
our interpretation, the term Lv/l represents the heat retained in unit length of 
bridge-wire and equals the rise in temperature, T, multiplied by the thermal capacity, 
c, of the wire per unit length. But we have deduced that, for a given explosive medium, 
T is constant, so that, if we vary the bridge-wire material, keeping the explosive 
medium constant, equation (7) becomes 

E₀/l = K + Tc, (10) 

where T is now a constant. Thus, by plotting E₀/l against c, we should, if our 
inferences are correct, obtain a straight line, the inclination of which is the actual rise in 
temperature of the wire. 
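
The quantity c in equation (10) follows from the wire diameter and the thermal capacity of the material per unit volume, as the sketch below shows for the nichrome wire. The values taken for K and T are the approximate figures reported in this paper for the LMNR/chlorate composition and are used here purely for illustration; the function names are not part of the original work.

```python
import math


def capacity_per_cm(volumetric_capacity, diameter_cm):
    """Thermal capacity of 1 cm. of wire (cal. per deg C per cm.)."""
    area = math.pi * diameter_cm ** 2 / 4.0
    return volumetric_capacity * area


def energy_per_cm(K, T, c):
    """Equation (10): critical energy per unit length at zero excitation time."""
    return K + T * c


# Nichrome: 0.92 cal./cm.^3/deg C, diameter 0.0042 cm.
c_nichrome = capacity_per_cm(0.92, 0.0042)

# Illustrative values: K = 0.5e-3 cal./cm., T = 380 deg C.
predicted = energy_per_cm(0.5e-3, 380.0, c_nichrome)
```

Since c varies as the square of the diameter, the much thicker lead and tin wires extend the range of c far beyond that of the three fine wires.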

Strictly speaking, these predictions are based on the assumption that the geometry 
of the ignition system is not significantly altered, but it is not easy in practice 
to preserve the same dimensions of bridge-wire and still provide a reasonable range 
of variation in thermal properties. Nichrome, platinum and copper wires of comparable 
diameters were conveniently available, but the lead and tin wires used were 
much thicker, the former being approximately seven times and the latter about four 
times the diameter of the first three. These materials afforded a wide range of specific 
resistance, specific gravity, specific heat and thermal conductivity, besides which it 
was hoped that the low melting-points of tin and lead, at which fusion of the wires 
might be expected, would provide an independent check on the temperature. Using 
these wires to form the bridges and 80/20 LMNR/potassium chlorate as the match- 
head composition, a series of fuseheads was made and submitted to the usual tests. 
Some difficulty was experienced in manipulating the lead and tin wires so that a 
longer bridge had to be accepted in these two cases, the bridge-wire length being 
7.7 to 7.8 mm. compared with 1.4 to 1.5 mm. for nichrome, platinum and copper. 
For each type of fusehead, the mean firing currents at four or five different applica- 
tion times were determined, but the range of application times obtainable with the 
tin and lead wires had to be limited because the fuseheads required firing currents 
greatly exceeding any contemplated when the apparatus was designed. The results 
obtained are represented graphically in figure 8, where the critical energies per unit 
length of bridge-wire have been plotted against the excitation time. 

Except for the lead wire, which required very heavy firing currents and gave 
rather erratic results, the characteristic straight-line relationship between critical 
energy for ignition and excitation time is again reproduced. In the case of the lead 
wire, the line has been drawn in what is considered the most likely position and can 
only be regarded as a plausible representation of the experimental results. 

Figure 8. Effect of bridge-wire material on critical energy for ignition. 
(Horizontal axis: excitation time, × 10⁻³ sec.) 

Figure 9. Relation between critical energy for ignition and thermal capacity of bridge-wire. 
(Horizontal axis: thermal capacity of bridge-wire, cal./cm.) 

The values of E₀/l, expressed in cal./cm., have been plotted against the thermal 
capacity of the wire/cm. length in figure 9, and it will be seen that the results suggest 
a linear relationship between these two quantities, as predicted. The slope of this 
line, which should represent the temperature of the wire at ignition, gives a numerical 
value for the uncorrected wire temperature of approximately 310° C, which compares 
reasonably well with the previous figure of 380° C considering the magnitude of the 
quantities of heat these measurements involve. Again, by extrapolating to zero heat 
capacity, we can eliminate the substance of the wire (but not its diameter on this 
occasion) and find how much heat is required to ignite the explosive medium when 
none is retained by the wire. The numerical value of this critical quantity of heat is 
approximately 2.5 × 10⁻³, which is of the same order of magnitude as the previous 
estimate of 0.6 × 10⁻³ cal. It is difficult to say at this stage whether or not the 
discrepancies between these numerical estimates are significant, but the fact that an 
increase in the quantity of heat attributed to the explosive medium is accompanied 
by a drop in the temperature of the wire is rather disturbing. This point will be 
reverted to later. 

Another interesting test which can be applied to the results is to calculate, as 
before, the temperature of the bridge-wire on the assumption that no heat is given up 
to the explosive medium. The results of this calculation are represented in figure 9, 
and it will be seen that, with increasing heat capacity of the bridge-wire, the apparent 
temperature at first falls extremely rapidly but soon approaches a steady value 
which agrees with the figure given above, namely, approximately 310° C. Thus 
extrapolation to infinite heat capacity gives much the same result as extrapolating 
to zero heat capacity, which further emphasizes the fact that the temperature of 
the wire at ignition has little to do with the characteristics of the bridge-wire. 

The attempt to estimate the ignition temperature from the melting-points of the 
wire materials, on the assumption that a wire fuses at or near its melting-point and, 
by breaking the circuit, arrests any further rise in temperature, was not successful, 
possibly because the molten metal was retained in position by the rigid matrix 
surrounding it. On the other hand, the possibility that the actual temperature of 
ignition may be much lower than these approximate estimates cannot be entirely 
ignored. 


Fusehead composition 

It has been shown that, to effect ignition of a solid explosive medium by a heated 
filament, it is necessary not only to establish a certain critical temperature but also 
to communicate a certain critical amount of heat. Whereas the first criterion follows 
from the thermal theory, the significance of the latter is not immediately obvious. 
If, however, we assume that the temperature of the explosive component of our 
ignition system is constant, the critical quantity of heat corresponds to a critical 
quantity of explosive heated to the critical temperature, and we have now introduced 
a geometrical factor. 

The ignition system has finite dimensions and must be expected to remain finite 
even when the heating element tends towards vanishing point. At this point the 
system has been reduced to its most elementary form; the source of heat is a 
mathematical line having no substance, and the ignition system contains explosive medium 
only. If we now make the bold assumption that this limiting size is determined by 
the geometry of our chosen model, i.e. that it has no connexion with the 
thermochemistry of ignition, we can treat this factor as a constant with respect to the 
explosive medium. Interpreted broadly, this means that, with a line source of heat, 
the amount of explosive concerned in an ignition is the same whatever the explosive, 
and the critical quantity of heat will be proportional to the critical temperature 
for all explosive media. While this may be far too simple a view of the ignition process, 
it provides a basis for analyzing the relationship, if any, between these two critical 
quantities when the explosive medium varies. 





The explosive media used for this purpose were (a) copper acetylide, and (b) a 
mixture of five parts by weight of potassium chlorate to one of birchwood charcoal, 
the solids being bonded together, as before, with nitrocellulose. The preparation 
and testing of the fuseheads followed the same lines as those described previously. 
The critical energies at zero excitation time, expressed in cal./cm., have been plotted 
against the thermal capacity of the bridge-wire, also expressed in cal./cm., in 
figure 10 for the two compositions concerned. This graph also includes the earlier data 
for the LMNR/chlorate composition obtained (1) by varying bridge-wire diameter 
and (2) by varying bridge-wire material. Figure 10 thus summarizes the results. 

Figure 10. (Horizontal axis: thermal capacity of bridge-wire, cal./cm.) 

It will be seen that the relationship between the critical energy for ignition and the 
thermal capacity of the bridge-wire is linear for a given explosive, but that the 
relationship varies from one explosive to another. When the slope of the line was 
found to be independent of the characteristics of the bridge-wire, we had already 
deduced that it was probably a function of the explosive, and it is now shown that 
this inference was correct. Again, the slope of the line is a measure of the apparent 
temperature of the bridge-wire at ignition and, confining our attention to the results 
obtained by extrapolation to zero wire diameter, we find that the critical temperatures 
for the LMNR/chlorate, copper acetylide and charcoal/chlorate compositions are 
respectively 380, 440 and 740° C approximately. Associated with these critical 
temperatures we find critical quantities of 
heat of approximately 0.5, 0.6 and 1.0 × 10⁻³ cal./cm. respectively. It will be noted 
that these two criteria for ignition bear an approximately constant numerical ratio 
one to the other and, while some doubt is felt if the accuracy of the present method 
justifies this extreme step in the analysis, this does suggest, as we have seen, that 
with a line source of heat, the amount of explosive concerned in an ignition is constant. 
If the temperature depends on the explosive and the quantity on the source, then it 
may be inferred that, in the ignition of an explosive medium by a source of heat 
conforming to a particular geometrical model, certain geometrical requirements are 
superimposed on those arising from the physico-chemical properties of the medium 
itself. 
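The approximate constancy of this ratio can be checked directly from the figures quoted above. The short Python sketch below divides each critical temperature by the corresponding critical quantity of heat. Only the leading figures 0·5, 0·6 and 1·0 are used, since a common power of ten does not affect the comparison; the variable names are illustrative and are not taken from the paper.

```python
# Critical temperatures (deg C) and leading figures of the critical heats
# (cal./cm., common power of ten omitted) as quoted in the text.
compositions = ["LMNE/chlorate", "copper acetylide", "charcoal/chlorate"]
critical_temperature = [380.0, 440.0, 740.0]
critical_heat = [0.5, 0.6, 1.0]

# Ratio of the two ignition criteria for each composition.
ratios = [t / q for t, q in zip(critical_temperature, critical_heat)]

mean_ratio = sum(ratios) / len(ratios)
# Largest fractional departure of any ratio from the mean.
max_spread = max(abs(r - mean_ratio) for r in ratios) / mean_ratio

for name, r in zip(compositions, ratios):
    print(f"{name:20s} ratio = {r:6.1f}")
print(f"largest departure from the mean: {100 * max_spread:.1f}%")
```

On these rounded figures the three ratios differ from their mean by only a few per cent, which is the sense in which the two criteria bear an approximately constant ratio.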

Referring to the results obtained with bridge-wires of different materials, it will 
be appreciated that, in this experiment, the substance of the wire was made to 
vanish without eliminating its size and shape, the result being that our ignition 
system was left with an empty space along its axis. According to our results, this 
had the effect of slightly reducing the critical temperature and substantially in- 
creasing the critical quantity of explosive concerned in an ignition. This suggests 
that the critical amount of explosive tends to increase with increasing diameter of 
the heating element, and this obviously throws some doubt on the validity of our 
assumption that the whole of the energy represented by the term Lv is contained in 
the bridge-wire. If part of this energy is actually located in the explosive medium, 
our estimated temperatures are still too high, and, until this point is clarified, the 
calculated temperatures can only be regarded as approximations, the true tem- 
peratures being probably somewhat lower. 

Discussion 

Taking the results of the investigation as a whole, it may be concluded that the 
ignition of a solid explosive by a heated filament embedded in the medium is a purely 
thermal process. When suitable precautions are taken to avoid conversion of heat 
into work, the energy equation takes a simple form which may be interpreted as 
showing that the critical factor, in so far as the explosive is concerned, is the tem- 
perature. The geometry of the heating element probably determines the amount of 
explosive which must be raised to the critical temperature to ensure ignition. These 
criteria are independent of the time factor, and the increase in ignition energy ob- 
served with increasing time of heating is attributable to heat losses sustained by the 
ignition system during the heating period. 

Strictly speaking, these conclusions apply only to ignition under the conditions 
specified, and it is well to bear in mind the limitations of the present investigation. 
In the first place, the experiments were confined to a range of solid explosives, and 
an extension of the work to other physical states of matter would seem desirable. 
Moreover, as the solids were bonded together to form a rigid matrix, the ignition 
system may be described as static, whereas ignition often occurs under conditions 
which, for contrast, might be termed dynamic. To quote the illustration used in the 
introduction, heat cannot be generated in a gas at constant pressure without at the 
same time causing expansion and doing external work. In such a case the energy 
equation must contain the appropriate thermodynamic terms. Furthermore, the 
present model is based on a line source of heat, and it would facilitate mathematical 
generalization if data were available for another simple model, such as a point source 
or plane surface. 

Granting that the above conclusions have only a limited application, it is never- 
theless very remarkable that the energy equation for ignition appears to contain no 
term involving the heat of reaction. It would appear, therefore, that the heat of 
reaction does not enter into the process of ignition to any significant extent, and it 
must be concluded that, under these conditions of ignition, the specified criteria must 
be completely fulfilled before the exothermic phase of the reaction begins. Pre- 
sumably, therefore, ignition occurs suddenly and simultaneously throughout the 
ignition system whenever the critical state is reached. 

In conclusion, the author wishes to acknowledge that the experiments described 
in this paper were designed and carried out by Dr H. P. Stout, who was also largely 
responsible for the development of the experimental methods. Thanks are also due 
to Messrs G. Richards, I. G. Cumming and W. O. Warnock for assistance given at 
various stages in the investigation. 


Reference 

Morgan, J. D. 1925 Phil. Mag. 49, 323. 



Heisenberg’s S matrix for a system of many particles 


By R. J. Eden, Peterhouse, University of Cambridge 
(Communicated by P. A. M. Dirac. Received 12 March 1949) 


When creation and annihilation of particles is forbidden, the general features of a many- 
particle collision are illustrated by a scattering system containing two particles and a fixed 
scattering centre. The term in the wave function, which is called by Møller the ‘outgoing’ 
part, is shown to contain a term representing a totally outgoing wave, but also terms describing 
the interference effects between the incident wave of one particle and the outgoing wave 
of the other. Corresponding to the latter terms, there exist singular eigenstates of the S 
matrix which are simultaneous eigenstates of the separate kinetic energies of the particles. 

The remaining eigenstates are called non-singular, and for these only the sum of the kinetic 
energies can be given a definite value. Analytic continuation of the non-singular eigenstates 
in the complex plane of total kinetic energy shows that the corresponding eigenvalues of S 
can be used to determine the energy levels of states with both particles bound to the centre 
of force. The eigenvalues for the singular eigenstates will lead to the bound energy of a single 
particle in the scattering field of the other. The formalism is extended to include singular 
eigenstates which describe the scattering of one particle on a compound centre made up of 
the other particle bound to the scatterer. 

1. Introduction 

The usual form of quantum theory contains many quantities which are not observ- 
able. These quantities are introduced as auxiliary variables for convenience in 
calculation and are only indirectly related to experimental results. Examples are 
wave functions or dynamical variables which are written as functions of space-time 
and therefore can be exactly localized. Such localizability would be invalid in a 
physical theory if, for instance, there is some universal constant which plays the 
part of a minimal length. 

Heisenberg (1949) has suggested that the divergence difficulties of quantum-field 
theories are a direct consequence of the use of localizable variables. However, it 
seems essential that those parts of quantum theory which are directly related to 
experimentally observed quantities must also be incorporated in any new theory of 
elementary particles. The theory of the characteristic matrix, or S matrix, is an 
attempt to set up a framework for a future theory which will contain none of the 
divergences but all the experimental results of present quantum mechanics. 

The S matrix is based on the idea of a stationary collision in a system of particles, 
and its matrix elements are closely connected with collision cross-sections for various 
processes. The original formulation of the theory was given by Heisenberg (1943 a, b), 
and the general properties of S have been studied by Møller (1945, 1946). There are 
two main approaches to the development of S-matrix theory. One is to study the 
properties of the S matrix obtained from non-relativistic quantum mechanics, and 
show that it will lead to all experimental results which have previously been obtained 
from a Hamiltonian. In addition to cross-sections these include the energy levels 
of bound states of two or more particles and radio-active decay constants. The second 
approach is to obtain properties, from non-relativistic quantum mechanics, which 
are restrictions on the S matrix itself. Such properties must be independent of the 
particular form of Hamiltonian, and may then be assumed to apply even if no 
Hamiltonian exists. The most important of these are the unitary condition which 
enables S to be interpreted as a transformation matrix, and the relativistic invariance 
of the eigenvalues of S. Both these are general conditions; the outstanding require- 
ment is a special condition, which depends on the particular dynamical system 
under consideration. If S can be determined from these conditions, aided by a need 
for simplicity, they would form the basis for a new theory. 

In this paper the author will be concerned principally with the first method of 
approach. The properties of a many-particle system are studied, with particular 
reference to the energy levels of bound states. 

The Lorentz invariance properties of S have been studied by Møller (1945) for 
a collision of two particles, and he has made a formal extension to a many-particle 
collision. If one is not concerned with Lorentz invariance, the co-ordinates of the 
centre of gravity may be eliminated. The system is then mathematically equivalent 
to one in which particles are scattered on a fixed centre of force, with conservation of 
energy but not of momentum. Throughout this paper the author works in terms of 
a system of this type. 

In their papers on S-matrix theory, Heisenberg (1943 a) and Møller (1945) have 
assumed that the wave function for scattering may be put in a standard form having 
two parts, one representing the incident wave, and the other the ‘outgoing’ wave. 
For a system of one particle and a fixed scattering centre, Dirac (1948) has given the 
justification for this standard form of the wave function. The ‘outgoing’ wave does 
in fact correspond to the particle moving outward in the asymptotic region of co- 
ordinate space. It is shown in this paper, for a many-particle system, that the 
assumption of the standard form is related to the physical assumption that the 
particles interact only in a finite region of space. This implies that no Coulomb forces 
are present. The ‘outgoing’ part of the wave function corresponds to a state in which 
at least one particle is moving outward in the asymptotic region of co-ordinate 
space. 

For the scattering of one particle, Heisenberg (1944) has shown that analytic 
continuation of the eigenvalues of S, in the complex energy plane, will lead to the 
energy levels of closed states in which the particle is bound to the scattering centre. 
The main purpose of the present paper is to extend the work of Heisenberg to 
systems of two or more particles with a scattering centre. It is assumed throughout 
that the number of particles is a constant of the motion. A system in which particles 
can be created and annihilated seems to present S-matrix theory with major diffi- 
culties which can probably be overcome only by new assumptions. Some study of 
this problem has been made by Hu (1949). His assumptions lead to a serious 
negative-energy difficulty. 

It is shown first that the wave function, for scattering of two particles with no 
mutual interaction on a fixed centre, may be put into the standard form. The S 
matrix for the compound system is the product of the separate S matrices. This 
result can be extended to an arbitrary number of particles. After considering two 
particles having small mutual interaction, the interpretation of the ‘outgoing’ 
part of the wave function is examined. This is made up of interference terms between 
‘incident’ waves of one particle and outgoing waves of the other, as well as the totally 
outgoing wave. The ‘incident’ wave of a particle is defined as the state in which this 
particle does not interact with the rest of the system; it will contain both incoming 
and outgoing parts. The product of the incident states of all the particles is called 
the initial state of the system. 

In order to investigate bound states of a system containing two particles and 
a scattering centre, a superposition of states is formed. The basic states of the 
superposition correspond to incident plane waves of equal energies. The weight 
function for the superposition can be chosen so that the total state is an eigenstate 
of the S matrix. Then the weight function can be called an eigenfunction of S; it 
determines the initial state of the system, which differs from the final state only by 
a phase factor. This phase factor is an eigenvalue of S. 

Since the total energy W of the system commutes with S, every eigenfunction of 
S and W will contain a δ-function factor in the total energy of the two particles. 
Physically this means that, in a collision which is an eigenstate of S, one can know 
exactly the asymptotic value of the total kinetic energy. If, for a particular eigen- 
state, one knows the separate asymptotic values of the kinetic energies of the two 
particles, the eigenfunction will contain two δ-function factors, one for each kinetic 
energy. I call this a singular eigenfunction. It corresponds to a state in which the 
exchange of energy between the two particles is zero, so that asymptotically the 
separate kinetic energies are conserved. A necessary condition for the existence of 
singular eigenfunctions is that S contains terms having two δ-function factors in the 
separate kinetic energies. Such terms are always present in S, either as a product of 
incident waves, or as interference waves between one particle incident and the 
other outgoing. An eigenfunction which contains only the one δ-function factor in 
total kinetic energy is called a non-singular eigenfunction. The existence of such 
eigenfunctions is ensured if the separate kinetic energies do not commute with S. 

By obtaining the asymptotic form of the wave function in co-ordinate space, we 
can investigate the conditions for bound states. For a non-singular eigenstate of S, 
analytic continuation of the corresponding eigenvalue, in the complex plane of total 
kinetic energy, leads to the energy levels of states with both particles bound to the 
centre of force. For a singular eigenstate, keeping the kinetic energy of one particle 
fixed, analytic continuation of the eigenvalue leads to the bound energy of the 
other particle. 

The important case, where the fixed energy value is the bound energy of one 
particle, can be included in this scheme. It corresponds to a state in which one 
particle is scattered on a compound centre made up of the other particle bound to 
the scatterer. Since it is a singular eigenstate, ionization of the bound particle does 
not take place. When the total energy of the system is too low to allow ionization, 
the only possible states are a superposition of these singular eigenstates. With their 
corresponding eigenvalues these states define an S matrix which I call the ‘partial’ 
S matrix. It is not analytic and indicates new difficulties in the theory which are 
considered in more detail in another paper. 

The possibility of a mixed eigenfunction, containing both singular and non- 
singular parts, is considered. It is shown that these parts may be regarded as separate 
eigenfunctions of S, belonging to the same eigenvalue. This has the consequence 
that the singular part must correspond to a state in which one particle is bound. 

Finally, some aspects of the extension to an arbitrary number of particles are 
discussed. It seems unlikely that any new points of principle would arise in making 
this extension. 


2. Two-particle scattering with no mutual interaction 

Units are taken in which $\hbar = c = 1$. Denote the mass, momentum, and kinetic 
energy of a particle by $\kappa_n$, $\mathbf{k}_n$, $W_n$ for $n = 1, 2$. Then $W_n^2 = k_n^2 + \kappa_n^2$. 

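We consider two-particle scattering on a fixed centre of force, so that energy but 
not momentum must be conserved. Since there is no mutual interaction, the system 
can be treated as the product of two single-particle scattering systems. In the 
momentum space of one system the incident wave can be represented by a ket 
vector $|\mathbf{k}_n^i\rangle$, denoting a plane wave with momentum $\mathbf{k}_n^i$. Then the wave function 
takes the form 

$$\langle\mathbf{k}_n|\psi_n|\mathbf{k}_n^i\rangle = \delta(\mathbf{k}_n-\mathbf{k}_n^i) + \delta_+(W_n-W_n^i)\,\langle\mathbf{k}_n|r_n|\mathbf{k}_n^i\rangle, \tag{2.1}$$

where 

[equation (2.2), giving the elementary definition of $\delta_\pm(x)$, is illegible in the source] 

and $\delta(x)$ denotes Dirac’s $\delta$ function. 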

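For all possible initial values $\mathbf{k}_n^i$ of the momentum, and all possible values $\mathbf{k}_n$, 
the wave functions (2.1) form the wave matrix $\psi_n$. Since the particles 1 and 2 have 
no mutual interaction, the wave function for the two-particle system is 

$$\langle\mathbf{k}_1\mathbf{k}_2|\psi|\mathbf{k}_1^i\mathbf{k}_2^i\rangle = \langle\mathbf{k}_1|\psi_1|\mathbf{k}_1^i\rangle\,\langle\mathbf{k}_2|\psi_2|\mathbf{k}_2^i\rangle, \tag{2.3}$$

where the initial state is $|\mathbf{k}_1^i\rangle\,|\mathbf{k}_2^i\rangle = |\mathbf{k}_1^i\mathbf{k}_2^i\rangle$. 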

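Since, in forming (2.3), we shall be concerned with products of improper functions, 
it is necessary to use a more rigorous definition than (2.2) for $\delta_\pm(x)$. This is defined 
only when taken under an integral with respect to $x$, by 

$$\int_{-\infty}^{\infty}\delta_\pm(a)\,da = -\frac{1}{2\pi i}\lim_{\sigma\to+0}\int_{-\infty}^{\infty}\frac{da}{\pm a+i\sigma}, \tag{2.4}$$

$$\int_{-\infty}^{\infty}\delta(a)\,da = \int_{-\infty}^{\infty}\{\delta_+(a)+\delta_-(a)\}\,da = \frac{1}{2\pi i}\lim_{\sigma\to+0}\int_{-\infty}^{\infty}\frac{2i\sigma\,da}{a^2+\sigma^2}. \tag{2.5}$$

These are equivalent to the usual definitions. In a product of $\delta$ functions, the limit 
$\sigma\to 0$ must be taken last, after all integrations have been carried out. The following 
formulae are required: 

$$\int\delta_+(a+b)\,[\delta_+(a)+\delta_+(b)] = \int\delta_+(a)\,\delta_+(b), \tag{2.6}$$

$$\int\delta(a+b)\,[\delta_+(a)+\delta_+(b)] = \int\delta(a)\,\delta(b), \tag{2.7}$$

where the $\int$ sign denotes $\iint da\,db$ and $a$ and $b$ are independent variables. Using (2.4) 
the left side of (2.6) gives 

$$\int\delta_+(a)\,\delta_+(b) + \frac{1}{(2\pi i)^2}\lim_{\sigma\to+0}\iint\frac{i\sigma\,da\,db}{(a+i\sigma)(b+i\sigma)(a+b+i\sigma)}.$$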

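The last term is zero, since the integrand is of order $1/a^2$ as $a\to\infty$ in any direction, 
and the integrand has no poles in the upper half-complex $a$ plane for $\sigma > 0$. The left 
side of (2.7) gives 

$$\int\delta(a+b)\,\delta(a) - \frac{1}{(2\pi i)^2}\lim_{\sigma\to+0}\iint\frac{2i\sigma\,(a+b)\,da\,db}{(a+b+i\sigma)(a+b-i\sigma)(a-i\sigma)(b+i\sigma)}.$$

The last term gives zero when the integration is carried out with respect to $a$ and $b$. 
This proves (2.7). With the aid of (2.6) the wave function (2.3) can be written 

$$\langle\mathbf{k}_1\mathbf{k}_2|\psi|\mathbf{k}_1^i\mathbf{k}_2^i\rangle = \langle\mathbf{k}_1\mathbf{k}_2|1|\mathbf{k}_1^i\mathbf{k}_2^i\rangle + \delta_+(W-W^i)\,\langle\mathbf{k}_1\mathbf{k}_2|r|\mathbf{k}_1^i\mathbf{k}_2^i\rangle, \tag{2.8}$$

where 

$$\langle\mathbf{k}_1\mathbf{k}_2|r|\mathbf{k}_1^i\mathbf{k}_2^i\rangle = \delta(\mathbf{k}_1-\mathbf{k}_1^i)\,\langle\mathbf{k}_2|r_2|\mathbf{k}_2^i\rangle + \delta(\mathbf{k}_2-\mathbf{k}_2^i)\,\langle\mathbf{k}_1|r_1|\mathbf{k}_1^i\rangle + [\delta_+(W_1-W_1^i)+\delta_+(W_2-W_2^i)]\,\langle\mathbf{k}_1|r_1|\mathbf{k}_1^i\rangle\,\langle\mathbf{k}_2|r_2|\mathbf{k}_2^i\rangle. \tag{2.9}$$

In (2.8) $W = W_1 + W_2 = \sqrt{(k_1^2+\kappa_1^2)} + \sqrt{(k_2^2+\kappa_2^2)}$ is the total kinetic energy. The $S$ matrix 
corresponding to (2.8) is defined by 

$$\langle\mathbf{k}_1\mathbf{k}_2|S|\mathbf{k}_1^i\mathbf{k}_2^i\rangle = \langle\mathbf{k}_1\mathbf{k}_2|1|\mathbf{k}_1^i\mathbf{k}_2^i\rangle + \delta(W-W^i)\,\langle\mathbf{k}_1\mathbf{k}_2|r|\mathbf{k}_1^i\mathbf{k}_2^i\rangle \tag{2.10}$$

$$= \langle\mathbf{k}_1|S_1|\mathbf{k}_1^i\rangle\,\langle\mathbf{k}_2|S_2|\mathbf{k}_2^i\rangle. \tag{2.11}$$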


In obtaining (2.11) we have used (2.7). In symbolic form this equation may be 
written 

$$S = S_1 S_2. \tag{2.12}$$


This result is intuitively obvious, so that any other would cast doubt on the applic- 
ability of the theory to this particular system. But its proof is not trivial; for 
example, Heitler’s theory of radiation damping leads to a different result, and must 
therefore be inapplicable to this problem. Heitler’s integral equation, according to 
Wentzel (1947), can be written 

$$R = 2\pi i K + \pi i K R, \tag{2.13}$$

where the matrix elements of $K$ are the first non-vanishing terms of an interaction 
matrix, and 

$$S = 1 + R. \tag{2.14}$$

In our example (2.13) holds for the product system, and since there is no interaction 
between the particles, 

$$K = K_1 + K_2, \tag{2.15}$$

$$R_n = 2\pi i K_n + \pi i K_n R_n \quad (n = 1, 2). \tag{2.16}$$

This gives 

$$R = (R_1 + R_2 + R_1 R_2)/(1 - \tfrac{1}{4} R_1 R_2), \tag{2.17}$$

which contradicts (2.12). 
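The contradiction can be made concrete with a small numerical check. The Python sketch below is only an illustration under a strong simplifying assumption: $K_1$ and $K_2$ are treated as ordinary commuting real numbers rather than matrices, and their values are chosen arbitrarily. It solves Heitler's equation $R = 2\pi i K + \pi i K R$ for each particle and for the product system with $K = K_1 + K_2$, then compares the result with the closed form $(R_1+R_2+R_1R_2)/(1-\tfrac14 R_1R_2)$ and with the value $R_1+R_2+R_1R_2$ that the product rule $S = S_1S_2$ would require.

```python
import math

def heitler_R(K):
    """Solve R = 2*pi*i*K + pi*i*K*R for a single number K."""
    x = math.pi * 1j * K
    return 2 * x / (1 - x)

# Illustrative real values for the two single-particle K's (assumed).
K1, K2 = 0.1, 0.2

R1, R2 = heitler_R(K1), heitler_R(K2)

# Heitler's equation applied to the product system with K = K1 + K2.
R_heitler = heitler_R(K1 + K2)

# Closed form in terms of R1 and R2, as in (2.17).
R_formula = (R1 + R2 + R1 * R2) / (1 - 0.25 * R1 * R2)

# Value required by S = S1 S2, i.e. 1 + R = (1 + R1)(1 + R2).
R_product = R1 + R2 + R1 * R2

print("difference from closed form :", abs(R_heitler - R_formula))
print("difference from product rule:", abs(R_heitler - R_product))
```

In this scalar model the first difference vanishes to rounding error while the second does not, unless $R_1R_2 = 0$. Each single-particle $S_n = 1 + R_n$ is still of modulus unity, so the discrepancy lies in the way the two systems are combined and not in the unitarity of either.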

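The result (2.12) cannot be directly generalized to a system of many particles by 
induction, since the $r$ matrix for many particles will contain a number of subsidiary 
terms with $\delta$ and $\delta_+$ factors, which might invalidate our calculations. The direct 
proof will be outlined here. (2.6) and (2.7) generalize to 

$$\int\delta_+\Big(\sum_{l=1}^{n}a_l\Big)\sum_{m=1}^{n}\prod_{l\neq m}\delta_+(a_l) = \int\prod_{l=1}^{n}\delta_+(a_l), \tag{2.18}$$

$$\int\delta\Big(\sum_{l=1}^{n}a_l\Big)\sum_{m=1}^{n}\prod_{l\neq m}\delta_+(a_l) = \int\prod_{l=1}^{n}\delta(a_l), \tag{2.19}$$

where the $\int$ sign denotes $\iint\!\ldots\!\int da_1\,da_2\ldots da_n$, and $a_1, a_2, \ldots, a_n$ are independent. 

$$\text{Left side of (2.18)} = \Big(\!-\frac{1}{2\pi i}\Big)^{n}\lim_{\sigma\to+0}\int\Big\{\frac{1}{\prod_l (a_l+i\sigma)} + \frac{(n-1)\,i\sigma}{\big(\sum_l a_l+i\sigma\big)\prod_l (a_l+i\sigma)}\Big\}.$$

The second term in the integrand is of order $1/a_l^2$ for any $a_l$, as $a_l\to\infty$ in the complex 
plane. It has no poles in the upper half $a_l$ plane, and gives zero contribution to the 
integral. (2.18) follows. If (2.19) holds for $n$, then 

$$\int\delta(a_{n+1})\,\delta\Big(\sum_{l=1}^{n}a_l\Big)\sum_{m=1}^{n}\prod_{l\neq m}\delta_+(a_l) = \int\prod_{l=1}^{n+1}\delta(a_l).$$

$$\text{Left side} = \int\{\delta_+(a_{n+1})+\delta_-(a_{n+1})\}\,\delta\Big(\sum_{l=1}^{n}a_l\Big)\sum_{m=1}^{n}\prod_{l\neq m}\delta_+(a_l)$$

$$= \int\Big\{\Big[\delta_+(a_{n+1})+\delta_+\Big(\sum_{l=1}^{n}a_l\Big)\Big]\,\delta\Big(\sum_{l=1}^{n+1}a_l\Big)\Big\}\sum_{m=1}^{n}\prod_{l\neq m}\delta_+(a_l)$$

$$= \text{left side of (2.19) with } n+1 \text{ for } n.$$

It follows that (2.19) is true for all $n$. 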

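The wave function for a system of $n$ non-interacting particles, written in the 
product-momentum space, takes the form 

$$\langle\mathbf{k}_1\ldots\mathbf{k}_n|\psi|\mathbf{k}_1^i\ldots\mathbf{k}_n^i\rangle = \prod_{l=1}^{n}\langle\mathbf{k}_l|\psi_l|\mathbf{k}_l^i\rangle. \tag{2.20}$$

Using (2.18) this can be written 

$$= \langle\mathbf{k}_1\ldots\mathbf{k}_n|1|\mathbf{k}_1^i\ldots\mathbf{k}_n^i\rangle + \delta_+(W-W^i)\,\langle\mathbf{k}_1\ldots\mathbf{k}_n|r|\mathbf{k}_1^i\ldots\mathbf{k}_n^i\rangle. \tag{2.21}$$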

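The $S$ matrix is given by 

$$\langle\mathbf{k}_1\ldots\mathbf{k}_n|S|\mathbf{k}_1^i\ldots\mathbf{k}_n^i\rangle = \langle\mathbf{k}_1\ldots\mathbf{k}_n|1|\mathbf{k}_1^i\ldots\mathbf{k}_n^i\rangle + \delta(W-W^i)\,\langle\mathbf{k}_1\ldots\mathbf{k}_n|r|\mathbf{k}_1^i\ldots\mathbf{k}_n^i\rangle. \tag{2.22}$$

With the help of (2.19) this gives 

$$S = S_1 S_2 \ldots S_n. \tag{2.23}$$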

3. Two-particle scattering with small mutual interaction 

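When the interaction between two particles is treated as a small perturbation on 
the system of § 2, we can investigate the new form taken by the $S$ matrix. Denote the 
unperturbed wave matrix by $\psi_0$. Its matrix elements satisfy a wave equation 

$$(W^i-W)\,\langle\mathbf{k}_1\mathbf{k}_2|\psi_0|\mathbf{k}_1^i\mathbf{k}_2^i\rangle = \langle\mathbf{k}_1\mathbf{k}_2|V\psi_0|\mathbf{k}_1^i\mathbf{k}_2^i\rangle, \tag{3.1}$$

where $V = V_1 + V_2$ is the sum of the separate interactions of the particles with the 
scattering centre. Let the wave function $\langle\mathbf{k}_1\mathbf{k}_2|\psi|\mathbf{k}_1^i\mathbf{k}_2^i\rangle$ describe the state which 
develops adiabatically under a perturbing potential $v$, where $v$ refers to a small interaction 
between particles 1 and 2. Then 

$$(W^i-W)\,\langle\mathbf{k}_1\mathbf{k}_2|\psi|\mathbf{k}_1^i\mathbf{k}_2^i\rangle = \langle\mathbf{k}_1\mathbf{k}_2|(V+v)\psi|\mathbf{k}_1^i\mathbf{k}_2^i\rangle. \tag{3.2}$$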


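Put $\psi = \psi_0 + \phi$. Then to first order in $v$ and $\phi$, 

$$(W^i-W)\,\langle\mathbf{k}_1\mathbf{k}_2|\phi|\mathbf{k}_1^i\mathbf{k}_2^i\rangle = \langle\mathbf{k}_1\mathbf{k}_2|V\phi+v\psi_0|\mathbf{k}_1^i\mathbf{k}_2^i\rangle. \tag{3.3}$$

Without loss of generality we can take $V\phi = 0$, since this involves only a re-choosing 
of the unperturbed state $\psi_0$. To see this, put $\phi = \phi_1 + \phi_2$ where $V\phi_2 = 0$. Then $\psi_0 + \phi_1$ 
satisfies (3.1) if $\phi_2$ satisfies 

$$(W^i-W)\,\langle\mathbf{k}_1\mathbf{k}_2|\phi_2|\mathbf{k}_1^i\mathbf{k}_2^i\rangle = \langle\mathbf{k}_1\mathbf{k}_2|v\psi_0|\mathbf{k}_1^i\mathbf{k}_2^i\rangle. \tag{3.4}$$

Hence $\phi_1$ can be added to the unperturbed wave function $\psi_0$. 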

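It is assumed that particles 1 and 2 interact only in a finite region of space sur- 
rounding the scattering centre. Then the correction $\phi$ to the wave matrix will be 
of the form 

$$\langle\mathbf{k}_1\mathbf{k}_2|\phi|\mathbf{k}_1^i\mathbf{k}_2^i\rangle = -2\pi i\,\delta_+(W-W^i)\,\langle\mathbf{k}_1\mathbf{k}_2|v\psi_0|\mathbf{k}_1^i\mathbf{k}_2^i\rangle. \tag{3.5}$$

Hence 

$$\langle\mathbf{k}_1\mathbf{k}_2|\psi|\mathbf{k}_1^i\mathbf{k}_2^i\rangle = \langle\mathbf{k}_1\mathbf{k}_2|1|\mathbf{k}_1^i\mathbf{k}_2^i\rangle - 2\pi i\,\delta_+(W-W^i)\,\langle\mathbf{k}_1\mathbf{k}_2|V\psi_0+v\psi_0|\mathbf{k}_1^i\mathbf{k}_2^i\rangle. \tag{3.6}$$

On going over to the $S$ matrix this gives 

$$\langle\mathbf{k}_1\mathbf{k}_2|S|\mathbf{k}_1^i\mathbf{k}_2^i\rangle = \langle\mathbf{k}_1\mathbf{k}_2|S_0|\mathbf{k}_1^i\mathbf{k}_2^i\rangle - 2\pi i\,\delta(W-W^i)\,\langle\mathbf{k}_1\mathbf{k}_2|v\psi_0|\mathbf{k}_1^i\mathbf{k}_2^i\rangle. \tag{3.7}$$

This gives for the perturbed $r$ matrix 

$$\begin{aligned}\langle\mathbf{k}_1\mathbf{k}_2|r|\mathbf{k}_1^i\mathbf{k}_2^i\rangle = {}& -2\pi i\,\langle\mathbf{k}_1\mathbf{k}_2|v|\mathbf{k}_1^i\mathbf{k}_2^i\rangle\\ & -2\pi i\int\langle\mathbf{k}_1\mathbf{k}_2|v|\mathbf{k}_1^i\mathbf{k}_2'\rangle\,d\mathbf{k}_2'\,\delta_+(W_2'-W_2^i)\,\langle\mathbf{k}_2'|r_2|\mathbf{k}_2^i\rangle\\ & -2\pi i\int\langle\mathbf{k}_1\mathbf{k}_2|v|\mathbf{k}_1'\mathbf{k}_2^i\rangle\,d\mathbf{k}_1'\,\delta_+(W_1'-W_1^i)\,\langle\mathbf{k}_1'|r_1|\mathbf{k}_1^i\rangle\\ & -2\pi i\iint\langle\mathbf{k}_1\mathbf{k}_2|v|\mathbf{k}_1'\mathbf{k}_2'\rangle\,d\mathbf{k}_1'\,d\mathbf{k}_2'\,\delta_+(W'-W^i)\,\{\delta_+(W_1'-W_1^i)+\delta_+(W_2'-W_2^i)\}\\ & \qquad\times\langle\mathbf{k}_1'|r_1|\mathbf{k}_1^i\rangle\,\langle\mathbf{k}_2'|r_2|\mathbf{k}_2^i\rangle\\ & +\langle\mathbf{k}_1\mathbf{k}_2|r_0|\mathbf{k}_1^i\mathbf{k}_2^i\rangle,\end{aligned} \tag{3.8}$$

where $r_0$ is the unperturbed wave matrix given by (2.9). We see from (3.8) that the 
perturbation causes an essential change in the $r$ matrix, even when the energy shell 
condition $W = W^i$ is applied. If $V = V_1 + V_2$ is also small, of the same order as $v$, then 

$$S_0 = 1 - 2\pi i\bar{V}, \tag{3.9}$$

where 

$$\langle\mathbf{k}_1\mathbf{k}_2|\bar{V}|\mathbf{k}_1^i\mathbf{k}_2^i\rangle = \delta(W-W^i)\,\langle\mathbf{k}_1\mathbf{k}_2|V|\mathbf{k}_1^i\mathbf{k}_2^i\rangle. \tag{3.10}$$

Then 

$$S = S_0(1-2\pi i\bar{v}) = (1-2\pi i\bar{v})S_0, \tag{3.11}$$

since the Poisson bracket $[\bar{V}, \bar{v}]$ is now of second order in smallness. 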

Excluding Coulomb fields, there would not be interaction between the two 
particles at infinite separation from the scattering centre, unless $v$ could lead to 
binding between the two particles. If $\mathbf{k}^{(m)}$ denotes the momentum of the compound 
particle in state $(m)$, $v$ will have matrix elements of the type $\langle\mathbf{k}^{(m)}|v|\mathbf{k}_1\mathbf{k}_2\rangle$. Then 

$$\langle\mathbf{k}^{(m)}|S|\mathbf{k}_1^i\mathbf{k}_2^i\rangle = -2\pi i\,\delta(W-W^i)\,\langle\mathbf{k}^{(m)}|v\psi_0|\mathbf{k}_1^i\mathbf{k}_2^i\rangle = -2\pi i\,\delta(W-W^i)\iint\langle\mathbf{k}^{(m)}|v|\mathbf{k}_1\mathbf{k}_2\rangle\,d\mathbf{k}_1\,d\mathbf{k}_2\,\langle\mathbf{k}_1\mathbf{k}_2|\psi_0|\mathbf{k}_1^i\mathbf{k}_2^i\rangle. \tag{3.12}$$

However, the perturbation does not give the whole change in $S$, since the incident 
wave also will have to be modified. We will not investigate these types of transition 
in this paper. There is nothing essentially new in the generalization of this calculation 
to more than two particles. 

4. The physical interpretation of the wave function 

For simplicity it is assumed that no matrix elements of type (3.12) occur. Then the 
matrix elements of $\psi$ which are important at large separation from the scattering 
centre are 

$$\langle\mathbf{k}_1\mathbf{k}_2|\psi|\mathbf{k}_1^i\mathbf{k}_2^i\rangle = \langle\mathbf{k}_1\mathbf{k}_2|1|\mathbf{k}_1^i\mathbf{k}_2^i\rangle + \delta_+(W-W^i)\,\langle\mathbf{k}_1\mathbf{k}_2|r|\mathbf{k}_1^i\mathbf{k}_2^i\rangle, \tag{4.1}$$

$$\langle\mathbf{k}_1 n_2|\psi|\mathbf{k}_1^i\mathbf{k}_2^i\rangle = \delta_+(W-W^i)\,\langle\mathbf{k}_1 n_2|r|\mathbf{k}_1^i\mathbf{k}_2^i\rangle. \tag{4.2}$$

The expression (4.1) denotes the pure scattering part of $\psi$ and describes a state in 
which neither particle is captured by the centre. (4.2) denotes a state in which 
particle 2 has been captured by the scattering centre, and the excess kinetic energy 
carried off by particle 1. Since we shall only be concerned with the behaviour at 
large distances from the centre, kinetic energy must be nearly conserved, so no states 
with both particles captured can occur. 

The assumption that the particles interact only in a finite region round the scat- 
tering centre means that the incident waves can be plane waves, and the initial state 
of the system is represented by $\langle\mathbf{k}_1\mathbf{k}_2|1|\mathbf{k}_1^i\mathbf{k}_2^i\rangle$. This also implies that no Coulomb 
forces are present; only short-range forces act between the particles. When the wave 
function corresponding to the initial state is subtracted from (4.1), we are left with 
a part which should correspond to a state which has been affected by the scattering 
centre. That is, one or more of the particles must be outgoing in the asymptotic 
region of co-ordinate space. By transforming to co-ordinate space it will be shown 
that this requirement is satisfied by the last term of (4.1). 

For one-particle scattering the corresponding requirement is that the asymptotic 
wave function should represent an incident wave plus a wave which is outgoing only. 
For two or more particles, in a steady scattering process, there are interference effects 
between the incident wave of one particle and the outgoing wave of another particle. 
For example, in (2.8) this interference is represented by the term 

$$\delta_+(W-W^i)\,\delta(\mathbf{k}_2-\mathbf{k}_2^i)\,\langle\mathbf{k}_1|r_1|\mathbf{k}_1^i\rangle.$$

This denotes a plane wave in the momentum space of particle 2, coupled with a wave 
outgoing from the scattering centre for particle 1. We note that the incident wave 
$\delta(\mathbf{k}_2-\mathbf{k}_2^i)$ is incoming on one side of the scattering centre and outgoing on the other 
side. Since the system is in a stationary state, the probability of any state is constant 
in time, so there is no contradiction in the fact that incoming waves of one particle 
mix with outgoing waves of the other even at infinity. Without such mixing of 
incident and outgoing waves we would not obtain a unitary $S$ matrix. 

If in a many-particle system there is mutual interaction between the particles, 
there will be an exchange of energy at the scattering centre. Without this exchange 
of energy a totally outgoing wave would contain a product of $\delta_+(W_l-W_l^i)$ factors, 
one for each outgoing particle. By formula (2.18) this reduces to the standard form 
with a $\delta_+(W-W^i)$ factor multiplied by a term singular in the separate $W_l$ at $W_l^i$. 
With the exchange of energy the essential factor $\delta_+(W-W^i)$ remains, but the other 
will be modified so that it is no longer singular in the separate $W_l$. We will see that, 
in the asymptotic region of co-ordinate space, such a term still represents a totally 
outgoing wave. 

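Summarizing these remarks for a two-particle system: The second term in the 
wave function (4.1) will contain terms falling into two classes. First, the interference 
between particle 1 outgoing and particle 2 incident, given by 

$$\delta_+(W-W^i)\,\delta(\mathbf{k}_2-\mathbf{k}_2^i)\,\langle\mathbf{k}_1|r_1|\mathbf{k}_1^i\rangle. \tag{4.3}$$

Second, a term which we shall see is totally outgoing, 

$$\delta_+(W-W^i)\,\langle\mathbf{k}_1\mathbf{k}_2|f|\mathbf{k}_1^i\mathbf{k}_2^i\rangle, \tag{4.4}$$

where $\langle\mathbf{k}_1\mathbf{k}_2|f|\mathbf{k}_1^i\mathbf{k}_2^i\rangle$ contains no singular parts such as $\delta$ or $\delta_+$ terms in the separate 
particle energies. The representation of (4.3) and (4.4) in co-ordinate space is obtained 
by means of the transformation function 

$$\langle\mathbf{r}_1\mathbf{r}_2|\mathbf{k}_1\mathbf{k}_2\rangle = (2\pi)^{-3}\exp(i\mathbf{k}_1\cdot\mathbf{r}_1)\exp(i\mathbf{k}_2\cdot\mathbf{r}_2). \tag{4.5}$$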


In $\mathbf{r}_2$ co-ordinate space (4.3) gives a plane wave $\exp(i\mathbf{k}_2^i\cdot\mathbf{r}_2)$ with momentum equal 
to the initial value $\mathbf{k}_2^i$. The $\mathbf{r}_1$ dependent part is identical with Møller’s one-particle 
system (Møller 1946), and for large $r_1$ has the asymptotic form 

[equation (4.6) is illegible in the source] 

where $\omega_1$ denotes the angle variables in the direction of $\mathbf{r}_1$, and $A_1^i$ is the value 
of $A_1$ at $W_1 = W_1^i$. (4.6) represents an outgoing wave only in $\mathbf{r}_1$ co-ordinate space. 

The most convenient method of transforming (4.4) to co-ordinate space is by means 
of the Heisenberg-Møller $\delta$ functions. These are written $\delta_\pm(W, W^i)$ and $\delta(W, W^i)$ 
to distinguish them from the Dirac $\delta$ functions. Their properties are set out by 
Møller (1946), and only those formulae which are needed for specific discussion are 
given here. 

$\delta_\pm(W, W^i)$ are defined when under an integral by 

[equation (4.7): the integral of $f(W)\,\delta_\pm(W, W^i)\,dW$ is expressed as a contour integral of $f(W)\,dW/(W-W^i)$ along $C_\pm(W^i)$; the prefactor is illegible in the source] 

where $C_\pm(W^i)$ are contours in the complex $W$ plane shown in figure 1. 

$C_+(W^i)$ passes from the rest mass $\kappa$ below the value $W^i$, and to infinity along the 
real axis. 

$C_-(W^i)$ passes above $W^i$, and to infinity along the real axis. 

When $W^i$ is real and greater than $\kappa$ these formulae are equivalent to (2.4). 

$C(W^i)$ is a closed contour running clockwise round $W^i$. 

[Figure 1. The contours $C_+(W^i)$, $C_-(W^i)$ and $C(W^i)$ in the complex $W$ plane.] 

These $\delta$ functions are written as functions of two variables since the path of 
integration is modified to suit each value of $W^i$. The Dirac $\delta$ functions, on the other 
hand, are functions of only one variable given by the real difference $(W-W^i)$. 

If [condition illegible in the source], then for $W^i$ real and greater than $\kappa$ 

$$\lim_{r\to\infty}\exp(ikr)\,\delta_\pm(W, W^i) = \begin{cases}\exp(ikr)\,\delta(W, W^i),\\ 0,\end{cases} \tag{4.9}$$

since $C_-(W^i)$ may be taken entirely in the upper half $W$ plane, where the imaginary 
part of $k$ is positive. Similarly, $C_+(W^i)$ may be taken in the upper half-plane except 
for a small closed contour surrounding the point $W = W^i$. 

Transforming (4.4) to co-ordinate space we get for large $r_1$ 

[equation (4.10): the transform of $\delta_+(W, W^i)\,\langle\mathbf{k}_1\mathbf{k}_2|f|\mathbf{k}_1^i\mathbf{k}_2^i\rangle$ reduces to the sum of two integrals over $d\mathbf{k}_2\,dW_1$, the first containing $\exp(ik_1r_1)$ and the second $\exp(-ik_1r_1)$; the remaining details are illegible in the source] 

where $\omega_1$ and $\bar{\omega}_1$ are angle variables in the directions $\mathbf{r}_1$ and $-\mathbf{r}_1$ respectively. For 
any fixed value of $\mathbf{k}_2$ in the integrand of the last term, there is a factor 
$\exp(-ik_1r_1)$ [remainder of the factor illegible in the source]. 

The path of integration may be varied in the $W_1$ plane so that this gives zero, using 
the conjugate complex of equation (4.9). We now see why it is essential that $f$ should 
contain no singular factor like $\delta(W_1-W_1^i)$, since with such a term only one point 
of the path of integration of $W_1$ is important and (4.9) would not apply. This would 
give a non-zero contribution from the incoming wave $\exp(-ik_1r_1)$. For a non- 
singular $f$, the asymptotic form of (4.4) for large $r_1$ is 

[equation (4.11) is illegible in the source] 

The superscript $B$ denotes that $\mathbf{k}_1$ and $\mathbf{k}_2$ are, in the integrand, taken on the kinetic 
energy shell, i.e. 

$$W^i = \sqrt{(k_1^2+\kappa_1^2)} + \sqrt{(k_2^2+\kappa_2^2)}. \tag{4.12}$$

Similarly, if we had carried out the integration first with respect to $\mathbf{k}_1$, we would 
have obtained, for large $r_2$, 

[equation (4.13) is illegible in the source] 

where 

$$W_2^B = W^i - W_1. \tag{4.14}$$

Hence it can be said that a non-singular term like $f$, in the $r$ matrix, corresponds to 
an outgoing wave in the asymptotic regions of both $\mathbf{r}_1$ and $\mathbf{r}_2$ space, that is, it repre- 
sents a totally outgoing wave. 
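The kinetic energy shell used above is easy to illustrate numerically. The Python sketch below takes the relativistic relation $W = \sqrt{(k^2+\kappa^2)}$ in units $\hbar = c = 1$, fixes the total kinetic energy from an initial pair of momenta, and reads off the energy and momentum of particle 2 once an energy $W_1$ has been chosen for particle 1. The function names and the numerical values of the masses and momenta are assumptions made purely for illustration.

```python
import math

def kinetic_energy(k, kappa):
    """Relativistic energy W = sqrt(k^2 + kappa^2) in units hbar = c = 1."""
    return math.sqrt(k * k + kappa * kappa)

def shell_momentum_2(W_total, W1, kappa2):
    """Magnitude of k2 fixed by the energy shell: W2 = W_total - W1."""
    W2 = W_total - W1
    if W2 < kappa2:
        raise ValueError("W1 too large: particle 2 would lie below its rest mass")
    return math.sqrt(W2 * W2 - kappa2 * kappa2)

# Illustrative values (assumed): rest masses and initial momenta.
kappa1, kappa2 = 1.0, 2.0
k1_init, k2_init = 3.0, 1.5

# Total kinetic energy fixed by the initial state.
W_total = kinetic_energy(k1_init, kappa1) + kinetic_energy(k2_init, kappa2)

# Choose some energy for particle 1 and read off particle 2 from the shell.
W1 = 2.0
k2_shell = shell_momentum_2(W_total, W1, kappa2)
print(W_total, k2_shell)
```

Only the sum of the two energies is fixed on the shell, so $W_1$ remains a free integration variable in the asymptotic forms, and the energy of particle 2 follows from it.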

For scattering of $n$ particles on a fixed centre of force no essentially new concepts 
are involved. The general form of wave function assumed by Møller (1945) is 

$$\langle\mathbf{k}_1\ldots\mathbf{k}_n|\psi|\mathbf{k}_1^i\ldots\mathbf{k}_n^i\rangle = \langle\mathbf{k}_1\ldots\mathbf{k}_n|1|\mathbf{k}_1^i\ldots\mathbf{k}_n^i\rangle + \delta_+(W-W^i)\,\langle\mathbf{k}_1\ldots\mathbf{k}_n|r|\mathbf{k}_1^i\ldots\mathbf{k}_n^i\rangle. \tag{4.15}$$

The last term corresponds to a state in which at least one particle is moving away 
from the scattering centre. It can be split into terms, some of which refer to a mixture 
of incident waves of some particles with outgoing waves of others. 

The matrix element (4.2), which corresponds to capture of particle 2, can be 
written 

[equation (4.16) is illegible in the source] 

This represents an outgoing wave in $\mathbf{r}_1$ co-ordinate space, with kinetic energy asymp- 
totically equal to [expression illegible], where [symbol illegible] is the bound energy of particle 2 in state $(n)$. 

In $\mathbf{k}_2$ momentum space this term is by definition the Fourier transform of a term like 
[expression illegible in the source]. 

5. The asymptotic wave function for an eigenstate of S 

In the previous section the discussion was restricted to initial states made up of products of plane incident waves. As a preliminary to the investigation of bound states, we now consider a general initial state for scattering of two particles. We take as basic states $|k_1' k_2'\rangle$, and superpose with a weight function $u(k_1' k_2')\,\delta(W' - W^0)$. This weight function ensures that the kinetic energy of the initial state is $W^0$. The new wave function, which replaces (4.1), is

$$\psi_u(k_1 k_2) = \iint \langle k_1 k_2 | \psi | k_1' k_2' \rangle\, dk_1'\, dk_2'\, u(k_1' k_2')\,\delta(W' - W^0). \tag{5.1}$$



The suffix $u$ on $\psi$ indicates that we are considering a particular state determined by the function $u$. The $\delta(W' - W^0)$ ensures that only values of $u$ on the kinetic energy shell $W' = W^0$ are important. The factor $\langle k_1 k_2 | \psi | k_1' k_2' \rangle$ in the integrand is given by (4.1). Writing

$$\delta(W - W^0) = \delta_-(W - W^0) + \delta_+(W - W^0),$$

(5.1) can be put in the form

$$\psi_u(k_1 k_2) = u(k_1 k_2)\,\delta_-(W - W^0) + v(k_1 k_2)\,\delta_+(W - W^0), \tag{5.2}$$

where

$$v(k_1 k_2) = u(k_1 k_2) + \iint \langle k_1 k_2 | f | k_1' k_2' \rangle\, dk_1'\, dk_2'\, u(k_1' k_2')\,\delta(W' - W^0). \tag{5.3}$$

Except in the asymptotic limit, $\psi_u$, given by (5.2), is not on the energy shell $W = W^0$; but $u(k_1 k_2)$ can take arbitrary values for $W \neq W^0$ without affecting $\psi_u(k_1 k_2)$. The symmetry of form in (5.2) shows that $v(k_1 k_2)\,\delta(W - W^0)$ will determine the superposition for a final state, in the same way as $u(k_1 k_2)\,\delta(W - W^0)$ gives the initial state. Multiplying both sides of (5.3) by $\delta(W - W^0)$ gives

$$v(k_1 k_2)\,\delta(W - W^0) = \iint \langle k_1 k_2 | S | k_1' k_2' \rangle\, dk_1'\, dk_2'\, u(k_1' k_2')\,\delta(W' - W^0). \tag{5.4}$$
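The splitting of the delta function into $\delta_-$ and $\delta_+$ used above can be checked numerically. The sketch below is only an illustration under an assumed convention: it takes $\delta_\pm(x) = \pm(i/2\pi)/(x \pm i\epsilon)$ with a small finite $\epsilon$, which may differ in sign or normalization from the convention of the paper. The function names and the integration helper are hypothetical.

```python
import math

def delta_plus(x, eps):
    # Assumed convention (not taken from the paper): (i / 2 pi) / (x + i eps)
    return (1j / (2 * math.pi)) / (x + 1j * eps)

def delta_minus(x, eps):
    # Assumed convention (not taken from the paper): -(i / 2 pi) / (x - i eps)
    return (-1j / (2 * math.pi)) / (x - 1j * eps)

def lorentzian(x, eps):
    # Smeared delta function: tends to delta(x) as eps -> 0
    return (eps / math.pi) / (x * x + eps * eps)

def trapezoid(f, a, b, n=200001):
    # Plain trapezoidal rule on n equally spaced points
    h = (b - a) / (n - 1)
    total = 0.5 * (f(a) + f(b))
    for i in range(1, n - 1):
        total += f(a + i * h)
    return total * h

EPS = 0.01
# The two halves add up to the smeared delta function at every point ...
max_mismatch = max(
    abs(delta_plus(x, EPS) + delta_minus(x, EPS) - lorentzian(x, EPS))
    for x in (-2.0, -0.3, 0.0, 0.05, 1.7)
)
# ... and the smeared delta function carries (nearly) unit weight.
weight = trapezoid(lambda x: lorentzian(x, EPS), -50.0, 50.0)
```

With these definitions the sum of the two halves is a narrow Lorentzian of unit area, which is the finite-$\epsilon$ form of the identity $\delta = \delta_- + \delta_+$.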


In the special case where $u(k_1 k_2)\,\delta(W - W^0)$ is an eigenfunction of $S$, it may be written as a transformation function,

$$u(k_1 k_2)\,\delta(W - W^0) = \langle k_1 k_2 | W^0 \alpha^0 \rangle, \tag{5.5}$$

where $W$, $\alpha$ are a complete set of commuting observables, which commute also with $S$. Hence $S = S(W, \alpha)$ can be written as a function of them, and its eigenvalues written $\lambda(W^0 \alpha^0)$. Throughout this paper $\lambda$ will denote an eigenvalue of $S$; it is always a function of $W^0$, the total kinetic energy of the system. Equation (5.4) gives for this special case

$$v(k_1 k_2)\,\delta(W - W^0) = \iint \langle k_1 k_2 | S | k_1' k_2' \rangle\, dk_1'\, dk_2'\, \langle k_1' k_2' | W^0 \alpha^0 \rangle = \lambda(W^0 \alpha^0)\, \langle k_1 k_2 | W^0 \alpha^0 \rangle = \lambda\, u(k_1 k_2)\,\delta(W - W^0). \tag{5.6}$$

A is of modulus unity, so the initial and final states differ only by a phase factor. 
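As a small numerical illustration of the statement that an eigenvalue of a unitary matrix has modulus unity, the following sketch builds a 2 x 2 unitary matrix and finds its eigenvalues from the characteristic equation. The matrix, its parameters `theta` and `alpha`, and the helper names are hypothetical and are not taken from the paper.

```python
import cmath
import math

def unitary_2x2(theta, alpha):
    # exp(i alpha) * [[cos t, i sin t], [i sin t, cos t]] is unitary for real t, alpha
    phase = cmath.exp(1j * alpha)
    c, s = math.cos(theta), math.sin(theta)
    return [[phase * c, phase * 1j * s],
            [phase * 1j * s, phase * c]]

def eigenvalues_2x2(m):
    # Roots of lambda^2 - tr(m) lambda + det(m) = 0
    tr = m[0][0] + m[1][1]
    det = m[0][0] * m[1][1] - m[0][1] * m[1][0]
    root = cmath.sqrt(tr * tr - 4 * det)
    return (tr + root) / 2, (tr - root) / 2

S = unitary_2x2(theta=0.7, alpha=0.3)
lam1, lam2 = eigenvalues_2x2(S)
moduli = (abs(lam1), abs(lam2))
phases = sorted(cmath.phase(lam) for lam in (lam1, lam2))
```

Both eigenvalues lie on the unit circle, so applying the matrix to an eigenvector changes it only by a phase factor, here $\exp(i(\alpha \pm \theta))$.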

In making a transformation of $\psi_u$ to co-ordinate space, we again have to consider two possibilities. One is that $u(k_1 k_2)$ is 'non-singular', that is, it contains no $\delta$ factor in $W_1$ or $W_2$. If (5.6) holds, then $v(k_1 k_2)$ is also non-singular. Secondly, $u(k_1 k_2)$ may contain a factor $\delta(W_1 - W_1^0)$; then it is called a singular function.

When $u(k_1 k_2)\,\delta(W - W^0)$ is a non-singular eigenfunction, the wave function in co-ordinate space is

$$\psi_u(r_1 r_2) = u(r_1 r_2) + v(r_1 r_2), \tag{5.7}$$

where the asymptotic forms of $u$ and $v$ for large $r_1$ are




exp (ikfr^) exp (tkara) 
'^{^1)^1 ■\/(^) 


^(Ffwfka). 





wf and (uf are the directions of and — respectively. The superscript A denotes 
that, in the integrand, $k_1^A$ and $W_1^A$ are treated as functions of $W_2$ according to the energy shell relation

$$W_1^A = \sqrt{\kappa_1^2 + (k_1^A)^2} = W^0 - W_2. \tag{5.10}$$

For large $r_2$ we obtain


«(rir2)~ 

v(rira)~ 


1 fj exp(ikiri)exp 

1 r°""W {do> e^P(^kiri)exp(i^#ra) , 


(5-11) 

(5-12) 


The physical interpretation of (5.9) to (5.12) shows that $u(k_1 k_2)\,\delta_-(W - W^0)$ represents an incoming wave in both $r_1$ and $r_2$ co-ordinate space, while $v(k_1 k_2)\,\delta_+(W - W^0)$ represents an outgoing wave.

When $u$ is a singular function, the weight function for the superposition will take the form

$$u(k_1 k_2)\,\delta(W - W^0) = x(k_1 k_2)\,\delta(W_1 - W_1^0)\,\delta(W_2 - W_2^0), \tag{5.13}$$

where $x(k_1 k_2)$ is assumed to be a non-singular function. Physically a weight function of the type (5.13) means that the superposition for the initial state is made up of waves each having $W_1 = W_1^0$ and $W_2 = W_2^0$. If an eigenfunction of $S$ takes the form (5.13) it is called a singular eigenfunction. Then the weight function for the final state will take the same form:

$$v(k_1 k_2)\,\delta(W - W^0) = \iint \langle k_1 k_2 | S | k_1' k_2' \rangle\, dk_1'\, dk_2'\, x(k_1' k_2')\,\delta(W_1' - W_1^0)\,\delta(W_2' - W_2^0) = \lambda(W_1^0 W_2^0 \beta^0)\, x(k_1 k_2)\,\delta(W_1 - W_1^0)\,\delta(W_2 - W_2^0). \tag{5.14}$$

In § 8 the conditions for the existence of both singular and non-singular eigenfunctions of $S$ are discussed, and it is shown that they are normally satisfied. A singular eigenfunction corresponds physically to a state in which there is no interchange of energy between particles 1 and 2. For such a state one is able to measure the asymptotic values $W_1^0$, $W_2^0$ of the separate kinetic energies. This distinguishes it from a non-singular eigenstate where only the total kinetic energy can be known.

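When the singular function $u$ is not an eigenfunction of $S$,

$$v(k_1 k_2)\,\delta(W - W^0) = y(k_1 k_2)\,\delta(W_1 - W_1^0)\,\delta(W_2 - W_2^0) + z(k_1 k_2)\,\delta(W - W^0). \tag{5.15}$$

Substituting this in (5.2) gives

$$\psi_u(k_1 k_2) = \delta(W_1 - W_1^0)\, x(k_1 k_2)\,\delta_-(W_2 - W_2^0) + \delta(W_1 - W_1^0)\, y(k_1 k_2)\,\delta_+(W_2 - W_2^0) + z(k_1 k_2)\,\delta_+(W - W^0). \tag{5.16}$$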

In (5.16) the wave function has been split into terms with $(W_2 - W_2^0)$ as argument. This is consistent, but has removed the symmetry shown in (5.13) between particles 1 and 2. However, this has no effect on $\psi_u$ itself. The wave function is split in this way, so that later the bound states of particle 2 can be investigated by analytic continuation of $W_2^0$. The last term in (5.16) is of non-singular type, so it will represent an outgoing wave in both $r_1$ and $r_2$ co-ordinate space. When $u(k_1 k_2)\,\delta(W - W^0)$ gives a singular eigenstate of $S$, this last term is zero, and

$$\psi_u(k_1 k_2) = \delta(W_1 - W_1^0)\, x(k_1 k_2)\,\delta_-(W_2 - W_2^0) + \delta(W_1 - W_1^0)\, y(k_1 k_2)\,\delta_+(W_2 - W_2^0). \tag{5.17}$$

Transforming (5.17) to co-ordinate space gives

$$\psi_u(r_1 r_2) = x(r_1 r_2) + y(r_1 r_2). \tag{5.18}$$

For large $r_2$,


1 rw'-Kt f 

*(rxr,)~ ■iHiJ*.. 


exp (ikiTj) exp {—ihtr^ 


(27r)®ir2 


exp (ikj r^) exp ( — ih^ r^) 

~V(A^) mjw~ 


x(kJVl(0^). 


5(lFi-'PF?)a:(kiTf#W2-) 
(5-19) 
(5-20) 


In (5.19) $W_1 + W_2^A = W^0$, and in (5.20) $W_1^0 + W_2^0 = W^0$, that is, $W_1 = W_1^0$, the initial and conserved value. (5.19) has been obtained by taking an integral over $W_2$, with the path of integration in the lower half-plane except for a small circle round $W_2^0$. Using the eigenvalue equation (5.14), we get also for large $r_2$






exp (ikiTi) exp 


X{WlWl^)x{]s.^Wlu)t). (5-21) 


V(Ai) 

Equations (5.20) and (5.21) show that the first term on the right of (5.17) represents asymptotically an incoming wave in $r_2$ space, and the last term an outgoing wave in $r_2$ space. It is easy to see that the two parts have no such interpretation in $r_1$ space. Transforming the first term on the right of (5.17) for large $r_1$


" (2^x J ex^p (ik^r^) S{W,- Wl)x{W,axTK) S.(W„ Fg). 

^ ^ (5-22) 

In (5.22) only the point $W_1 = W_1^0$, of the path of integration of $W_1$, is important. It is this fact that has given a factor $\delta(W_1 - W_1^0)$ instead of $\delta_-(W_1, W_1^0)$. Hence, unlike the non-singular case, we cannot make a variation of the path of integration to give a zero contribution from either term. Thus (5.22) represents both incoming and outgoing waves in $r_1$ co-ordinate space.


6. Analytic continuation for a non-singular eigenstate

In this section the behaviour of the wave function (5.7) is investigated when analytic continuation is made in the complex $W^0$ plane, where $W^0$ is the total kinetic energy of the system. For a pure scattering problem, the asymptotic value of the kinetic energy of each particle must be greater than its rest mass $\kappa$. Hence for scattering

$$\kappa_1 + \kappa_2 < W^0. \tag{6.1}$$

It is supposed that the wave function (5.7) is known for the range (6.1), and the effect on the asymptotic forms (5.8) to (5.12) is investigated when $W^0$ is continued analytically to the range

$$0 < W^0 < \kappa_1 + \kappa_2. \tag{6.2}$$


In the equations (5.8) to (5.12) both the integrand and also the path of integration are functions of $W^0$. This enables us to obtain the behaviour of the asymptotic wave function when $W^0$ becomes complex, provided we specify along what path $W^0$ is continued. This will define the new path of integration for $W_1$ or $W_2$. $k_1$ and $k_2$ satisfy

$$W_1 = \sqrt{\kappa_1^2 + k_1^2}, \qquad W_2 = \sqrt{\kappa_2^2 + k_2^2}, \tag{6.3}$$

$$W_1 + W_2 = W^0. \tag{6.4}$$

The $W_i$ plane must be cut from $\kappa_i$ to $-\kappa_i$ in order to make $W_i$ a single-valued function of the momentum $k_i$. We adopt the convention that $k_1$ and $k_2$ are both real and positive for $W^0$ in range (6.1). This fixes the Riemann sheet in which we start.



Figure 2

Figure 3


We consider the behaviour of (6-8) when W° is continued by path [a] (figure 2). 
to values in range (6-2). The path of integration in the plane depends linearly on 
this change in W°. It changes from to A, by the addition of a path [aj (figure 3). 
If it is assumed that the integrand of (5-8) has no singularities in the lower half 
plane, the path of integration may be contracted to the direct path k^A^, taken below 
the cut on the real axis (/Cg to — k^. Since \ = | PTI — a:| |* exp {^d) and 0>6> —n 
on path [Ui], we have ^12 = 1 ^ 2 1 ®^P ( ~ along path k^A'. For fixed W° given by 
the value /Cl as varies along the path k^A^, Wf in the integrand of (6-8) will 
be given by Wf = Pf®— Pl^. PT® is real and Pl^ is in the lower half-complex plane. 
Hence in the integrand of (5-8), = +i\h{\ as goes along k^A^. This means 

that for large r-^, the wave function given by (5-8) contains a diverging factor 

exp (— rj) = exp (-f I kf ] r^) when PF® is continued by path [a] to real values in 
range (6-2). Similarly vir^T^) in (6-9) contains a convergent factor exp ( — | | rj 

for the same continuation of $W^0$. For large $r_2$, we have to consider (5.11) and (5.12). In these $W_1$ is the free variable whose path of integration $\kappa_1$ to $W^0 - \kappa_2$ is affected directly by continuation of $W^0$. When $W^0$ is continued by path [a] (figure 2), $W_1$ has a path of integration from $\kappa_1$ and below the cut ($\kappa_1$ to $-\kappa_1$) to $W^0 - \kappa_2 < \kappa_1$. For this path the dependent variable $W_2^A = W^0 - W_1$ moves in the upper half-plane, hence $k_2^A = +i|k_2^A|$, in the integrands of (5.11) and (5.12).

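Summarizing these results for continuation of $W^0$ by path [a] (figure 2),

$$u(r_1 r_2) \sim \begin{cases} r_1^{-1} \exp(|k_1^A|\, r_1) & \text{for large } r_1, \\ r_2^{-1} \exp(|k_2^A|\, r_2) & \text{for large } r_2, \end{cases} \tag{6.5}$$

$$v(r_1 r_2) \sim \begin{cases} r_1^{-1} \exp(-|k_1^A|\, r_1) & \text{for large } r_1, \\ r_2^{-1} \exp(-|k_2^A|\, r_2) & \text{for large } r_2. \end{cases} \tag{6.6}$$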
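The growth and decay in (6.5) and (6.6) follow from the energy shell relation $W = \sqrt{\kappa^2 + k^2}$: for $W < \kappa$ the momentum is purely imaginary, and the choice of sign is fixed by the path of continuation. The sketch below is a minimal illustration, assuming units in which energies and momenta are measured on the same scale; the function names and the `branch` argument are hypothetical.

```python
import cmath
import math

def momentum(W, kappa, branch=+1):
    # k = sqrt(W^2 - kappa^2); below threshold take k = branch * i|k|,
    # where branch = +1 or -1 stands for the chosen path of continuation.
    k_squared = W * W - kappa * kappa
    if k_squared >= 0:
        return complex(math.sqrt(k_squared), 0.0)
    return branch * 1j * math.sqrt(-k_squared)

def radial_factor(k, r, outgoing=True):
    # exp(+ikr)/r for an outgoing wave, exp(-ikr)/r for an incoming wave
    sign = 1 if outgoing else -1
    return cmath.exp(sign * 1j * k * r) / r

# Energy below the rest mass, continued so that k = +i|k|
k = momentum(W=0.6, kappa=1.0, branch=+1)
r = 10.0
v_like = abs(radial_factor(k, r, outgoing=True))   # behaves as exp(-|k| r)/r
u_like = abs(radial_factor(k, r, outgoing=False))  # behaves as exp(+|k| r)/r
```

With `branch=+1` the outgoing factor decays and the incoming factor diverges, as in (6.5) and (6.6); taking `branch=-1` interchanges the two.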



We see that $v(r_1 r_2)$ represents a closed state, in which particles 1 and 2 are bound to the scattering centre. If continuation of $W^0$ is made by path [b] (figure 2) above the real axis then

$$u(r_1 r_2) \sim \begin{cases} r_1^{-1} \exp(-|k_1^A|\, r_1) & \text{for large } r_1, \\ r_2^{-1} \exp(-|k_2^A|\, r_2) & \text{for large } r_2, \end{cases} \tag{6.7}$$

$$v(r_1 r_2) \sim \begin{cases} r_1^{-1} \exp(|k_1^A|\, r_1) & \text{for large } r_1, \\ r_2^{-1} \exp(|k_2^A|\, r_2) & \text{for large } r_2. \end{cases} \tag{6.8}$$

In this case, $u(r_1 r_2)$ represents a closed state with particles 1 and 2 bound.

For continuation by path [a], if the wave function $\psi_u(r_1 r_2)$ is itself to represent a closed state, we must have $u(r_1 r_2)$ zero. This will happen when there is a zero term in the integrands of (5.8) and (5.11), that is

$$u(W_1^A \omega_1 k_2) = 0 \quad \text{and} \quad u(k_1 W_2^A \omega_2) = 0. \tag{6.9}$$

These functions must be zero for all values of the variables, subject to $W_1^A + W_2 = W^0$ and $W_1 + W_2^A = W^0$ respectively. Hence the zero must be associated with the particular fixed final value, $W^{(n)}$ say, of $W^0$, where $0 < W^{(n)} < \kappa_1 + \kappa_2$. From (5.6) we see that $W^{(n)}$ must be a root of

$$\lambda(W^0 \alpha^0) = \infty. \tag{6.10}$$

Similarly, if $W^0$ is continued by path [b] (figure 2) to the range (6.2), $W^{(n)}$ is found to be a root of

$$\lambda(W^0 \alpha^0) = 0. \tag{6.11}$$

$W^{(n)}$ is interpreted as the energy of a state in which both particles are bound to the scattering centre.

These results show that, providing there exist eigenfunctions of the S matrix which are non-singular in the kinetic energy of one particle alone, the energies of the closed states may be obtained by analytic continuation of the corresponding eigenvalues. It will be seen later that in general non-singular eigenfunctions do exist.

It is assumed that a correct S matrix, calculated by some future theory, will 
lead only to zeros which correspond to bound states, although it has been noted 
that redundant zeros with no physical meaning may occur when the S matrix is 
calculated from non-relativistic quantum mechanics. 


7. Analytic continuation for a singular eigenstate 

We now consider the effect on the wave function (5.18) when analytic continuation is made in the $W_2^0$ plane. In (5.22) we have seen that the term $x(r_1 r_2)$ contains both incoming and outgoing waves in $r_1$ space, when the singular value $W_1^0$ is real and greater than $\kappa_1$. The same is true for $y(r_1 r_2)$. We consider first $W_1^0 > \kappa_1$, and then $W_1^0 < \kappa_1$.

For large $r_2$, analytic continuation of $W^0$ by path [a] (figure 2), corresponds to continuation of $W_2^0$ by path [$a_2$] (figure 4). This leads to $k_2^0 = i|k_2^0|$ in (5.20) and (5.21) giving

$$x(r_1 r_2) \sim r_2^{-1} \exp(|k_2^0|\, r_2), \tag{7.1}$$

$$y(r_1 r_2) \sim r_2^{-1} \exp(-|k_2^0|\, r_2). \tag{7.2}$$


Thus $x(r_1 r_2)$ diverges, whilst $y(r_1 r_2)$ represents a bound state of particle 2. We can see directly that when $W_2^0$ moves into the upper half-plane from B, the path of integration $C_+(W_2)$ can be taken entirely in the upper half-plane when associated with $\exp(ik_2 r_2)$, and entirely in the lower half-plane when associated with $\exp(-ik_2 r_2)$. Thus a $\delta_+(W_2, W_2^0)$ factor will lead to a bound state of particle 2, using path [$a_2$]. But $C_-(W_2)$ for $W_2^0$ at B' must encircle B' before going to infinity, so a $\delta_-(W_2, W_2^0)$ factor will in general lead to a divergent term.

If, for a particular final value of W%, 

x{k^Wl(j)^ = Q for = (7.3) 


Figure 4


Then the wave function $\psi_u(r_1 r_2)$ will itself represent a state in which particle 2 is bound and particle 1 free. The eigenvalue $\lambda(W_1^0 W_2^0 \beta^0)$ which corresponds to a singular eigenstate is a function of both $W_1^0$ and $W_2^0$, since for this state both kinetic energies can be known. Hence for each fixed value of $W_1^0$, the bound states of particle 2 have energies $W_2^{(n)}$ satisfying

$$\lambda(W_1^0 W_2^{(n)} \beta^0) = \infty, \qquad k_2^{(n)} = i|k_2^{(n)}|, \tag{7.4}$$

$$\lambda(W_1^0 W_2^{(n)} \beta^0) = 0, \qquad k_2^{(n)} = -i|k_2^{(n)}|. \tag{7.5}$$

The solutions $W_2^{(n)}$ will in general be dependent on the particular fixed value of $W_1^0$; that is, $W_2^{(n)}$ is the energy of particle 2 bound to the centre and subject to the field of the scattered particle 1 of energy $W_1^0$. For $W_1^0 > \kappa_1$ the dependence of $W_2^{(n)}$ on $W_1^0$ will probably be slight, but it is certainly important for $W_1^0 < \kappa_1$. The eigenvalues $\beta^0$ which refer to angle variables will also affect $W_2^{(n)}$.

When $W_1^0 = W_1^{(n)}$, where $W_1^{(n)}$ is the bound energy of particle 1 and the scatterer, the singular eigenstate has a special physical significance. It refers to scattering of particle 2 on a compound system made up of particle 1 bound to the scatterer. For the total energy $W^0 < \kappa_1 + \kappa_2$ all eigenstates are singular and of this type. A singular eigenstate of this special type can be written as a ket vector $|W_1^{(n)} W_2^0 \beta^0\rangle$. There are two ways of representing this state in momentum space. One is to take the usual Fourier transform from the co-ordinate representation, but this has the disadvantage that $\langle k_1 k_2 | W_1^{(n)} W_2^0 \beta^0 \rangle$ does not contain a $\delta$ factor in the kinetic energy. The other is to use a complex momentum space denoted by $\langle {}^*k_1 k_2 |$, in which $W_1^0$ may be less than $\kappa_1$. Using this the eigenfunction takes the form

$$\langle {}^*k_1 k_2 | W_1^{(n)} W_2^0 \beta^0 \rangle = \delta(W_1, W_1^{(n)})\,\delta(W_2 - W_2^0)\, x(k_1 k_2). \tag{7.6}$$

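The use of complex momentum space makes the treatment of singular eigenstates for $W_1^0 < \kappa_1$ formally the same as for $W_1^0 > \kappa_1$. The eigenfunction equation becomes

$$\iint \langle {}^*k_1 k_2 | S | {}^*k_1' k_2' \rangle\, dk_1'\, dk_2'\, \langle {}^*k_1' k_2' | W_1^{(n)} W_2^0 \beta^0 \rangle = \lambda(W_1^{(n)} W_2^0 \beta^0)\, \langle {}^*k_1 k_2 | W_1^{(n)} W_2^0 \beta^0 \rangle. \tag{7.7}$$

Since we are considering a real scattering problem here (and not the analytic continuation of one) the eigenvalue $\lambda$ is of modulus unity for $W_2^0 > \kappa_2$. If $W_2^0$ is continued analytically to the region $0 < W_2^0 < \kappa_2$, the energies of the bound states of particle 2 are obtained from the solutions $W_2^0 = W_2^{(m)}$ of

$$\lambda(W_1^{(n)} W_2^0 \beta^0) = \infty, \qquad k_2^0 = i|k_2^0|, \tag{7.8}$$

$$\lambda(W_1^{(n)} W_2^0 \beta^0) = 0, \qquad k_2^0 = -i|k_2^0|. \tag{7.9}$$

The value $W_2^{(m)}$ is the energy of particle 2 bound, subject to particle 1 having energy $W_1^{(n)}$. Hence $W^{(nm)} = W_1^{(n)} + W_2^{(m)}$ is the energy of a state with both particles bound to the scatterer.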


8. Further investigation of eigenstates

We have the following commutation properties in general,

$$[S, W] = 0, \qquad [S, W_1] = -[S, W_2] \neq 0. \tag{8.1}$$

All the eigenstates considered are simultaneous eigenstates of $S$ and $W$. The singular eigenstates are simultaneous eigenstates of $S$, $W_1$ and $W_2$. Not every eigenstate is a singular eigenstate unless there is zero interaction between the particles, except in certain bounded energy ranges.

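The eigenvalue equation for a singular eigenfunction is

$$\iint \langle W_1 \omega_1 W_2 \omega_2 | S | W_1^0 \omega_1' W_2^0 \omega_2' \rangle\, d\omega_1'\, d\omega_2'\, x(W_1^0 \omega_1' W_2^0 \omega_2') = \lambda(W_1^0 W_2^0 \beta^0)\,\delta(W_1 - W_1^0)\,\delta(W_2 - W_2^0)\, x(W_1 \omega_1 W_2 \omega_2). \tag{8.2}$$

Apply $\int_{W_1^0 - \epsilon}^{W_1^0 + \epsilon} dW_1 \int_{W_2^0 - \epsilon}^{W_2^0 + \epsilon} dW_2$ to both sides of (8.2) and let $\epsilon \to 0$. The right-hand side gives $\lambda(W_1^0 W_2^0 \beta^0)\, x(W_1^0 \omega_1 W_2^0 \omega_2)$, and the left-hand side gives zero unless it contains two $\delta$ function factors of the type $\delta(W_1 - W_1^0)\,\delta(W_2 - W_2^0)$. These always occur in $\langle W_1 \omega_1 W_2 \omega_2 | S | W_1^0 \omega_1' W_2^0 \omega_2' \rangle$, and arise both in the term corresponding to a product of incident waves, and in the term corresponding to interference between the incident wave of one particle and the outgoing wave of the other.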

If there exist eigenstates which are a mixture of the singular and the non-singular types, they can be written

$$\delta(W - W^0)\, u(k_1 k_2) + \delta(W_1 - W_1^0)\,\delta(W_2 - W_2^0)\, x(k_1 k_2). \tag{8.3}$$

The eigenvalue $\lambda(W^0)$ corresponding to (8.3) is a particular function of $W^0$ having singular points at $W^0 = W^{(n)} < \kappa_1 + \kappa_2$. We cannot at once interpret $W^{(n)}$ as the bound energy of two particles, since this result was obtained when $\lambda(W^0)$ was an eigenvalue of a non-singular eigenfunction. The eigenvalue equation is

$$\iint \langle k_1 k_2 | S | k_1' k_2' \rangle\, dk_1'\, dk_2'\, \{\delta(W' - W^0)\, u(k_1' k_2') + \delta(W_1' - W_1^0)\,\delta(W_2' - W_2^0)\, x(k_1' k_2')\} = \lambda(W^0)\, \{\delta(W - W^0)\, u(k_1 k_2) + \delta(W_1 - W_1^0)\,\delta(W_2 - W_2^0)\, x(k_1 k_2)\}. \tag{8.4}$$

If $W_1^0 > \kappa_1$ the range of integration will be along the real axis from $\kappa_1$ to $W^0 - \kappa_2$. But if $W_1^0 < \kappa_1$ the $\delta(W_1' - W_1^0)$ factor has to be replaced by $\delta(W_1', W_1^0)$ which requires the path to be a small circle round $W_1^0$. Apply $\int_{W_1^0 - \epsilon}^{W_1^0 + \epsilon} dW_1 \int_{W_2^0 - \epsilon}^{W_2^0 + \epsilon} dW_2$ to equation (8.4). If $W_1^0 < \kappa_1$ we would apply an integral with respect to $W_1$ round the point $W_1^0$ and radius $\epsilon$. Let $\epsilon \to 0$. The right-hand side gives $\lambda(W^0)\, x(k_1 k_2)$ with $W_1 = W_1^0$, $W_2 = W_2^0$. The first term on the left gives zero since it is not singular in both $W_1$ and $W_2$. Hence

$$\int_{W_1^0 - \epsilon}^{W_1^0 + \epsilon} dW_1 \int_{W_2^0 - \epsilon}^{W_2^0 + \epsilon} dW_2 \iint \langle k_1 k_2 | S | k_1' k_2' \rangle\, dk_1'\, dk_2'\, \delta(W_1' - W_1^0)\,\delta(W_2' - W_2^0)\, x(k_1' k_2') = \lambda(W^0)\, x(k_1 k_2), \tag{8.5}$$

with $W_1 = W_1^0$, $W_2 = W_2^0$, in $x(k_1 k_2)$. This means we can take

$$\delta(W_1 - W_1^0)\,\delta(W_2 - W_2^0)\, x(k_1 k_2)$$

itself to be an eigenfunction of $S$, belonging to the eigenvalue $\lambda(W^0)$. Hence

$$\delta(W - W^0)\, u(k_1 k_2)$$

is also an eigenfunction belonging to $\lambda(W^0)$. We can deduce that the singularities $W^{(n)}$ are the energies of bound states of the two particles. This has the consequence that $\delta(W_1 - W_1^0)\,\delta(W_2 - W_2^0)\, x(k_1 k_2)$ is a special singular eigenstate with either $W_1 = W_1^{(n)} < \kappa_1$, or $W_2 = W_2^{(n)} < \kappa_2$. Thus any eigenstate of a mixed type can always be represented as a superposition of a singular and a non-singular eigenstate. The singular state will in this case correspond to one of the particles bound to the scattering centre.

When the total energy of the system $W^0 < \kappa_1 + \kappa_2$, one particle must be bound to the scattering centre. This means that in any eigenstate the energy is conserved asymptotically, so the only possible eigenstates are of the type (7.7). These eigenstates vary in number according to the number of levels allowed by energy conservation, which requires $W_1^{(n)} + \kappa_2 < W^0$. For each $W^0$ these singular eigenstates and their corresponding eigenvalues define an S matrix, which I call the 'partial S matrix' since it contains only a part of all possible scattering matrix elements. The concept of a partial S matrix is only relative, since for a general system in which particles can be created and annihilated, the size of the S matrix increases indefinitely with increasing energy. The partial S matrix is unitary in order to conserve probability in a collision, but its 'piece-wise' analytic behaviour gives rise to difficulties in the theory, which will be discussed in more detail in another paper.





9. Conclusion 


The extension of the foregoing work to a system composed of an arbitrary constant 
number of particles and a fixed scattering centre should be possible without intro- 
ducing any more new concepts. However, there are one or two points worth men- 
tioning. 

The non-singular eigenstates will refer to pure scattering of all the particles with 
mutual interchanges of energy which prevent any particle, or set of particles less 
than the complete system, from remaining on the kinetic energy shell. The wave 
function in co-ordinate space will be 

$$\psi_u(r_1 r_2 \ldots r_n) = u(r_1 r_2 \ldots r_n) + v(r_1 r_2 \ldots r_n), \tag{9.1}$$

corresponding to (5.7). For asymptotically large $r_1$,


M(rir2...r„)- 


-1 


(27r)' 


—l+\n 




dWctd(i)^,..dW^d(ji)^ 


exp ( - ■i^?-i)exp (tkara) 


exp(tk^r „ 

' V(AJ 


) 


u{Wfa-J7^a^...WnO)n). 


(9-2) 


In the integrand of (9.2),

$$W_1^A = W^0 - \sum_{i=2}^{n} W_i. \tag{9.3}$$

The path of integration of $W_i$ for $i = 2, 3, \ldots, n$ is $\kappa_i$ to $W^0 - \sum_{j \neq i} \kappa_j$. Analytic continuation of $W^0$ by path [a] (figure 2) to a point just below the real axis, will cause the paths of integration of all $W_i$ ($i = 2, 3, \ldots, n$) to move down below the real axis. Hence $W_1^A$, given by (9.3) with $W^0$ equal to its final real value less than $\sum_{i=1}^{n} \kappa_i$, will move along a path having $k_1^A = i|k_1^A|$ at all points. Thus $u(r_1 r_2 \ldots r_n)$ diverges in $r_1$ and similarly $v(r_1 r_2 \ldots r_n)$ converges in $r_1$. The remaining work follows as before.
There will be many singular eigenfunctions, corresponding to kinetic energy con- 
servation of any set of m particles where m<n. These eigenfunctions arise from the 
very complicated mixing between incident waves of some particles and outgoing 
waves of others, and also from the states in which some of the particles are bound 
to the scattering centre. The concept of a partial S matrix will still be valid, but it 
will be discontinuous at a large number of energy values. 


I would like to express my thanks to Professor P. A. M. Dirac for valuable advice 
and criticism, and to Professor W. Heisenberg for many helpful discussions. 


References

Dirac, P. A. M. 1948 Principles of quantum mechanics, 3rd ed. Oxford University Press.
Heisenberg, W. 1943a Paper I. Z. Phys. 120, 513.
Heisenberg, W. 1943b Paper II. Z. Phys. 120,
Heisenberg, W. 1944 Paper III. Z. Phys. 123, 93.
Heisenberg, W. Paper IV. Unpublished. (See Heisenberg 1946.)
Heisenberg, W. 1946 Z. Naturforsch. 1, 608.
Heisenberg, W. 1949 Two lectures. Cambridge University Press.
Hu, N. 1948 Proc. R. Irish Acad. 52, A 6.
Møller, C. 1945 K. danske vidensk. Selsk. Math-Fys. Medd. 23, no. 1.
Møller, C. 1946 K. danske vidensk. Selsk. Math-Fys. Medd. 22, no. 19.
Wentzel, G. 1947 Rev. Mod. Phys. 19, 1.




Penetration of magnetic field into superconductors.
II. Measurements by the Casimir method

By E. Laurmann and D. Shoenberg

Royal Society Mond Laboratory, Cambridge

(Communicated by Sir Lawrence Bragg, F.R.S. Received 14 March 1949. Read 16 June 1949)


The changes with temperature of penetration of a magnetic field into superconducting tin and mercury were studied by a method due to Casimir in which a mutual inductance with a superconducting core is measured using low-frequency currents. The results were found to be very sensitive to surface conditions, but single crystals with smooth surfaces gave reproducible measurements of $\lambda(T) - \lambda(2.17°\,\mathrm{K})$ as a function of temperature $T$. These were consistent with the formula $\lambda(T) = \lambda_0 (1 - (T/T_c)^4)^{-1/2}$, where $T_c$ is the transition temperature, and $\lambda_0$ was found to be $5.2 \times 10^{-6}$ cm. for tin and $4.3 \times 10^{-6}$ cm. for mercury. For tin there was no significant difference between the values of $\lambda_0$ for current flow in different crystal directions, though a difference of up to 20 % is not excluded. For mercury there is a suggestion that $\lambda_0$ is about 20 % higher for current flow perpendicular to the principal axis than it is for current flow parallel to the principal axis, but this difference is little more than might be due to experimental errors. There was no evidence for any dependence of $\lambda$ on a steady magnetic field $H$, though an increase of 10 % up to 80 % of the critical field is not excluded.


Introduction

The first attempt to study the depth of penetration of a magnetic field into a super- 
conductor of macroscopic dimensions was made by Casimir (1940). The mutual 
inductance of two coils wound on a superconducting mercury core should become 
slightly smaller as the temperature is lowered, if the penetration depth decreases with 
temperature as suggested by earlier experiments on colloidal mercury (Shoenberg 
1940). The expected decrease of mutual inductance was very small, of order 0.3 μH in the apparatus of Casimir, but with the sensitive bridge used should have been easily detected. In fact Casimir found no decrease that could be attributed to this effect and concluded that either there was a difference in the behaviour of macroscopic and colloidal specimens, or the superconductor did not behave in the same way at the frequency of measurement (about 80 c./sec.) as in the static conditions
of the colloid experiments. 

Since this question is of considerable importance for the phenomenological theory 
of superconductivity, we decided in 1946 to repeat Casimir's experiment, but before
these plans had got very far Professor Casimir informed us that he had discovered 
a possible reason for the negative result of his original experiment. The mercury 
cylinder he had used was sealed off under vacuum in a quartz tube on which the coils 
were wound, and this tube would have slightly increased its diameter as the external 
pressure was reduced in order to lower the temperature of the helium bath. An 
estimate showed that this would have caused a slight increase of mutual inductance 
which roughly compensated the expected decrease due to the reduction of penetration depth. At about the same time further confirmation of the temperature variation of penetration depth was obtained by Desirant & Shoenberg (1948), who measured the static magnetic properties of thin superconducting cylinders, and by Pippard (1947), who measured the difference in skin depth in the superconducting and normal states. In view of this new evidence, the emphasis of our plans changed somewhat; instead of investigating the cause of a negative result (as, for instance, by varying the frequency of measurement), we concentrated on using the method to obtain accurate measurements of the change of penetration depth with temperature in various conditions.

Positive results were indeed obtained with both tin and mercury, but it soon 
became apparent that the state of the surface of the specimen had to be carefully 
controlled before these results could be considered to have any intrinsic value. 
Early in the investigation it seemed very possible that the penetration depth was 
strongly dependent on the direction of current flow with respect to the crystal lattice 
(Laurmann & Shoenberg 1947), and most of the subsequent work was concerned with studying this possibility; it has turned out in fact that the anisotropy, if it exists at all, is much smaller than seemed at first indicated. Another question that has been investigated is the possible dependence of penetration depth on the magnitude of a steady magnetic field superimposed on the alternating measuring field; the indications of such an effect previously found (Desirant & Shoenberg 1948) seem to have been due to secondary causes, and the present experiments show that, up to
80 % of the critical field, the penetration depth does not increase by more than 
a few per cent. 


Details of experimental methods
The measuring apparatus

Since the apparatus was in most respects very similar to that already described 
by Casimir, only brief reference to quantitative details need be given, apart from 
a few points of difference in principle. 

The detailed arrangement of the coil system is shown in figure 1 ; the purpose of 
the upper pair of coils, which are connected to give a mutual inductance opposite 
to the main pair, is to compensate most (usually more than 90 %) of the residual 
mutual inductance left over when a superconducting specimen is inserted as shown. This residual mutual inductance arises from the space occupied by the quartz tube and the gap between the specimen and the quartz, and is fairly large, about 1500 μH for a 9 mm. diameter specimen. If the whole residual mutual inductance were compensated by a variable mutual inductance at room temperature, small changes of room temperature in the course of an experiment could (on account of thermal expansion) cause significant errors of order 0.01 μH, and special precautions would be necessary to keep the room temperature constant; by putting most of the compensation in liquid helium where thermal expansion effects are negligible this trouble is avoided. The quartz tube containing the coils was placed in the appendix of a Dewar vessel, which could be filled with liquid helium. It was found that slight changes of the inclination of the specimen to the quartz tube could cause appreciable changes of mutual inductance, and so to prevent rattling, the specimen was usually lightly wedged with paper, the quartz tube was secured with cotton-wool plugs, and care was taken not to touch the apparatus during a series of measurements.

A Hartshorn bridge was used to measure the mutual inductance, as in Casimir's experiment. The bridge (figure 2) was fed by a phase-shift oscillator supplying alternating current of frequency 72.5 c./sec. and of good wave form. The variable mutual inductance was a Tinsley standard, which could be varied in steps of 10 μH, in series with a continuously variable mutual inductance, designed by Dr Ashmead, which could be set and read reliably to 0.01 μH. The output of the bridge was taken through a three-stage amplifier to a Campbell vibration galvanometer.

Measurements of the in-phase component $M'$ of the mutual inductance proved to be very important, and could be deduced from the resistance $R$ of figure 2 by the relation

= ( 1 ) 

where $\omega$ is the angular frequency. For small values of $M'$, the balance was not appreciably disturbed until $R$ differed considerably from the true setting, and so a 'bracketing' method was used to find $M'$ more accurately. A value of $R$ was found for which the galvanometer deflexion was equal to that for the switch S open ($R$ infinite); the true value of $R$ was then just twice as great (since for large $R$, $r$ and $r'$ may be neglected in the denominator).

A good deal of attention had to be paid in the lay-out of the apparatus to screening connexions from the low-temperature part of the apparatus to the bridge and to earthing the right points of the apparatus, to avoid various stray capacity effects. The apparatus was very sensitive to local disturbances; some were caused by magnetic pick-up from neighbouring power cables, the band width of the detecting arrangements not being small enough to cut out 50 c./sec. effects completely, and some (of an irregular kind) due to some complicated effect of high-frequency radiation, as, for instance, from the neighbouring cyclotron. Most of these troubles were avoided by working at night when the disturbances were usually found to be very much smaller.

Figure 1. The coil system (half actual size). Main and compensating primaries, 62.8 turns per cm.; main secondary, 8000 turns; sp, specimen.

Figure 2. The measuring circuit. O, oscillator; A, amplifier; G, vibration galvanometer; S, two-way switch to change from positive to negative in-phase adjustment; R, six-dial resistance box; rr, fixed low resistances, each 1 Ω; r', fixed low resistance, 1.04 Ω; M, the coil system shown in figure 1.

The overall sensitivity was such that for 20 mA in the primary circuit (which gave a peak field of about 2 gauss at the surface of the superconductor) a change of 1 μH mutual inductance gave a deflexion of 8 cm. of the vibration galvanometer; since in good conditions the 'zero' deflexion (i.e. for no current in the bridge) was less than 1 mm., it was possible to detect changes of mutual inductance of less than 0.01 μH, i.e. somewhat beyond the accuracy with which the continuously variable mutual inductance could be read. In practice usually the mean of five readings was taken for each measurement and, as will be seen later (p. 575), the consistency of the experimental results indicated that such means had standard deviations of order 0.004 μH.

Method of reduction of observations 

The mutual inductance of the coil system when a superconducting rod of cross-
section $A$ and perimeter $p$ is inserted can be written as

$$M = M_0 - M_0' - k(A - p\lambda), \tag{2}$$

where $M_0$ is the mutual inductance of the empty bottom pair of coils, $M_0'$ that of
the top compensating pair (which, as was verified experimentally, was sufficiently
remote to be unaffected by the presence of the superconductor), $k$ is a constant and
$\lambda$ is the penetration depth of a magnetic field into the superconductor. Since the
radius of the superconductor is much larger than the penetration depth, the latter
can be defined, independently of any assumptions about the law of penetration, as

$$\lambda = \frac{1}{H_0}\int_0^\infty H\,dx, \tag{3}$$

where $H_0$ is the applied field, and $H$ is the field inside the superconductor at depth
$x$ below the surface. If now the temperature is varied, the only changes in $M$, apart
from possible small corrections which will be considered later (see p. 564), arise from
changes in $\lambda$, and we have

$$\Delta M = M(T) - M(T_0) = kp[\lambda(T) - \lambda(T_0)] = kp\,\Delta\lambda, \tag{4}$$

where $T_0$ is some standard temperature, usually 2.17 °K in our experiments.* Thus
if $k$ and $p$ are known, $\Delta\lambda$ can be found from experimentally observed values of $\Delta M$.
If the primary coil and the superconducting rod can be treated as infinitely long,

$$M_0 = k_0 A_0, \tag{5}$$

$$k = k_0 = 4\pi nN, \tag{6}$$

where $n$ is the number of turns per unit length and $A_0$ the cross-section of the primary
coil, and $N$ the total number of turns of the secondary.

* Since $\lambda$ varies very little with temperature far below the transition temperature, the precise
value of $T_0$ is not important.



564 E. Laurmann and D. Shoenberg 

In fact, since the primary coil is of finite length, $k$ may be appreciably less than
$k_0$, and its value was found experimentally both by measuring the decrease of $M$
when a superconductor of known cross-section area $A$ was inserted, and from the
slope of the straight line obtained when $M$ is plotted against $A$ for the 20 % range of
$A$ covered by the different specimens used. These values of $k$ agreed to better than
1 %, and were about 8 % lower than $k_0$. Another estimate could be made from the
measured absolute value of $M_0$, using (5), and although it is not obvious that this
value of $k$ should be quite identical with that used in (2) it did in fact agree to 1 %
with the previous estimates, and also agreed reasonably with a calculation of $M_0$
based on the detailed dimensions of the coils.

In applying (4) to calculate $\Delta\lambda$, we shall assume that $p = 2\pi r$, where $r$ is the radius
of the specimen. It will be convenient to postpone a discussion of the validity of this
assumption, equivalent to assuming a perfectly smooth surface, until the experi-
mental results are considered. The value of $r$ for the tin specimens was measured
directly by a micrometer, but for the mercury specimens this could not be done so
easily, and $A$ (and hence $r$) was deduced from the value of $M$ at $T_0$, using equation (2),
and the experimentally determined value of $k$.
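
As a rough numerical illustration of equations (4) and (6), the following sketch converts
a change of mutual inductance into a change of penetration depth. It rests on several
assumptions that are not stated in this form in the text: that (6) is written in
electromagnetic units (1 e.m.u. of inductance = 10^-9 H), that k = 0.92 k_0 adequately
represents the 'about 8 % lower' experimental value, and that the rod diameter is 9.0 mm
(a representative figure). The values n = 62.8 turns per cm. and N = 8000 are taken from
the description of the coil system; the function names are purely illustrative.

```python
import math

# Coil data quoted in the text; everything else below is an assumption.
N_PRIMARY_PER_CM = 62.8   # primary turns per cm.
N_SECONDARY = 8000        # total turns of the main secondary
K_RATIO = 0.92            # assumed: k taken as 8 % below k0

# Assumed unit convention: 1 microhenry = 1e-6 H = 1e3 e.m.u. of inductance.
EMU_PER_MICROHENRY = 1.0e3


def k0_emu():
    """Ideal constant k0 = 4*pi*n*N of equation (6), in e.m.u. per cm^2."""
    return 4.0 * math.pi * N_PRIMARY_PER_CM * N_SECONDARY


def delta_lambda_cm(delta_m_microhenry, diameter_mm, k_ratio=K_RATIO):
    """Change of penetration depth (cm.) from equation (4): dM = k * p * d(lambda)."""
    perimeter_cm = math.pi * diameter_mm / 10.0   # p = 2*pi*r, smooth surface assumed
    k = k_ratio * k0_emu()
    return delta_m_microhenry * EMU_PER_MICROHENRY / (k * perimeter_cm)


if __name__ == "__main__":
    for dm in (0.004, 0.01):
        print(f"dM = {dm} microhenry -> d(lambda) = {delta_lambda_cm(dm, 9.0):.2e} cm.")
```

Under these assumptions a change of 0.01 μH corresponds to roughly 0.6 × 10^-6 cm., which
is the order of magnitude quoted for the blank experiment.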


Discussion of possible systematic errors 

Since the measured effect is so small it is necessary to consider various possible 
causes of systematic error before we can have confidence that observed changes of 
M are not due to causes other than the changes of penetration depth we wish to 
measure. 

One such cause might be a change of specimen radius due to thermal contraction.
The coefficients of thermal expansion of tin and mercury at liquid helium tem-
peratures have not been measured, but, on the basis of Grüneisen’s law, can be
estimated. It turns out that the lowering of temperature from 4 to 2 °K could produce
a change of radius of at most $10^{-7}$ cm. (in mercury, for which the coefficient of thermal
expansion is greatest). Since this is only just comparable with the accuracy to which
$\Delta\lambda$ can be measured, no correction has been made for this effect.

A slightly larger effect comes from change of radius caused by the change in
hydrostatic pressure of the helium bath as the temperature is lowered; this can be
calculated fairly accurately (using data of Grüneisen & Sckell (1934) for solid mer-
cury) and the appropriate small corrections (of order $2 \times 10^{-7}$ cm.) have been applied
in the results.

A number of other possible sources of error were eliminated by a blank experi-
ment in which the specimen was removed and it was established that $M$ changed
by less than 0.01 μH when $T$ was reduced from 4.2 to 2.1 °K. This showed that no
systematic error greater than $0.6 \times 10^{-6}$ cm. could arise from any changes in the coil
system, such as thermal contraction, mechanical changes, or changes of magnetic
properties, associated with the reduction of temperature. One cause of systematic
error, due to the spreading of the superconducting transition somewhat below the
transition temperature, will become apparent when the actual experiments are
described, and is discussed later (p. 571).





Temperature measurement 

The temperatures were deduced from the vapour pressure over the helium bath 
using the 1932 Leiden scale (Keesom 1932); although this scale may be in error by 
as much as 0.01 °K in some regions, this is unimportant, since for our purposes it is
only a high relative accuracy that is required. No stirrer was used in the bath, but 
it was found that temperature equilibrium was quickly achieved during lowering 
of temperature by reducing the pressure. Rapid warming was produced by using 
the main secondary coil as an electric heater, but thermal equilibrium was only 
slowly achieved, so all our readings were taken during cooling. 

Preparation of specimens 

The specimens used at liquid helium temperatures are listed in table 1,* together
with such relevant features as will be needed later.


Table 1

                                                                                 ρ at 18 °C for tin;
                    diameter  preparation  crystal orientation                   at 80 °K for mercury  ρ at 4.2 °K
specimen  material  (mm.)     method       (ψ)                                   (Ω cm. × 10^6)        (Ω cm. × 10^9)

T 0       12966     8.67      a            —                                     —                     2.5
T 1       2135      9.16      b            15°                                   10.3                  3.9
T 3       2135      9.16      b            79°                                   12.7                  4.1
T 5       2135      9.26      b            90°                                   12.7                  4.4
T 6       2356      8.60      b            1°                                    10.4                  1.9
T 9       2356      9.45      b            8°                                    10.4                  2.1
T 10      2135      8.98      f            several crystals, all ~90°            13.8                  4.0
M 2       H 11023   9.86      c            —                                     —                     34.3
M 3       H 11023   9.86      c            —                                     —                     31.9
M 4       H 11023   9.86      c            —                                     6.57                  42.5
M 7       2505      7.76      d            —                                     6.63                  38.5
M 8       2505      7.95      d            —                                     6.62                  44.0
M 9       2505      7.58      d            —                                     6.33                  40.4
M 13      2505      9.00      e            33° (e)                               6.79                  40.2
M 14      2505      9.00      e            62° (e)                               6.38                  37.0
M 15      2505      9.00      e            at least 3 crystals: 89°, 117°, 65°   6.03                  35.2
M 16      2505      9.00      e            27°, 31° (e)                          6.82                  40.0
M 17      2505      9.00      e            56°, 65° (e)                          6.34                  37.0

Notes. T = tin, M = mercury. Material was all spectroscopically pure, Johnson Matthey,
except where marked H (Hilger H.S. quality). Where no crystal orientation is given it is probable
that the specimen was polycrystalline. Orientations deduced electrically are marked (e).


The various methods of preparation were as follows: 

(a) Cast in a glass tube under vacuum; cracked away from the glass and finally 
carefully machined. 

* Only those specimens to which reference is made in the discussion are listed. Actually
twenty-five specimens in all were used in forty liquid helium experiments.




(b) Deliberately grown as a single crystal in a glass tube under vacuum by slowly
lowering a surrounding furnace; a seed crystal of approximately the desired orienta- 
tion but much smaller diameter was used in each case. When cold, the glass wall 
was broken away as carefully as possible and the seed and tailpiece cut off with 
cutting pliers; it was noticed that the glass often cracked during the slow-cooling 
process. This was probably due to the sticking of the metal to the walls, causing 
implosion as the tin contracted on cooling. 

(c) Mercury slowly cooled from the bottom in the quartz tube on which the 
primary coils were wound. Except on one occasion, the quartz tube was found broken 
at the end of the experiment each time this method was used, probably for the same 
reason as mentioned above for tin. It is possible that the breakage may have 
been due to expansion of the mercury on re-melting, but, since care was taken to 
avoid this by warming from the open end, the ‘implosion’ explanation is more
probable. These breakages were of course very awkward, since they necessitated 
making a new mutual inductance on each occasion, and eventually the following 
two methods were tried. 

(d) Mercury slowly cooled from the bottom in a separate quartz tube of smaller 
diameter which could be lowered into the quartz tube on which the primary coils 
were wound. As will be seen later, the results obtained by this method, even though 
considerable variations of the cooling schedule were tried, were mostly unsatis-
factory. In nearly every case the quartz tube was found broken at the end of the
experiment; in one or two cases when it was possible to examine the tube immediately 
after removal from the helium bath, but before the mercury had warmed up to 
liquid nitrogen temperatures, the tube was found already cracked, showing clearly 
that the breakage occurred during cooling rather than warming. It was this evidence 
that eventually led to the ‘implosion’ explanation, and, as will be seen later (p. 576),
the same phenomenon of the mercury sticking to the quartz is probably responsible 
for the unsatisfactory results obtained with these specimens. 

(e) Free mercury single-crystal rods were prepared by the method of Andrade &
Hutchings (1935), in which liquid mercury in a precision-bore glass tube whose
internal wall has been wetted with alcohol, and closed by a steel plug at the bottom,
is slowly lowered into alcohol cooled by solid CO2.* After solidification, the steel
plug (coned to a point internally to promote growth of a single crystal) was with- 
drawn and usually the mercury rod was found to slide out of the tube quite easily. 
Before inserting the rod into the experimental quartz tube (which was pre-cooled 
to 80° K), excess alcohol was wiped off with a rag to prevent sticking, and the rod 
cooled in liquid air (to avoid the danger of melting during the transfer process); the 
rod was handled by a long quartz rod enlarged at its bottom end which was embedded 
in the mercury during solidification. It is of interest to note that three of these rods

* Owing to the larger length and diameter of our rods, some slight modifications were found 
necessary. The lubricating alcohol had to be made rather more viscous by the addition of 
glycerine and the cooling had to be completed rather rapidly; it seemed that if the alcohol was 
too ‘thin’ or the cooling too slow, the lubricating alcohol was able to drain away before the
mercury solidified, and thus did not prevent sticking to the glass. The rapid cooling may have
been responsible for the fact that one of our rods proved to consist of several rather than a single
crystal.




(M 15, 16 and 17) were preserved for 3 months in liquid air while arrangements for
X-ray examination were being prepared. 

(f) Attempts were made to grow tin crystals using a technique similar to that
outlined in (e). The steel plug was replaced by a similar carbon plug, and the lubricant
used was a silicone oil instead of alcohol. It was hoped that by avoiding sticking to
the walls a slightly better quality specimen might be obtained. In fact the results
were no better than those obtained by (b), and, moreover, the specimen actually
used proved later to be not a single crystal.

Examination of the specimens 

(a) Visual

Parts of the surface of the tin crystals were covered with small hemispherical pits 
caused either by bubbles or by shrinkage of the tin during solidification. A rough 
estimate showed that although these pits were numerous, they covered only a very 
small fraction of the surface area, and so their effect in increasing the perimeter was 
probably negligible. The fact that both the tin and mercury crystals appeared highly 
polished suggested that there were no irregularities on a microscopic scale (i.e. com- 
parable with a wave-length of light) which might increase the perimeter. The etched 
specimens could be seen to be very rough, and it was not surprising that the effective 
perimeter of these proved to be much greater than $2\pi r$.

(b) Electrical

Measurements of specific resistance at low temperatures were useful as a clue to
the quality of the specimens, and at high temperatures as indications of crystal
orientation (see below). The values of $\rho$ given in table 1 were deduced from the
changes of mutual inductance caused by eddy currents in the specimen, using standard
formulae for the complex permeability of a cylinder in an alternating magnetic field.
For tin $\rho$ at 4.2 °K is mostly ‘residual’ resistance, and the low values found confirm
the high purity of the material used. It is interesting that the values of $\rho$ at 4.2 °K
seem to be correlated strongly with the batch of tin used, but hardly at all with
crystal orientation, suggesting that perhaps residual resistance is not as strongly
anisotropic as ideal resistance.

For mercury, $\rho$ at 4.2 °K is almost entirely ‘ideal’ resistance, and there was an
interesting contrast in the behaviour of the ‘good’ specimens (prepared by (e) above)
and the others. For ‘good’ specimens the values of $\rho$ were in the same ratios at
4.2 °K as at 80 °K, showing that the same arrangement of crystals persisted near the
surface (within the skin depth of order 0.6 mm.) as in the body of the specimens.
For most of the ‘poor’ specimens, however, there seemed to be no correlation between
the two sets of resistance values, suggesting either that strains at the surface have
upset the crystal orientations there, or more probably that these strains have caused
the ‘residual’ resistance to become so large as to be comparable with the ‘ideal’
resistance.

(c) Crystal orientation 

For the tin crystals, the angle $\psi$ between the tetragonal axis and the rod axis was
found to within a few degrees by the etch-pit reflexion method, using either the etched 
specimen itself or the etched tailpiece. The specific electrical resistance p at room 




temperature provided useful confirmation of these determinations, the ratio of the
values for the ‘parallel’ and ‘perpendicular’ crystals being in good agreement with
that found by Bridgeman (1925).* Some electron diffraction studies by Mr J. W.
Menter on tin specimens similar to those used in the low-temperature experiments
confirmed that they were single crystals within the small depth of penetration of
the electrons used ($\sim 5 \times 10^{-7}$ cm.). This eliminates the possibility that the surface
layer in which the superconducting current flows has a structure differing from the
bulk of the specimen.

For mercury, the orientations were deduced from the value of $\rho$ at 80 °K. It can
be shown (Fraser & Shoenberg 1949) that if the principal axis of a single-crystal
cylinder is inclined at angle $\psi$ to the cylinder axis, the cylinder behaves in an
alternating field as if it had an isotropic resistivity $\rho$ given by

$$\rho = \tfrac{1}{2}[(\rho_1 + \rho_3) + (\rho_1 - \rho_3)\cos^2\psi], \tag{7}$$

where $\rho_1$ and $\rho_3$ are the resistivities perpendicular and parallel to the principal axis.
Thus, using Sckell’s (1930) values, $\rho_1 = 7.04 \times 10^{-6}$ Ω cm., $\rho_3 = 5.35 \times 10^{-6}$ Ω cm. at
80 °K, the orientation $\psi$ of a cylinder can be estimated from its value of $\rho$. For
M 15, 16 and 17 back-reflexion Laue X-ray studies at liquid air temperature were
made by Mr J. V. Smith, and the orientations found for M 16 and 17 agreed within
experimental accuracy with those deduced electrically; M 15 (in agreement with
visual indications of grain boundaries) proved to consist of at least three crystals
with their principal axes roughly perpendicular to the rod. It should be noticed that
(7) applies only to a single crystal; if the specimen contains more than one crystal,
$\rho$ may have any value between $\rho_1$ and $\rho_3$, and in such cases $\rho$ gives a measure of the
degree of preferred orientation. For instance, $\rho = \rho_3$ would indicate that all the
crystals had their trigonal axes perpendicular to the cylinder axis, and also
perpendicular to a radius normal to the cylinder axis.
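
The electrical estimate of orientation can be sketched numerically by inverting
relation (7). The short example below is only an illustration: the function names are
invented here, resistivities are expressed in units of 10^-6 Ω cm., and the two principal
values are Sckell's figures as quoted above. A value of ρ outside the single-crystal range
is reported as having no solution, which is how a specimen of several crystals with
preferred orientation would show up.

```python
import math

# Sckell's values at 80 K as quoted in the text, in units of 1e-6 ohm cm.
RHO_PERP = 7.04   # rho_1, perpendicular to the principal axis
RHO_PAR = 5.35    # rho_3, parallel to the principal axis


def effective_resistivity(psi_deg, rho1=RHO_PERP, rho3=RHO_PAR):
    """Relation (7): apparent isotropic resistivity of a single-crystal cylinder."""
    cos2 = math.cos(math.radians(psi_deg)) ** 2
    return 0.5 * ((rho1 + rho3) + (rho1 - rho3) * cos2)


def orientation_from_resistivity(rho, rho1=RHO_PERP, rho3=RHO_PAR):
    """Invert (7) for psi in degrees; None if rho is outside the single-crystal range."""
    cos2 = (2.0 * rho - (rho1 + rho3)) / (rho1 - rho3)
    if not 0.0 <= cos2 <= 1.0:
        return None
    return math.degrees(math.acos(math.sqrt(cos2)))


if __name__ == "__main__":
    # rho at 80 K for some of the mercury rods listed in table 1.
    for name, rho in (("M 13", 6.79), ("M 14", 6.38), ("M 15", 6.03)):
        print(name, rho, orientation_from_resistivity(rho))
```

For the table 1 values 6.79 and 6.38 this gives angles close to the 33° and 62° listed
there, while 6.03 lies below the single-crystal minimum of (ρ_1 + ρ_3)/2.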


Experimental results
General features 

Measurements of $M$ and $M'$ (the real and imaginary parts of the mutual inductance)
were made as the temperature was lowered, and typical curves are shown in figures
3, 4 and 5. In the normal state $M$ and $M'$ are nearly constant, and from their values
the specific resistance at 4.2 °K was deduced. During the transition to supercon-
ductivity, both $M$ and $M'$ fall rapidly over a narrow temperature interval, of order
0.01 °K for a good tin specimen, 0.003 °K for a ‘good’ mercury specimen, and 0.02 °K
for a ‘poor’ mercury specimen. The transition temperature may be taken with an
uncertainty usually less than 0.001 °K as the temperature at which $M$ has completed
half its descent.
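
The 'half its descent' criterion can be illustrated by a simple linear interpolation. This
is a minimal sketch, not the procedure actually used: it assumes that the largest and
smallest readings in the series represent the normal and superconducting plateaux, and
the readings in the example are invented for illustration.

```python
def half_descent_temperature(temps, m_values):
    """Temperature at which M lies midway between its largest and smallest readings.

    Assumes the extreme readings are the normal-state and superconducting plateaux,
    and interpolates linearly between the two readings that bracket the midpoint.
    """
    m_half = 0.5 * (max(m_values) + min(m_values))
    points = sorted(zip(temps, m_values))
    for (t0, m0), (t1, m1) in zip(points, points[1:]):
        if m0 != m1 and (m0 - m_half) * (m1 - m_half) <= 0.0:
            return t0 + (m_half - m0) * (t1 - t0) / (m1 - m0)
    raise ValueError("M never crosses its half-descent value")


# Hypothetical readings (temperature in deg K, M in arbitrary units).
example_temps = [3.700, 3.710, 3.714, 3.716, 3.718, 3.722, 3.730]
example_m = [0.0, 0.0, 10.0, 50.0, 90.0, 100.0, 100.0]

if __name__ == "__main__":
    print(half_descent_temperature(example_temps, example_m))
```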

For further lowering of temperature, the variation of M and M' is very much less 
rapid and is shown in figures 6, 7 and 8, where the scales of ordinates are greatly 

* Our absolute values were, however, about 5 % higher than those of Bridgeman; part of 
this discrepancy may be due to slight systematic errors in our method of finding p. 




expanded (400 times in figure 6 and 1000 times in figures 7 and 8). In these diagrams
it is convenient to plot not $M$ and $M'$, but their differences $\Delta M$ and $\Delta M'$ from the
values they assume at a low temperature (2.17 °K). In fact $M'$ should become zero



Figure 3. Transition to superconductivity of T 9.



if no energy is dissipated in the specimen, but owing to small dissipation elsewhere
in the circuit, $M'$ approaches a constant value (usually of order 0.2 μH) at low
temperatures, and it is reasonable to assume that only $\Delta M'$ represents the dissipation





Figure 6. Variation of $\Delta M$ and $\Delta M'$ for T 9 below $T_c$, illustrating method of correction for
in-phase effect. $\Delta M_1$, $\Delta M_1'$, with earth’s field compensated; $\Delta M_2$, $\Delta M_2'$, with earth’s
horizontal field not compensated; $\Delta M_{\mathrm{corr.}}$, $\Delta M_1$ corrected for in-phase effect.

Figure 7. Variation of $\Delta M$ and $\Delta M'$ for a ‘good’ mercury specimen, M 15, below $T_c$
(earth’s field compensated). No in-phase correction is necessary.

Figure 8. Variation of $\Delta M$ and $\Delta M'$ for a ‘poor’ mercury specimen, M 9, below $T_c$
(earth’s field compensated). $\Delta M_{\mathrm{corr.}}$, $\Delta M$ corrected for the in-phase effect.*

* The curves for the earth’s field uncompensated cannot be shown conveniently on the same
scale: thus at the highest temperature, 4.134 °K, $\Delta M_2$ = 5.86 μH and $\Delta M_2'$ = 1.92 μH, while
at 4.071 °K, $\Delta M_2$ = 1.53 μH and $\Delta M_2'$ = 0.67 μH.




effects due to the specimen. As for $\Delta M$, this should be proportional to $\Delta\lambda$, as already
explained, and, as can be seen from figures 6, 7 and 8, the changes expressed in cm. by
means of (4) are of the expected order of magnitude.

It will be seen that for M 15, $\Delta M'$ has fallen to practically zero within about
0.01 °K of $T_c$, but for T 9 and more so for M 9, $\Delta M'$ is still appreciable at much lower
temperatures. The probable interpretation of a non-zero $\Delta M'$ is that a very small and
diminishing fraction (of order 1 in 5000 for T 9) of the metal surface remains in the
normal state below $T_c$ and that $\Delta M'$ arises from eddy-current losses in these normal
regions. In other words, although most of the transition to superconductivity takes
place in a narrow range of temperature, the transition actually extends a long way
below this range. Now in a normal region the alternating magnetic field will penetrate
very much deeper than in a superconducting region, in fact, to the order of the skin
depth, which for the frequency used is about 0.2 mm. in tin and 0.6 mm. in mercury.
Thus the presence of a non-zero $\Delta M'$ is an indication that the $\Delta M$ may be due not
only to change of genuine superconducting penetration depth but partly also to
ordinary skin-effect penetration in small normal regions. That this is really so is
shown by the fact that $\Delta M$ at a given temperature was larger if $\Delta M'$ was larger.

Particularly large $\Delta M'$ values can be obtained, as already noted by Casimir, if
a specimen is cooled in the earth’s magnetic field. The curves marked $\Delta M_2$ and $\Delta M_2'$
in figure 6 refer to such a case, and it is evident that the part of $\Delta M_2$ associated with
the in-phase component $\Delta M_2'$ is quite comparable to the part with which we are
concerned, arising from change of penetration depth. The large values of $\Delta M_2$ and
$\Delta M_2'$ are presumably due to ‘freezing in’ of the earth’s horizontal field in a few
patches where there are holes or inhomogeneities; as the metal cools, the critical
field rises and the normal patches where the trapped flux issues shrink to keep this
flux constant.

To avoid this effect, the earth’s field was carefully compensated (to within about
1 %) in all the basic experiments (as, for instance, in the $\Delta M_1$ and $\Delta M_1'$ curves of
figure 6), but a subsidiary set of readings was also taken during a second cooling in
which the earth’s horizontal field was uncompensated (e.g. the $\Delta M_2$ and $\Delta M_2'$ curves
of figure 6). On the basis of these readings it was possible to estimate a correction to
allow for that part of $\Delta M_1$ associated with the in-phase component $\Delta M_1'$ and due to
penetration into normal regions. Thus if it is assumed that this unwanted extra
part is proportional to $\Delta M'$ (and it was found that this was roughly true), the
extra part is given by

$$\Delta M_1'\,(\Delta M_2 - \Delta M_1)/(\Delta M_2' - \Delta M_1').$$


For T 9, as shown in figure 6, this produces a correction of order 10 %, and the curve
$\Delta M_{\mathrm{corr.}}$ is obtained when this unwanted part is subtracted from $\Delta M_1$. This procedure
was applied in all the results quoted below, but, since the accuracy of the correction
is somewhat uncertain, it is not impossible that systematic errors of as much as half
the correction may still be left, and where (as for M 9) a correction of much more
than 20 % is required, the results must be considered unreliable.
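
The in-phase correction just described amounts to a two-point estimate of the
proportionality between the unwanted part of ΔM and ΔM'. A minimal sketch follows; the
function name and the numerical readings are hypothetical and chosen only so that the
correction comes out at about 10 %, the order quoted for T 9.

```python
def in_phase_correction(dm1, dm1_prime, dm2, dm2_prime):
    """Return (corrected dM1, unwanted part of dM1).

    dm1, dm1_prime: readings with the earth's field compensated.
    dm2, dm2_prime: readings at the same temperature with the field uncompensated.
    The unwanted part is dM1' * (dM2 - dM1) / (dM2' - dM1'), on the assumption
    that it is proportional to dM'.
    """
    denominator = dm2_prime - dm1_prime
    if denominator == 0.0:
        raise ValueError("the two runs have equal dM'; the ratio is undefined")
    extra = dm1_prime * (dm2 - dm1) / denominator
    return dm1 - extra, extra


# Hypothetical readings in microhenries (not taken from the paper).
corrected, extra = in_phase_correction(0.50, 0.02, 1.10, 0.26)

if __name__ == "__main__":
    print(f"unwanted part {extra:.3f}, corrected dM {corrected:.3f}, "
          f"correction {100.0 * extra / 0.50:.0f} %")
```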

The ‘pathology’ of this ‘in-phase’ effect proved to be complicated, but it seems 
that the residual $\Delta M'$ effect when the earth’s field has been compensated is least in




specimens with the least internal stresses. Thus there was a great difference between
the mercury specimens prepared by the methods (d) (M 5 to 12) and (e) (M 13 to 17)
above; the cracking of the quartz containers suggested that the former were under
considerable stress and the $\Delta M'$ values for most of these were high, quite comparable
occasionally with the high values obtained when the earth’s field was uncompensated.
The free rods, however, of which figure 7 represents a typical example, had almost
negligible values of $\Delta M'$ except very close to $T_c$. As already mentioned the transition
curves for the bad specimens were much broader than those for good specimens,
again suggesting a worse state of internal stress. A further indication that $\Delta M'$ is
associated with some sort of defect is that generally (though not always) the $\Delta M'$
values with the earth’s field compensated are highest for specimens where lack of
compensation during cooling has the biggest effect in increasing $\Delta M'$; thus for some
of the free mercury rods where $\Delta M'$ was almost negligible the earth’s field had hardly
any effect either, except very close to $T_c$.

For any one specimen, the $\Delta M'$ values were not always reproducible from one
experiment to another, and sometimes even within the same liquid helium run the
$\Delta M'$ values became twice as large (and the $\Delta M$ values increased correspondingly)
in a second cooling, although the earth’s field was compensated throughout.*
Occasionally it was found possible to reduce $\Delta M'$ considerably by temporarily
increasing the measuring current enough to start destruction of superconductivity.
These various effects suggest that the normal regions responsible for $\Delta M'$ are not
very stable, and that their formation and arrangement may depend on rather trivial
factors, but no detailed explanation of all the observed peculiarities has been found.

In the above description no mention has yet been made of the importance of the
measuring current $i$. Where $\Delta M'$ was negligible, $\Delta M$ was accurately independent
of $i$, but where $\Delta M'$ was appreciable, both $\Delta M$ and $\Delta M'$ usually increased with $i$,
presumably due to the magnetic field of the current increasing the fraction of normal
metal present. When the correction for $\Delta M'$ was not too big, it was found that the
corrected $\Delta M$ was usually practically independent of $i$, which gave confidence in
the reliability of the correction, and our procedure was to use the highest value of $i$
which did not cause too big a value of $\Delta M'$. For about 0.01 °K below $T_c$, $i$ had to be
less than 5 mA, and it was usually impossible to get reliable readings of $M$ and $M'$
closer than this to $T_c$, both because only poor accuracy could be obtained with values
of $i$ below 5 mA, and because, owing to spread of the transition, $\Delta M'$ was nearly
always large in this region. For lower temperatures the value of $i$ could be increased,
and usually it was possible to use 20 mA for temperatures more than 0.02 °K below
$T_c$; there was usually no gain in accuracy in going beyond 20 mA.

Quantitative data 

A direct plot of $\Delta\lambda$ against $T$ as in figures 6 and 7 does not lend itself conveniently
to quantitative discussion because of the very steep rise of $\Delta\lambda$ close to $T_c$. The nature
of the results is better displayed if use is made of the relation

$$(\lambda/\lambda_0)^2 = 1/(1 - (T/T_c)^4), \tag{8}$$

* Our preliminary observation that long exposure to air systematically increases $\Delta M'$ for
tin, has not been confirmed.




which fits the mercury colloid results well (Shoenberg 1940; Daunt, Miller, Pippard
& Shoenberg 1948). From this relation it follows that

$$\Delta\lambda = \lambda_0 (1 - (T/T_c)^4)^{-1/2} - \lambda(2.17\,^\circ\mathrm{K}), \tag{9}$$

so it is appropriate to plot experimental values of $\Delta\lambda$ against $z$, where

$$z = (1 - (T/T_c)^4)^{-1/2}, \tag{10}$$

rather than against $T$ itself.

Good linear plots are in fact obtained, as shown in figures 9 and 10, for some
examples which are discussed below; the intercept on the $\Delta\lambda$ axis gives $\lambda$(2.17 °K)
and the slope gives $\lambda_0$, the penetration depth at 0 °K, if the relation (8) is true. This
proviso is necessary, since in fact most of the experimental points are for $T$ close to
$T_c$, and an almost equally good straight line (but of different slope) would be obtained
if the 4 in (8) were replaced by 3 or 5, so that (8) cannot be regarded as more than
roughly confirmed by our experiments.

All the experiments have been analyzed by this method and the results are sum-
marized in table 2. Usually a least-squares analysis has been made and the slope
$\lambda_0$ and intercept, $\lambda$(2.17 °K), are given together with their standard deviations.
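
The reduction can be sketched as an ordinary straight-line least-squares fit of Δλ
against z. The example below is an illustration only: it uses the textbook formulae for
the standard deviations of slope and intercept (the text does not say exactly which
formulae were used), and the data are synthetic, generated from relation (9) with an
assumed λ_0 of 5.0 × 10^-6 cm. and T_c = 3.715 °K. With Δλ defined as in (9), the fitted
intercept is negative and its magnitude is λ(2.17 °K).

```python
import math


def z_variable(temp, t_c):
    """Relation (10): z = (1 - (T/Tc)^4)^(-1/2)."""
    return (1.0 - (temp / t_c) ** 4) ** -0.5


def fit_line(xs, ys):
    """Least-squares line y = intercept + slope * x.

    Returns (slope, intercept, sd_slope, sd_intercept, sd_single_point).
    """
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = y_mean - slope * x_mean
    residual_ss = sum((y - intercept - slope * x) ** 2 for x, y in zip(xs, ys))
    variance = residual_ss / (n - 2)
    sd_slope = math.sqrt(variance / sxx)
    sd_intercept = math.sqrt(variance * (1.0 / n + x_mean ** 2 / sxx))
    return slope, intercept, sd_slope, sd_intercept, math.sqrt(variance)


# Synthetic example (assumed values, not measurements from the paper).
T_C = 3.715          # deg K
LAMBDA_0 = 5.0       # in units of 1e-6 cm.
temps = [2.17, 3.00, 3.30, 3.50, 3.60, 3.65, 3.68]
zs = [z_variable(t, T_C) for t in temps]
delta_lambda = [LAMBDA_0 * (z - z_variable(2.17, T_C)) for z in zs]

slope, intercept, sd_slope, sd_intercept, sd_point = fit_line(zs, delta_lambda)

if __name__ == "__main__":
    print(f"lambda_0 = {slope:.3f}, lambda(2.17 K) = {-intercept:.3f} (units of 1e-6 cm.)")
```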

Table 2

                                                                     standard
                                                                     deviation of    highest   in-phase
              date of                  λ0             λ(2.17 °K)     single point    value of  correction at
specimen      experiment     T_c (°K)  (cm. × 10^6)   (cm. × 10^6)   (cm. × 10^6)    z used    highest z (%)

T 0           18. vii. 47    3.712     5.60 ± 0.07    5.4 ± 0.2      0.34            6.24      2
T 1           28. i. 48      3.714     5.11 ± 0.06    5.8 ± 0.2      0.23            5.26      2
T 1 etched    11. ii. 48     3.715     17.2           —              —               6.67      0
T 1 polished  26. xi. 48     3.715     5.19 ± 0.05    5.1 ± 0.2      0.32            7.65      1
T 6           27. ii. 48     3.715     5.32 ± 0.09    5.8 ± 0.4      0.50            6.39      10
T 9           5. iii. 48     3.716     5.34 ± 0.07    5.5 ± 0.2      0.29            5.50      9
T 9           10. iii. 48    3.717     5.13 ± 0.05    5.4 ± 0.2      0.20            5.34      9
T 9           12. iii. 48    3.717     5.03 ± 0.10    5.4 ± 0.3      0.42            5.25      7
T 3           4. ii. 48      3.715     5.63 ± 0.06    6.0 ± 0.2      0.23            5.13      10
T 3 polished  3. xii. 48     3.716     5.28 ± 0.08    5.3 ± 0.2      0.24            4.53      11
T 5           18. ii. 48     3.715     5.67 ± 0.13    5.8 ± 0.4      0.41            4.53      2
T 5           17. iii. 48    3.715     5.77 ± 0.05    6.0 ± 0.2      0.17            5.50      8
T 5           19. xi. 48     3.715     5.78 ± 0.05    5.8 ± 0.2      0.33            7.65      7
T 5 polished  10. xii. 48    3.716     4.91 ± 0.08    4.4 ± 0.3      0.49            7.42      13
T 10          5. xi. 48      3.717     6.5            —              —               4.53      11
M 2           25. vii. 47    4.156     10.0           —              —               6.75      33
M 3           8. viii. 47    4.158     4.10 ± 0.05    4.5 ± 0.2      0.28            6.50      9
M 4           14. v. 48      4.160     5.5            —              —               4.92      12
M 7           2. vi. 48      4.154     4.06 ± 0.11    4.6 ± 0.5      0.41            4.85      13
M 8           4. vi. 48      4.164     17.0           —              —               3.80      11
M 9           9. vi. 48      4.158     7.2            —              —               6.90      31
M 13          18. viii. 48   4.154     4.48 ± 0.09    5.5 ± 0.3      0.51            7.42      0
M 14          20. viii. 48   4.156     4.33 ± 0.07    5.5 ± 0.3      0.47            8.36      3
M 15          1. x. 48       4.151     4.08 ± 0.02    4.2 ± 0.1      0.14            7.83      2
M 16          15. x. 48      4.153     4.34 ± 0.04    4.5 ± 0.3      0.22            8.64      0
M 17          10. xi. 48     4.153     4.22 ± 0.03    4.2 ± 0.1      0.18            8.98      1

Notes. No values of the intercept λ(2.17 °K) are given where only graphical analysis has been made, since
the accuracy would be poor. The percentage corrections quoted are the percentages of the original ΔM
values, which have been subtracted.






Figure 9. Variation of $\Delta\lambda$ with $z$ for tin. ▽ T 1 as cast; ○ T 1 after acid etching; △ T 1 after
electrolytic polishing. The broken curve is based on the results of Pippard (1947).

Figure 10. Variation of $\Delta\lambda$ with $z$ for some mercury specimens. □ M 8; × M 4; ○ M 15;
+ M 13; M 16. To avoid confusion, some of the experimental points have been omitted.





Where the results are required only for qualitative discussion, only the slope obtained
from a graphical plot is quoted and no standard deviation is given. Before making
a least-squares analysis each set of data was plotted graphically, and in some cases
a few points for very high $z$, which lay obviously off the straight line plot, were
rejected. Such departures could be due either to small errors in $T$ or $T_c$ (for instance,
3 % error in $z$ at $z = 7$ is caused by 0.001 °K error in $T$ or $T_c$) or to inaccurate estima-
tion of the in-phase correction, which usually increases with $z$ (the obvious departures
from linearity were usually most marked in experiments where this correction was
large). For each series we quote the highest value of $z$ used in the analysis, and the
size of the in-phase correction at that point, which as already mentioned gives an
idea of the possible systematic error.
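
The sensitivity of z to a small temperature error can be checked directly from
relation (10). The sketch below is illustrative: the function names are invented, and
T_c = 3.715 °K (tin) and 4.155 °K (mercury) are taken as representative of table 2.

```python
def z_variable(temp, t_c):
    """Relation (10): z = (1 - (T/Tc)^4)^(-1/2)."""
    return (1.0 - (temp / t_c) ** 4) ** -0.5


def temperature_for_z(z, t_c):
    """Temperature at which relation (10) gives the stated value of z."""
    return t_c * (1.0 - 1.0 / z ** 2) ** 0.25


def relative_z_error(z, t_c, temp_error):
    """Fractional change of z when T is in error by temp_error (deg K)."""
    temp = temperature_for_z(z, t_c)
    return (z_variable(temp + temp_error, t_c) - z) / z


if __name__ == "__main__":
    for label, t_c in (("tin", 3.715), ("mercury", 4.155)):
        err = relative_z_error(7.0, t_c, 0.001)
        print(f"{label}: {100.0 * err:.1f} % error in z at z = 7 for 0.001 deg K")
```

Both cases come out between 2 and 3 %, in line with the figure of about 3 % quoted above.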

Discussion of results

As can be seen from table 2, the slope estimates of λ₀ are much more reliable than 
the intercept estimates of λ(2.17° K), and so, apart from noting that the estimates of 
λ(2.17° K) are of the right order of magnitude and usually reasonably close to 
λ₀{1 − (2.17/T_c)⁴}^(−1/2) (this is 1.04λ₀ for mercury and 1.06λ₀ for tin), we shall make 
no further use of the intercepts.* The values of the standard deviation of any single 
point from the linear plot are surprisingly low, ranging from 15 to 50 Å; these 
correspond to from 0.0025 to 0.0085 μH expressed in mutual inductance, and as already 
mentioned (p. 563) are slightly lower than the precision with which any individual 
setting can be read. Where any specimen has been measured on more than one 
occasion, even after a long interval, the consistency is reasonably satisfactory (e.g. see 
results for T 5 and T 9), which gives added confidence in the significance of the results. 
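
The factors 1.04 and 1.06 quoted above can be checked with a short calculation. The sketch below assumes that z = {1 − (T/T_c)⁴}^(−1/2), which is consistent with the expression quoted for 2.17° K, and uses approximate transition temperatures (4.15° K for mercury, 3.73° K for tin) that are assumptions and are not taken from this section. It also estimates how much a 0.001° K error in T changes z near z = 7.

```python
def z_factor(temp, t_c):
    """Assumed form z = (1 - (T/Tc)**4) ** -0.5."""
    return (1.0 - (temp / t_c) ** 4) ** -0.5

# Approximate transition temperatures in deg K (assumed values).
TC_ASSUMED = {"mercury": 4.15, "tin": 3.73}

for metal, t_c in TC_ASSUMED.items():
    print(metal, round(z_factor(2.17, t_c), 2))

def relative_change_in_z(z_value, t_c, temp_error=0.001):
    """Fractional change of z caused by an error temp_error in T at a given z."""
    temp = t_c * (1.0 - 1.0 / z_value ** 2) ** 0.25
    return z_factor(temp + temp_error, t_c) / z_value - 1.0

print(round(100 * relative_change_in_z(7.0, TC_ASSUMED["tin"]), 1), "% change in z")
```

With these assumed transition temperatures the change in z at z = 7 comes out at a few per cent, of the same order as the 3 % mentioned earlier.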

In our early experiments the large difference in λ₀ between M 2 and M 3 suggested 
that perhaps these two specimens had different crystal orientations and that λ₀ 
depended strongly on the direction of the superconducting current relative to the 
crystal axes (Laurmann & Shoenberg 1947).† Most of the subsequent experiments 
were directed to investigating this suggestion, and it soon became very unlikely that 
all the difference between M 2 and M 3 could be due to such an anisotropic effect. 
For instance, M 4, 7 and 8 all had approximately the same specific resistance at 80° K 
and so presumably the same kind of arrangement of crystals, and yet the values of 
λ₀ are very different. It became in fact much more likely that these differences and 
also the originally noted difference between M 2 and M 3 were due to surface 
conditions. More direct evidence of the lack of any marked anisotropy was eventually 
obtained from the ‘good’ specimens M 13 to 17, which although clearly varying in 
crystal orientation show nearly equal values of λ₀. 

The importance of surface conditions was strikingly brought out by the behaviour 
of T 1 after acid etching. As can be seen from figure 9 and table 2, the apparent 
penetration depth goes up by a factor of 3.4, presumably due to roughening of the surface, 
which effectively increases the perimeter round which the superconducting current 

* A bad reading at 2.17° K can cause a large error in the intercept without much affecting the 
slope; this is probably the cause of the high intercepts for M 13 and 14. 

† In the preliminary note T_c for M 2 was wrongly given as 4.165° K. 





576 E. Laurmann and D. Shoenberg 

flows. That the magnification is due entirely to surface conditions was further 
demonstrated by the fact that the original results were reproduced when the surface 
was made smooth again by electrolytic polishing. It thus seems probable that the 
anomalously large values of λ₀ found with some of the mercury specimens are due to 
cracking of the surface associated with sticking of the mercury to the quartz 
container as it cools (as already mentioned on p. 566). 

It is possible that surface conditions are responsible for a misleading appearance 
of anisotropy for the tin crystals too. Thus our first experiments indicated a 
difference between the results for the ‘perpendicular’ crystals T 3 and T 5 (average 
λ₀ = 6.71 × 10⁻⁶ cm) and the ‘parallel’ crystals T 1, T 6 and T 9 (average 
λ₀ = 6.21 × 10⁻⁶ cm), which, even allowing for possible systematic errors associated 
with the in-phase correction, seemed significant. Once we were aware of the 
possibilities of mistaken interpretation due to poor surface conditions, we remeasured 
T 1, T 3 and T 5 after electrolytic polishing, and it can be seen that no significant 
difference remains (if anything, there is a suggestion of anisotropy in the opposite 
sense). It is interesting that although λ₀ for the perpendicular crystals is reduced by 
polishing there is no such reduction for the parallel crystal, suggesting that perhaps 
surface cracks occur more easily in perpendicular than in parallel crystals. Such 
cracks must presumably be on a submicroscopic scale, since all the crystals had the 
same highly polished appearance even before electrolytic polishing. It was hoped 
that T 10, prepared by the method which proved so successful with mercury, might 
have a more nearly perfect surface than T 3 and T 5, but actually there was no 
significant improvement. 

The early measurements on the machined tin cylinder T 0 are included in table 2 
for completeness; the results for T 0 seem to indicate a slightly higher value of λ₀ 
than do the best crystals; this may be due to slight roughness or to the fact that T_c was 
not directly measured and had to be estimated from the results themselves, which 
can easily cause a systematic error of a few per cent in λ₀. 

If λ is indeed slightly anisotropic, and λ₁ and λ₃ are the penetration depths for 
current flow perpendicular and parallel to the principal crystal axis respectively, the 
observed penetration depth for a ‘parallel’ crystal should be λ₁ (since the current 
flows everywhere perpendicular to the rod axis). For a ‘perpendicular’ crystal, 
however, the penetration depth varies round the circumference, and the observed 
penetration is an average between λ₁ and λ₃; as shown elsewhere (Fraser & 
Shoenberg 1949) the appropriate average for slight anisotropy is the arithmetic mean 
½(λ₁ + λ₃). More generally, for a crystal whose principal axis is inclined at angle θ to 
the rod axis, we should find an effective penetration λ₀ given by 

$$\lambda_0 = \tfrac{1}{4}\{(3\lambda_1 + \lambda_3) + (\lambda_1 - \lambda_3)\cos 2\theta\}. \tag{11}$$

Thus any observed difference in λ₀ for parallel and perpendicular single crystals 
represents only half the difference between λ₁ and λ₃. 
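
Equation (11) and its limiting cases can be checked numerically. The sketch below is illustrative only: the function names are hypothetical, and the polycrystal average assumes crystal axes distributed at random in direction (weight sin θ), which is an assumption made here to recover the value ⅓(2λ₁ + λ₃) quoted later for a polycrystal. The numerical values are those suggested later for mercury, in units of 10⁻⁶ cm.

```python
import math

def effective_depth(lam1, lam3, theta):
    """Equation (11): effective depth for a principal axis at angle theta (radians)."""
    return 0.25 * ((3.0 * lam1 + lam3) + (lam1 - lam3) * math.cos(2.0 * theta))

def polycrystal_depth(lam1, lam3, steps=20000):
    """Average of equation (11) over randomly directed axes (assumed weight sin(theta))."""
    total = 0.0
    weight = 0.0
    for i in range(steps):
        theta = (i + 0.5) * (math.pi / 2.0) / steps  # midpoint rule on 0..pi/2
        w = math.sin(theta)
        total += w * effective_depth(lam1, lam3, theta)
        weight += w
    return total / weight

lam1, lam3 = 4.6, 3.8  # units of 1e-6 cm
print("parallel crystal:     ", effective_depth(lam1, lam3, 0.0))
print("perpendicular crystal:", effective_depth(lam1, lam3, math.pi / 2.0))
print("polycrystal average:  ", round(polycrystal_depth(lam1, lam3), 3))
```

The parallel and perpendicular cases reproduce λ₁ and ½(λ₁ + λ₃), so their difference is half of λ₁ − λ₃, as stated above.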

For tin, bearing in mind the rather large uncertainties due to in-phase corrections 
and assuming that the electrolytically polished specimens have perfect surfaces, the 
difference between λ₁ and λ₃ is unlikely to exceed 20 %, and there is no definite 
indication which is the bigger. For mercury, considering only the good specimens 




M 13 to 17, figure 11 shows that there is some correlation between λ₀ and the value 
of ρ at 80° K. As can be seen from (7) and (11), both λ₀ and ρ should vary in the same 
way with orientation, and it is probably justified to suppose that even for specimens 
consisting of several crystals (such as M 15) λ₀ and ρ are affected by any preferred 
orientation in the same way; thus we should expect, if λ is anisotropic, to find a linear 
relation between λ₀ and ρ. On the basis of the straight line drawn in figure 11, it 
follows that λ₁ = 4.6 × 10⁻⁶ cm and λ₃ = 3.8 × 10⁻⁶ cm, but this interpretation is 
speculative, since the random errors of the measurements and possible systematic 
errors due to surface conditions are of the same order as the observed differences of λ₀. 



Figure 11. Variation of λ₀ with ρ at 80° K for the ‘good’ mercury specimens M 13 to 17. 
The vertical lines indicate standard deviations. 

As regards absolute mean values, we estimate λ₀ = 5.2 × 10⁻⁶ cm for tin and 
4.3 × 10⁻⁶ cm for mercury (for a polycrystal we should have λ₀ = ⅓(2λ₁ + λ₃)). If 
surface cracks are still present even in the polished tin and the ‘good’ mercury 
specimens, these may be overestimates, though the general agreement between 
specimens makes this improbable. It should be remembered, too, that λ₀ has been 
calculated on the assumption that (8) applies, which has been reliably confirmed only 
for mercury; if (8) breaks down at low temperatures, λ₀ will no longer mean the value 
of λ at 0° K, but merely the parameter which fits the data to equation (9) best. 

These estimates of λ₀ for tin and mercury may be compared with some previous 
estimates. The only previous direct estimate of λ₀ for mercury was that of Désirant 
& Shoenberg (1948), based on the susceptibility of thin cylinders, which gave 
λ₀ = 7.6 × 10⁻⁶ cm, nearly twice as big as the present estimate. Probably 
the discrepancy arises because of the troublesome effects described above which 
occur when mercury is cooled in a container. In the susceptibility method there was 
no ‘tell-tale’ such as the ‘in-phase’ effect to show that anything was wrong, and 
since a large number of cylinders were always used simultaneously, probably the 
ratio 7.6 to 4.3 represents some average magnification effect. Whether this 
magnification is due to a direct increase of perimeter caused by cracked surfaces or to the 
extra penetration in normal regions persisting below T_c (the earth’s magnetic field 
was not compensated) cannot be decided. Indirect estimates of λ₀ for mercury have 





been made by Pippard (1947) from the R.F. resistance and reactance at 1200 Mc./sec., 
and have also indicated a value of about 7.5 × 10⁻⁶ cm. Since it is improbable that 
our estimate can be too low, this discrepancy suggests that Pippard’s interpretation 
of his R.F. results may need revision. 

In view of their high estimate of λ₀, Désirant & Shoenberg’s (1948) discussion 
of Ginsburg’s (1945) interpretation of the critical field measurements on thin 
mercury films by Appleyard, Bristow, London & Misener (1939) requires revision. 
Combining the present results with the thin film data, Ginsburg’s theory gives a length 
(defined as 8π(α_n − α_s)/H_c², where α_n and α_s are the surface tensions at a boundary 
between an insulator and the normal and superconducting phases respectively) as 
7 × 10⁻⁶ cm at 0° K, and rising roughly proportionally to λ with rise of temperature. 
This agrees qualitatively with Ginsburg’s own interpretation based on the thin 
film results alone. 

For tin, there have been several previous measurements. The above-mentioned 
method of Désirant & Shoenberg was applied to one specimen of thin tin cylinders 
in pyrex capillaries and gave rather higher values of Δλ than the present experiments, 
probably partly for the same reasons as in the case of mercury, and partly because 
the specimen had a spread-out transition and an abnormally high value of T_c. A much 
more precise determination was that of Pippard (1947), based on the change of 
reactance of a superconducting resonator at 1200 Mc./sec. when superconductivity 
is destroyed by a magnetic field. This method could not be applied directly to mercury 
owing to the variation of the normal resistance with temperature; the interpretation 
is far more direct than for the other R.F. measurements mentioned above. The 
temperature variation of Δλ deduced by Pippard is shown in figure 9,* and it can be seen 
that below z = 2 his results agree well with our T 1 results. The divergences at higher 
z may be due to inadequacy of the theoretical correction used to allow for the effect 
of ‘normal’ electrons, which should, however, be important only for high z (when the 
proportion of normal electrons grows rapidly). Evidence for the inadequacy of the 
theory in this region comes from more recent work at 9200 Mc./sec. (Pippard 1948), 
which shows that the theory, although qualitatively describing the results, does 
fail in quantitative details. 

The only other study of penetration effects in tin is that of Shalnikov & Sharvin 
(1948), who used a method in which the temperature of an ellipsoid in a steady 
magnetic field was oscillated, and the resulting e.m.f. in a surrounding coil due to 
oscillations of penetration depth observed. Absolute values of λ were deduced by 
integration and extrapolation processes, and show a good linear dependence on z, 
in agreement with (8). The value of λ₀ is, however, 11.7 × 10⁻⁶ cm, which is more 
than twice our value. Since the method has no ‘tell-tale’ like our in-phase 
measurement to give warning of extraneous effects due to normal regions, and since the 
specimen used by Shalnikov & Sharvin was prepared in a way which might cause 
roughness of surface on a microscopic scale, it is unlikely that the observed high value 
is genuine. 

* It is probable that the specimen used consisted of a single crystal or a few large crystals, 
but nothing is known of the orientation. 




Our absolute values of λ₀, which the above discussion suggests are the most 
reliable available, may be translated into an effective number of ‘superconducting’ 
electrons, n_s per cm³, on the basis of the London & London (1935) relation 

$$\lambda = (mc^2/4\pi n_s e^2)^{1/2},$$

where m is the effective mass of such electrons. We can, therefore, deduce the value 
of the ratio 

$$r = \frac{n_s m_0}{n m} = \frac{m_0 c^2}{4\pi n e^2 \lambda_0^2},$$

where n is the number of atoms per cm³ and m₀ is the ordinary electronic mass. 
Putting in numerical values, we find r = 0.30 for tin and r = 0.35 for mercury at 
0° K. If we suppose m = m₀ this means that for both metals only about one electron 
from every three atoms is effective, or if we suppose all the valence electrons are 
effective m = 5.7m₀ for mercury and m = 13m₀ for tin. 
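
The order of magnitude of r can be reproduced from the London relation. The following is a rough sketch, not the authors' calculation: the ratio is taken as r = m₀c²/(4πne²λ₀²), the densities and atomic weights used to obtain n are approximate assumed values not given in the text, and Gaussian (c.g.s.) units are used throughout.

```python
import math

M0 = 9.109e-28       # electron rest mass, g
C = 2.998e10         # velocity of light, cm/sec
E = 4.803e-10        # electronic charge, e.s.u.
AVOGADRO = 6.022e23  # atoms per gram-atom

def london_ratio(lambda0_cm, density, atomic_weight):
    """r = m0 c^2 / (4 pi n e^2 lambda0^2), with n the number of atoms per cm^3."""
    n_atoms = density * AVOGADRO / atomic_weight
    return M0 * C ** 2 / (4.0 * math.pi * n_atoms * E ** 2 * lambda0_cm ** 2)

# Densities (g/cm^3) and atomic weights are assumed, approximate values.
r_tin = london_ratio(5.2e-6, 7.3, 118.7)
r_mercury = london_ratio(4.3e-6, 14.4, 200.6)
print(f"tin:     r = {r_tin:.2f}")
print(f"mercury: r = {r_mercury:.2f}")

# If every valence electron is effective, m/m0 = (valence electrons per atom) / r.
print(f"m/m0 for mercury (2 valence electrons): {2 / r_mercury:.1f}")
print(f"m/m0 for tin (4 valence electrons):     {4 / r_tin:.1f}")
```

Small differences from the quoted values of r are to be expected, since the result depends on the densities assumed.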

Dependence of penetration depth on magnetic field 

It has been suggested that λ may depend on the strength of an applied magnetic 
field, i.e. on the strength of the superconducting current (Ginsburg 1947). This can 
be investigated by looking for a change of M without a corresponding change of M′ 
on application of a steady field H parallel to the specimen axis. To eliminate the 
change of M′ caused by coupling of the circuit of the field-producing solenoid, 
readings were taken both with and without the battery in circuit, but with the 
circuit closed in both cases. A good deal of trouble was encountered owing to small 
variations of solenoid current which caused unsteadiness of the bridge zero, and though 
the use of a stud-switch variable resistance rather than a rheostat to control the 
current improved the steadiness, it was possible to work with only small fields, 
i.e. only at temperatures close to T_c. 

Figure 12 shows the results obtained with M 16 at 4.075° K. It can be seen that 
an appreciable apparent change of λ occurs only fairly close to the critical field H_c, 
and that just there, small changes of λ′ (i.e. of M′; Δλ′ is obtained from ΔM′ by 
equation (4)) begin to occur. If, in the usual way, a correction for Δλ′ is applied 
(in this case a factor of 1.04 is appropriate), practically no significant change of M 
remains up to 90 % of the critical field. Unfortunately, the measurements of ΔM′ 
were less accurate in this experiment than usual owing to the above-mentioned 
difficulties and to the necessity of using small measuring current amplitudes; since, 
moreover, the correction factor is rather uncertain, the existence of a real effect is 
not quite excluded. All that can be definitely said is that the change of penetration 
depth with field is certainly less than 2 × 10⁻⁶ cm, i.e. less than 10 % of the 
penetration depth itself, up to 80 % of H_c, and there is no convincing evidence of any increase 
up to 90 % of H_c. Beyond 90 % of H_c the losses rise so steeply that no conclusions 
can be drawn. 

Similar results were found at 4.091 and 4.123° K for the same specimen, and for 
M 15 at 4.122° K, while for M 3 a steep apparent rise of penetration depth began 
much earlier (at 60 % H_c) but was accompanied by a steep rise of ΔM′, so that it is 




probably not significant. Presumably these rises in ΔM′ as H_c is approached have the 
same meaning as for approach to T_c in the temperature-variation curves, namely, 
a growth of the fraction of metal in the normal state. 



Figure 12. Changes of λ and λ′ with field (abscissa H/H_c) for M 16 at 4.075° K. The critical 
field H_c was 15.6 gauss, and the amplitude of the field of the measuring current was 
0.5 gauss or 0.03H_c. 

This negative result on the effect of a steady magnetic field in increasing penetra- 
tion depth suggests that the previous possible indication of such an effect found by 
Desirant & Shoenberg (1948) must have been due to some other cause. 

We wish to thank Professor E. N. da C. Andrade for valuable advice on the 
preparation of mercury crystals, Mr J. V. Smith of the Crystallographic Laboratory 
for the X-ray studies of the mercury crystals, Mr J. W. Menter of the Physics and 
Chemistry of Rubbing Solids Laboratory for electron diffraction studies of tin 
crystals, and Mr A. B. Pippard for valuable discussion of our results. 

References 

Andrade, E. N. da C. & Hutchings, P. J. 1935 Proc. Roy. Soc. A, 148, 120. 
Appleyard, E. T. S., Bristow, J. R., London, H. & Misener, A. D. 1939 Proc. Roy. Soc. A, 172, 540. 
Bridgeman, P. W. 1935 Proc. Amer. Acad. Sci. 60, 305. 
Casimir, H. B. G. 1940 Physica, 7, 887. 
Daunt, J. G., Miller, A. R., Pippard, A. B. & Shoenberg, D. 1948 Phys. Rev. 74, 842. 
Désirant, M. & Shoenberg, D. 1948 Proc. Phys. Soc. 60, 413. 
Fraser, A. R. & Shoenberg, D. 1949 Proc. Camb. Phil. Soc. (in the Press). 





Ginsburg, V. 1945 J. Phys. U.S.S.R. 9, 505. 
Ginsburg, V. 1947 J. Phys. U.S.S.R. 11, 93. 
Grüneisen, E. & Sckell, O. 1934 Ann. Phys., Lpz., 19, 387. 
Keesom, W. H. 1932 Leiden Comm. Suppl. 71 d. 
Laurmann, E. & Shoenberg, D. 1947 Nature, 160, 747. 
London, F. & London, H. 1935 Proc. Roy. Soc. A, 149, 71. 
Pippard, A. B. 1947 Proc. Roy. Soc. A, 191, 399. 
Pippard, A. B. 1948 Nature, 162, 68. 
Sckell, O. 1930 Ann. Phys., Lpz., 6, 932. 
Shalnikov, A. I. & Sharvin, Y. V. 1948 Bull. Acad. Sci. U.R.S.S. (Phys. Series, in Russian), 12, 195. 
Shoenberg, D. 1940 Proc. Roy. Soc. A, 175, 49. 


Electroviscosity. IV. Some extensions of the theory of flow 
of liquids in narrow channels 

By G. A. H. Elton and F. G. Hirschler 
Chemistry Department, Battersea Polytechnic 

(Communicated by J. Kenyon, F.R.S. — Received 26 March 1949) 

In this paper the previous theoretical work on electroviscosity is extended and modified, 
with special reference to the simplifying assumptions necessary in the derivation of the 
equations. 

1. Introduction 

In previous papers in this series (Elton 1948a, 1948b, 1949), it has been shown that 
when an electrical double layer of ions at an interface in an ionic liquid is sheared, 
the electrical resistance to shear may, in certain circumstances, be an important factor 
affecting the rate at which flow takes place in the liquid. For instance, when an ionic 
liquid flows through a narrow channel in a solid, it may exhibit an apparent viscosity 
equal to several times its normal bulk viscosity. On the basis of various simplifying 
assumptions, equations were derived to show how the apparent viscosity of a liquid 
varies under such conditions. For example, it was shown (1948a) that when an ionic 
liquid flows between two similar parallel plates of infinite extent, separated by a 
distance 2h, then the apparent viscosity η_a of the liquid is given by 

where η is the normal viscosity of the liquid, ε is the dielectric constant, ζ the 
electrokinetic potential at the solid-liquid interface, κ the specific conductivity of the liquid 
in the tube, and λ is the Debye-Hückel expression for the effective thickness of the 
double layer. When λ ≪ h, equation (1) reduces to 




Two simplifying assumptions were made in the derivation of equation (1), namely, 
(a) that in calculating the charge distribution in the double layer it was permissible 
to write sinh eψ/kT = eψ/kT, where e is the electronic charge, k the Boltzmann 
constant, T the absolute temperature, and ψ is the potential at any point in the 
double layer; and (b) that the potential at the axis of the tube was zero. 

It will be shown that for a single surface, the first of these approximations, although 
strictly only a limiting one, nevertheless results in an expression for the potential 
distribution which is in good agreement with that obtained by use of the full 
expression for values of ζ as high as 150 mV. The assumption (b) then becomes 
unnecessary, provided that the agreement persists in the case of two surfaces. 


2. The potential distribution at a single surface 

Considering the ions of the double layer as point charges, the Boltzmann 
distribution equation leads, for uni-univalent electrolytes of ionic concentration n, to the 
expression 

$$\rho_x = -2ne\sinh\frac{e\psi}{kT},$$

where ψ is the potential at a distance x from the solid-liquid interface, and ρ_x is the 
corresponding excess charge density. 

When this is combined with the Poisson equation 

$$\frac{d^2\psi}{dx^2} = -\frac{4\pi\rho_x}{\epsilon},$$

the following equation is obtained: 

$$\frac{d^2\psi}{dx^2} = \frac{8\pi ne}{\epsilon}\sinh\frac{e\psi}{kT}. \tag{3}$$

If the substitution eψ/kT = sinh eψ/kT is made, this equation becomes 

$$\frac{d^2\psi}{dx^2} = \frac{8\pi ne^2}{\epsilon kT}\,\psi. \tag{4}$$


From these two equations (3) and (4) it follows that for any given value of ψ, 
d²ψ/dx² is greater when calculated from (3) than when calculated from (4), and that 
dψ/dx as calculated from (3) is more negative than as calculated from (4). At the 
surface the condition is that ψ = ψ₀, a fixed value. Hence ψ as calculated from (3) 
drops more rapidly at first than as calculated from (4), but the slope, dψ/dx, obtained 
from (3) more rapidly levels out than that obtained from (4). Since at x = 0 both 
values of ψ are the same, and these two effects tend to cancel out, the values of ψ 
never become very far separated. 


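(i) Solution of equation (3) 

Multiplying equation (3) throughout by dψ/dx and integrating, 

$$\frac{1}{2}\left(\frac{d\psi}{dx}\right)^2 = \frac{8\pi nkT}{\epsilon}\cosh\frac{e\psi}{kT} + S \tag{5}$$

is obtained, where S is an integration constant. If when x = ∞, ψ = 0 and dψ/dx = 0, 

$$S = -\frac{8\pi nkT}{\epsilon}$$

and 

$$\frac{d\psi}{dx} = \pm 2\left(\frac{8\pi nkT}{\epsilon}\right)^{1/2}\sinh\frac{e\psi}{2kT}. \tag{6}$$

Therefore 

$$\frac{d\psi}{\sinh(e\psi/2kT)} = \pm 2\left(\frac{8\pi nkT}{\epsilon}\right)^{1/2}dx, \tag{7}$$

i.e. 

$$\log\tanh\frac{e\psi}{4kT} = \pm\frac{x}{\lambda} + C, \tag{8}$$

with λ = (εkT/8πne²)^(1/2). When x = 0, ψ = ψ₀, hence C = log tanh(eψ₀/4kT) and 

$$-\frac{x}{\lambda} = \log\frac{\tanh(e\psi/4kT)}{\tanh(e\psi_0/4kT)}$$

(the negative root is taken, since for all values of x, ψ < ψ₀ and 
tanh(eψ/4kT) < tanh(eψ₀/4kT)). Hence 

$$e^{-x/\lambda} = \frac{\tanh(e\psi/4kT)}{\tanh(e\psi_0/4kT)}. \tag{9}$$

Putting A = tanh(eψ₀/4kT), this may be written 

$$e^{e\psi/2kT} = \frac{1 + Ae^{-x/\lambda}}{1 - Ae^{-x/\lambda}}. \tag{10}$$

Hence 

$$\psi = \frac{2kT}{e}\log\frac{1 + Ae^{-x/\lambda}}{1 - Ae^{-x/\lambda}}. \tag{11}$$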


(ii) Solution of equation (4) 

This is a standard equation whose solution is 

$$\psi = Be^{-x/\lambda} + De^{x/\lambda},$$

where B and D are constants; with the boundary conditions x = 0, ψ = ψ₀; x = ∞, ψ = 0, 
this leads to 

$$\psi = \psi_0 e^{-x/\lambda}. \tag{12}$$

Curves showing the relation between ψ and x are drawn in figures 1 to 4. The values 
of ψ were calculated from the equations (11) and (12) respectively, using different 
values of the potential at the surface (ψ₀ = ζ). These curves show that the values 
of ψ as calculated from equation (12) are in good agreement with those calculated 
from equation (11) for values of ζ as high as 150 mV. It may therefore be inferred 
that equation (4), and hence equation (12) also, give a good approximation to the 
true equations of the potential distribution. 
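
The comparison between equations (11) and (12) can be repeated numerically. The sketch below assumes kT/e = 25.85 mV (a temperature close to 300° K, chosen because it reproduces the surface potentials quoted in the figure captions to within a few tenths of a millivolt); the function names are hypothetical.

```python
import math

KT_OVER_E_MV = 25.85  # assumed value of kT/e in millivolts

def surface_potential_mv(a_const):
    """psi_0 obtained from A = tanh(e psi_0 / 4kT)."""
    return 4.0 * KT_OVER_E_MV * math.atanh(a_const)

def psi_full_mv(a_const, x_over_lambda):
    """Equation (11): solution of the full equation (3)."""
    u = a_const * math.exp(-x_over_lambda)
    return 2.0 * KT_OVER_E_MV * math.log((1.0 + u) / (1.0 - u))

def psi_linear_mv(a_const, x_over_lambda):
    """Equation (12): solution of the linearized equation (4)."""
    return surface_potential_mv(a_const) * math.exp(-x_over_lambda)

for a_const in (0.5, 0.75, 0.9, 0.99):
    psi0 = surface_potential_mv(a_const)
    gap = max(psi_linear_mv(a_const, 0.1 * i) - psi_full_mv(a_const, 0.1 * i)
              for i in range(61))
    print(f"A = {a_const}: psi_0 = {psi0:.1f} mV, largest difference = {gap:.1f} mV")
```

Both expressions start from the same ψ₀ at x = 0, and the difference between them grows with ψ₀, which is the behaviour the four figures are intended to display.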



Figures 1 to 4. ψ plotted against x/λ; ○ refers to equation (12), × refers to equation (11). 
Figure 1. In equation (11) A = 0.5, hence ψ₀ = 56.8 mV. 
Figure 2. In equation (11) A = 0.75, hence ψ₀ = 100.7 mV. 
Figure 3. In equation (11) A = 0.9, hence ψ₀ = 152.5 mV. 
Figure 4. In equation (11) A = 0.99, hence ψ₀ = 274 mV. 


3. Potential distribution between two infinite parallel plates distance 2h apart 


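The full equation (3) cannot be solved in general terms in this case. The boundary 
conditions that ψ = ψ₀ when x = 0; ψ = ψ_m and dψ/dx = 0 when x = h; and ψ = ψ₀ 
when x = 2h, change equation (6) to 

$$\frac{d\psi}{dx} = -\left(\frac{16\pi nkT}{\epsilon}\right)^{1/2}(\cosh y - \cosh u)^{1/2}, \tag{13}$$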





where y = eψ/kT, and u = eψ_m/kT. This equation, on integration, gives an elliptic 
integral involving ψ_m; ψ may then be evaluated for various values of x. The function ψ 
is of the form 



X 

2A^(sina') 


+ C, 


(14) 


where logsina' = e'^^jkT and logsin^ = — and 0 is an integration 

constant. 

The problem may be solved if the approximation sinh eψ/kT = eψ/kT is made, 
and equation (3) reduced to equation (4). It has been shown in § 2 that this 
approximation leads to good results for a single surface. Its use for the potential distribution 
between a pair of parallel plates is not thereby justified, but it may be concluded 
that for plates not extremely close together the equations obtained represent the 
potential distribution quite well. The present theory is in any case invalid for plates 
in very close proximity (where h/λ is less than 1), for then the number of ions 
concerned is generally no longer statistical, especially when the effect of the ionic 
atmosphere in gathering together of the ions is remembered. These last considerations 
limit the use of the theory to plates at a moderate distance apart, at low concentration 
because of the high value of λ, and at higher concentrations because of the effect of 
non-statistical numbers of ions between the plates (λ and hence also h being small). 

Under these limitations equation (4) will now be solved for the case of a pair of 
parallel plates of infinite extent. 

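Multiplying the equation throughout by dψ/dx and integrating, 

$$\frac{1}{2}\left(\frac{d\psi}{dx}\right)^2 = \frac{\psi^2}{2\lambda^2} + \text{constant}$$

is obtained. Putting in the condition dψ/dx = 0, ψ = ψ_m at x = h, taking the square 
root, and integrating again, 

$$-\frac{x}{\lambda} = \log\frac{\psi + (\psi^2 - \psi_m^2)^{1/2}}{\psi_0 + (\psi_0^2 - \psi_m^2)^{1/2}} \tag{15}$$

is obtained. Solving this equation for ψ we obtain, for values of x between 0 and h, 

$$\psi = \frac{1}{2}\left(A' e^{-x/\lambda} + \frac{\psi_m^2}{A'}e^{x/\lambda}\right), \tag{16}$$

where A′ = ψ₀ + (ψ₀² − ψ_m²)^(1/2). 

On putting the condition ψ = ψ_m when x = h in equation (15), the following relation 
between ψ_m and ψ₀ is obtained: 

$$\psi_m = \frac{\psi_0}{\cosh(h/\lambda)}. \tag{17}$$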
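
Equations (16) and (17) can be checked against their boundary conditions with a few lines of code. This is a sketch only; the function names are hypothetical, distances are expressed in units of λ, and the value ψ₀ = 100 is arbitrary.

```python
import math

def midplane_potential(psi0, h_over_lambda):
    """Equation (17): psi_m = psi_0 / cosh(h / lambda)."""
    return psi0 / math.cosh(h_over_lambda)

def potential_between_plates(psi0, h_over_lambda, x_over_lambda):
    """Equation (16), with A' = psi_0 + (psi_0**2 - psi_m**2) ** 0.5."""
    psi_m = midplane_potential(psi0, h_over_lambda)
    a_prime = psi0 + math.sqrt(psi0 ** 2 - psi_m ** 2)
    return 0.5 * (a_prime * math.exp(-x_over_lambda)
                  + (psi_m ** 2 / a_prime) * math.exp(x_over_lambda))

psi0 = 100.0  # arbitrary surface potential
for ratio in (1.0, 3.0, 10.0):
    at_wall = potential_between_plates(psi0, ratio, 0.0)
    at_centre = potential_between_plates(psi0, ratio, ratio)
    print(f"h/lambda = {ratio}: wall {at_wall:.3f}, centre {at_centre:.5f}, "
          f"psi_m/psi_0 = {midplane_potential(1.0, ratio):.2e}")
```

The potential returns ψ₀ at the wall and ψ_m at the centre, and for h/λ = 10 the ratio ψ_m/ψ₀ falls below 10⁻⁴, so that the two double layers are then effectively independent.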


4. Derivation of the equation of flow between parallel plates 

As previously shown (1948a), the quantity of liquid Q flowing per second through 
a section of the system of unit length and breadth under a pressure gradient P is 

$$Q = \int_0^{2h} v\,dx = \frac{1}{\eta}\int_0^{2h}\left[\frac{Px}{2}(2h - x) + \frac{\epsilon E}{4\pi}(\psi - \psi_0)\right]dx, \tag{18}$$





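where v is the rate of flow at distance x from one plate, and E is the streaming 
potential gradient set up in the plane of flow. Substituting for ψ from equation (16), 

$$Q = \frac{2}{\eta}\int_0^h\left[\frac{Px}{2}(2h - x) + \frac{\epsilon E}{4\pi}\left(\frac{A'}{2}e^{-x/\lambda} + \frac{\psi_m^2}{2A'}e^{x/\lambda} - \psi_0\right)\right]dx, \tag{19}$$

since by symmetry ∫₀^{2h} v dx = 2∫₀^{h} v dx. 

Equation (19) on integration gives 

$$Q = \frac{2Ph^3}{3\eta} - \frac{\epsilon E}{4\pi\eta}\left[2\psi_0 x + \lambda A' e^{-x/\lambda} - \frac{\lambda\psi_m^2}{A'}e^{x/\lambda}\right]_0^h.$$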


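Let the reduced rate of flow result in an apparent viscosity η_a, where 

$$Q = \frac{2Ph^3}{3\eta_a}, \quad E = \frac{\epsilon P\zeta}{4\pi\eta\kappa} \quad\text{and}\quad \psi_0 = \zeta. \tag{20}$$

Then 

$$\eta_a = \eta + \frac{3\epsilon^2\zeta}{32\pi^2\kappa h^3}\left[2\zeta h - \lambda A'(1 - e^{-h/\lambda}) - \frac{\lambda\psi_m^2}{A'}(e^{h/\lambda} - 1)\right], \tag{21}$$

where A′ = ζ + (ζ² − ψ_m²)^(1/2). From equation (21) we can calculate values of η_a when 
ψ_m is not zero. The required values of ψ_m may be obtained from equation (17), or 
they may be calculated from the full equation by means of the solution (14). Values 
of ψ_m for various values of h/λ and ψ₀, obtained in this way, are given by Verwey & 
Overbeek (1948). When ψ_m is substituted from equation (17) equation (21) is 
reduced to 

$$\eta_a = \eta + \frac{3\epsilon^2\zeta^2}{16\pi^2\kappa h^2}\left[1 - \frac{\lambda}{h}\tanh\frac{h}{\lambda}\right]. \tag{22}$$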


It should be noted here that the term ζ² appearing in equation (22) is not a 
consequence of the approximation eψ/kT = sinh eψ/kT, but one factor ζ comes from 
the expression for the streaming potential E, and the other from the integration 
∫ψ dx. Hence the error introduced by the approximation appears in the final equation 
only in the magnitude of the error in the integral ∫ψ dx. From the figures 1 to 4 it will 
be seen that for values of ζ less than 150 mV the areas under the two curves differ 
only very little. In table 1 values of F′, the ratio of η_a − η calculated from equation 
(22) to those calculated from equation (1), are given for different systems. For 
moderate values of h/λ the agreement is good. 


5. Derivation of flow equation between parallel plates separated by a large distance 

For plates separated by a distance 2h, where h/λ is greater than 10, the potential 
at the centre of the tube, ψ_m, is less than 1/10,000th of the potential at the surface. 
Consequently, the assumption that the double layers at the two surfaces do not 
interfere is valid. The electroviscous effect may then be calculated without making 
the approximation sinh eψ/kT = eψ/kT. 





Table 1 

h (cm)        electrolyte normality    h/λ      F′ 
1 × 10⁻⁵      1 × 10⁻⁶                 0.32     0.14 
              1 × 10⁻⁵                 1.03     0.70 
              1 × 10⁻⁴                 3.24     0.98 
              4 × 10⁻⁴                 6.49     1.00 
3 × 10⁻⁵      1 × 10⁻⁶                 0.97     0.65 
              1 × 10⁻⁵                 3.08     1.00 
              1 × 10⁻⁴                 9.73     1.00 
5 × 10⁻⁵      1 × 10⁻⁶                 1.62     0.95 
              1 × 10⁻⁵                 5.13     1.00 
              1 × 10⁻⁴                 16.2     1.00 


Since there is no interference the potential distribution at both the surfaces is 
the same, and the same as that for a single double layer; hence the distribution is 
given by equation (11). Substituting for ψ in equation (18) from equation (11), 
expanding the logarithm and integrating, the expression 

$$Q = \frac{2Ph^3}{3\eta} - \frac{\epsilon E\psi_0 h}{2\pi\eta} + \frac{2\epsilon kT\lambda E}{\pi\eta e}\left[A(1 - e^{-h/\lambda}) + \frac{A^3}{9}(1 - e^{-3h/\lambda}) + \dots\right]$$


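is obtained. Substituting for Q, E and ψ₀ as before and rearranging, 

$$\eta_a = \eta + \frac{3\epsilon^2\zeta^2}{16\pi^2\kappa h^2}\left[1 - \frac{4kT\lambda}{e\zeta h}\left\{A(1 - e^{-h/\lambda}) + \frac{A^3}{9}(1 - e^{-3h/\lambda}) + \dots\right\}\right]. \tag{22a}$$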


6. Flow in a cylindrical tube. ψ_m = 0, sinh eψ/kT = eψ/kT 

This case has been examined in a previous paper (1948a). It is proposed here to 
give a full solution of the problem, and to demonstrate that the equation reduces 
in the limit to that obtained previously by an approximate method. 


(i) Potential distribution in a cylindrical tube 

The Poisson-Boltzmann distribution equation for a cylindrical tube is, when 
sinh eψ/kT = eψ/kT, 

$$\frac{1}{r}\frac{d}{dr}\left(r\frac{d\psi}{dr}\right) = \frac{\psi}{\lambda^2}, \tag{23}$$

where ψ is the potential at a point distant r from the axis of the tube. Equation (9) in 
part I of this series is not a correct solution to this differential equation. 
The solution should be 

$$\psi = M I_0(r/\lambda) + N K_0(r/\lambda), \tag{24}$$

where I₀(r/λ) is a Bessel function of imaginary argument of the first kind and zero 
order and K₀(r/λ) is a Bessel function of the second kind and zero order. M and N 
are constants of integration. 





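For a cylindrical tube of radius a, when dψ/dr = 0 for r = 0 and ψ = ψ₀ for r = a, 
N = 0 and 

$$\psi = \psi_0\frac{I_0(r/\lambda)}{I_0(a/\lambda)}. \tag{25}$$

(ii) Derivation of the flow equation 

The quantity of liquid passing through the tube in unit time is given by 

$$Q = \int_0^a 2\pi r v\,dr, \tag{26}$$

where v, the rate of flow of liquid at distance r from the axis, is given by 

$$v = \frac{P(a^2 - r^2)}{4\eta} - \frac{\epsilon E}{4\pi\eta}(\psi_0 - \psi). \tag{27}$$

Substituting from equations (25) and (27) in equation (26) and integrating, we obtain 

$$Q = \frac{\pi Pa^4}{8\eta} - \frac{\epsilon E\psi_0}{2\eta}\int_0^a r\left[1 - \frac{I_0(r/\lambda)}{I_0(a/\lambda)}\right]dr.$$

Integrating the second term by parts we obtain 

$$Q = \frac{\pi Pa^4}{8\eta} - \frac{\epsilon E\psi_0}{4\eta\lambda I_0(a/\lambda)}\int_0^a r^2 I_1(r/\lambda)\,dr,$$

which on use of the properties of Bessel functions reduces to 

$$Q = \frac{\pi Pa^4}{8\eta} - \frac{\epsilon E\psi_0 a^2}{4\eta}\left[1 - \frac{2\lambda}{a}\frac{I_1(a/\lambda)}{I_0(a/\lambda)}\right].$$

If the reduced rate of flow results in an apparent viscosity η_a we obtain, on putting 
Q = πPa⁴/8η_a, E = εPζ/4πηκ and ψ₀ = ζ, 

$$\eta_a = \eta + \frac{\epsilon^2\zeta^2}{2\pi^2\kappa a^2}\left[1 - \frac{2\lambda}{a}\frac{I_1(a/\lambda)}{I_0(a/\lambda)}\right]. \tag{30}$$

For values of a/λ greater than 9, the first two terms of the asymptotic expansions 
of the Bessel functions give accurate values. Hence 

$$\frac{2\lambda}{a}\frac{I_1(a/\lambda)}{I_0(a/\lambda)} \simeq \frac{2\lambda}{a}\,\frac{1 - 3\lambda/8a}{1 + \lambda/8a}.$$

Using the binomial theorem this reduces to 

$$\frac{2\lambda}{a}\frac{I_1(a/\lambda)}{I_0(a/\lambda)} \simeq \frac{2\lambda}{a}\left(1 - \frac{3\lambda}{8a}\right)\left(1 - \frac{\lambda}{8a}\right) \simeq \frac{2\lambda}{a}\left(1 - \frac{\lambda}{2a}\right).$$

Hence 

$$1 - \frac{2\lambda}{a}\frac{I_1(a/\lambda)}{I_0(a/\lambda)} \simeq 1 - \frac{2\lambda}{a} + \frac{\lambda^2}{a^2} = \left(1 - \frac{\lambda}{a}\right)^2$$

and 

$$\eta_a = \eta + \frac{\epsilon^2\zeta^2}{2\pi^2\kappa a^2}\left(1 - \frac{\lambda}{a}\right)^2. \tag{31}$$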
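
The quality of the asymptotic step leading to equation (31) can be examined numerically. The sketch below is an illustration under stated assumptions: the modified Bessel functions are summed from their power series (adequate for moderate arguments) so that no external library is needed, and the function names are hypothetical.

```python
import math

def bessel_i(order, x, terms=60):
    """Modified Bessel function I_order(x) summed from its power series."""
    total = 0.0
    for k in range(terms):
        total += (x / 2.0) ** (2 * k + order) / (
            math.factorial(k) * math.factorial(k + order))
    return total

def exact_factor(a_over_lambda):
    """1 - (2 lambda / a) I1(a/lambda) / I0(a/lambda), the factor in equation (30)."""
    x = a_over_lambda
    return 1.0 - (2.0 / x) * bessel_i(1, x) / bessel_i(0, x)

def approximate_factor(a_over_lambda):
    """(1 - lambda / a) ** 2, the factor in equation (31)."""
    return (1.0 - 1.0 / a_over_lambda) ** 2

for ratio in (3.0, 9.0, 10.0, 20.0):
    print(f"a/lambda = {ratio}: exact {exact_factor(ratio):.4f}, "
          f"approximate {approximate_factor(ratio):.4f}")
```

For a/λ of about 10 the two factors agree closely, while for smaller ratios the exact Bessel form of equation (30) should be preferred.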





In this equation, as in equation (22) of § 4, the term ζ² does not arise from the 
approximation sinh eψ/kT = eψ/kT, but one factor ζ comes from the expression for E, the 
streaming potential gradient, and the other from the integral of ψ over the tube. As 
shown in § 4 the error involved is therefore of the order of the error in the integral. 


7. Conclusion 

It has been shown in this paper that the Debye-Hückel approximation, 
sinh eψ/kT = eψ/kT, leads to potential distribution equations at single flat surfaces 
which are in good agreement with those calculated from the full expression, for 
values of the electrokinetic potential up to 150 mV. It has been deduced that the 
electroviscous effect between parallel plates calculated from these equations is not 
greatly in error for all plate separations for which viscosity can be treated as a 
statistical phenomenon. An expression for the electroviscous effect in a cylindrical 
capillary tube was obtained, making the assumption that the potential at the axis 
is zero. By analogy with the case of parallel plates this assumption will be good if 
the ratio of radius of capillary to double-layer thickness is greater than 10. The 
expression is useful for all capillaries likely to be encountered in practice. For example, in 
a capillary of radius 1 μ, η_a may still be appreciably larger than η. The error here will 
be larger than for the case of a corresponding parallel-plate system, since the 
potential at the axis will be higher than in the case of parallel plates. 

The authors wish to express their thanks to Dr K. L. Sutherland and Dr W. E. 
Ewers for their interest and assistance in this work. 


References

Elton, G. A. H. 1948a Proc. Roy. Soc. A, 194, 269.

Elton, G. A. H. 1948b Proc. Roy. Soc. A, 194, 276.

Elton, G. A. H. 1949 Proc. Roy. Soc. A, 197, 668.

Verwey, E. J. W. & Overbeek, J. Th. G. 1948 Theory of the stability of lyophobic colloids.
London: Elsevier Publishing Company Inc.



INDEX TO VOLUME 198 (A) 

Anisotropic metals, plane plastic strain (Hill), 428. 

Auluck, F. C. & Kothari, D. S. A note on the Riesz method and the method of residues,
170.

Autoxidation of tetralin (Bamford & Dewar), 252. 

Baker, J. F., Horne, M. R. & Roderick, J. W. The behaviour of continuous stanchions,
493.

Bamford, C. H. & Dewar, M. J. S. The autoxidation of tetralin, 252.

Barber, N. F. The behaviour of waves on tidal streams, 81. 

Behaviour of continuous stanchions (Baker, Horne & Roderick), 493. 

Behaviour of waves on tidal streams (Barber), 81. 

Bell, R. P., Gelles, E. & Möller, E. Kinetics of the base-catalyzed halogenation of some
ketones and esters, 308.

Bennett, J. G., Brown, R. L. & Thring, M. W. Unified field theory in a curvature-free 
five-dimensional manifold, 39. 

Benzylamine C-N bond, dissociation energy (Szwarc), 285. 

Birks, J. & Bradley, R. S. The rate of evaporation of droplets. II. The influence of 
changes of temperature and of the surrounding gas on the rate of evaporation of 
drops of di-n-butyl phthalate, 226.

Birth and growth of explosion in liquids and solids initiated by impact and friction 
(Bowden & Gurton), 350. 

Bleaney, B., Penrose, R. P. & Plumpton, B. I. Paramagnetic resonance in the copper
Tutton salts, 406. 

Bowden, F. P. & Gurton, O. A. Birth and growth of explosion in liquids and solids
initiated by impact and friction, 350.

Bowden, F. P. & Gurton, O. A. Initiation of solid explosives by impact and friction: the
influence of grit, 337.

Bradley, R. S. See Birks & Bradley. 

Bradley, R. S. & Shellard, A. D. The rate of evaporation of droplets. III. Vapour pressures 
and rates of evaporation of straight-chain paraffin hydrocarbons, 239.

Brown, R. L. See Bennett, Brown & Thring.

Catalytic hydrogenation of methyl elaeostearate, and of mixtures of elaeostearic with 
other polyethenoid long-chain esters (Hilditch & Pathak), 323.

Copper Tutton salts, paramagnetic resonance (Bleaney, Penrose & Plumpton), 406. 

Corner, J. The effect of diffusion of the main reactants on flame speeds in gases, 388.

Dewar, M. J. S. See Bamford & Dewar. 

Diffraction of blast. I (Lighthill), 454.

Diffusion in gases and liquids. I, II (Yang), 94, 471.

Dissociation energy of the C-N bond in benzylamine (Szwarc), 285.

Dissociation energy of the N-N bond in hydrazine (Szwarc), 267.

Dixon-Lewis, G. Studies in polymerization. V. The polymerization of vinyl acetate, 510. 

Eddy diffusion of water vapour and heat near the ground (Pasquill), 116. 

Eden, R. J. Heisenberg's S matrix for a system of many particles, 540.

Effect of diffusion of the main reactants on flame speeds in gases (Corner), 388.

Electroviscosity. IV. Some extensions of the theory of flow of liquids in narrow channels
(Elton & Hirschler), 581. 



Elton, G. A. H. & Hirschler, F. G. Electroviscosity. IV. Some extensions of the theory
of flow of liquids in narrow channels, 581. 

Explosion in liquids and solids (Bowden & Gurton), 337. 

Explosion in liquids and solids (Yoffe), 373.

Flame speeds in gases (Corner), 388.

Frank, F. C. & Merwe, J. H. van der. One-dimensional dislocations. I. Static theory, 
205. 

Frank, F. C. & Merwe, J. H. van der. One-dimensional dislocations. II. Misfitting
monolayers and orientated overgrowth, 216. 

Frost, R. A note on polar air-mass modification, 27.

Gelles, E. See Bell, Gelles & Möller.

Gurton, O. A. See Bowden & Gurton.

Harington, Sir Charles. The work of the National Institute for Medical Research, 293. 

Heats of formation of free CN and free CHg, and the relationship between D(CO), D(CN)
and D(N2) (Long), 62.

Heisenberg’s S matrix for a system of many particles (Eden), 540. 

Helium II, film phenomena (Temperley), 438. 

Hilditch, T. P. & Pathak, S. P. The catalytic hydrogenation of methyl elaeostearate, 
and of mixtures of elaeostearic with other polyethenoid long-chain esters, 323. 

Hill, R. The theory of plane plastic strain for anisotropic metals, 428. 

Hirschler, F. G. See Elton & Hirschler. 

Horne, M. R. See Baker, Horne & Roderick. 

Hot-wire investigation of the wake behind cylinders at low Reynolds numbers
(Kovásznay), 174.

Hydrazine N-N bond, dissociation energy (Szwarc), 267. 

Ignition of solid explosive media by hot wires (Jones), 523. 

Influence of entrapped gas on initiation of explosion in liquids and solids (Yoffe), 373. 

Initiation of solid explosives by impact and friction: the influence of grit (Bowden
& Gurton), 337. 

Jones, E. The ignition of solid explosive media by hot wires, 523. 

Jones, Sir Harold Spencer. The Royal Greenwich Observatory, 141. 

Kaminski, A. & McBain, J. W. Spontaneous emulsification of pure xylene in an aqueous 
solution through mere adsorption of a detergent in the interface, 447. 

Ketones and esters, halogenation (Bell, Gelles & Möller), 308.

Kinetic theory of diffusion in gases and liquids. I. Diffusion and the Brownian motion 
(Yang), 94. 

Kinetic theory of diffusion in gases and liquids. II. General kinetic theory of liquids
mixtures (Yang), 471.

Kinetics of the base-catalyzed halogenation of some ketones and esters (Bell, Gelles
& Möller), 308.

Kothari, D. S. See Auluck & Kothari.

Kovásznay, L. S. G. Hot-wire investigation of the wake behind cylinders at low Reynolds
numbers, 174. 

Laurmann, E. & Shoenberg, D. Penetration of magnetic field into superconductors. 
II. Measurements by the Casimir method, 560.

Lennard-Jones, Sir John. The molecular orbital theory of chemical valency. I. The 
determination of molecular orbitals, 1. 




Lennard-Jones, Sir John. The molecular orbital theory of chemical valency. II. Equivalent
orbitals in molecules of known symmetry, 14. 

Lighthill, M. J. The diffraction of blast. I, 454.

Liquids, flow (Elton & Hirschler), 581. 

Long, L. H. The heats of formation of free CN and free CHg, and the relationship between
D(CO), D(CN) and D(N2), 62.

McBain, J. W. See Kaminski & McBain. 

Merwe, J. H. van der. See Frank & van der Merwe.

Molecular orbital theory of chemical valency. I. The determination of molecular orbitals
(Lennard-Jones), 1.

Molecular orbital theory of chemical valency. II. Equivalent orbitals in molecules of
known symmetry (Lennard-Jones), 14.

Möller, E. See Bell, Gelles & Möller.

National Institute for Medical Research (Harington), 293.

Note on polar air-mass modification (Frost), 27.

Note on the Riesz method and the method of residues (Auluck & Kothari), 170. 

Nye, J. F. Plastic deformation of silver chloride. I. Internal stresses and the glide 
mechanism, 190. 

One-dimensional dislocations. I, Static theory (Frank & van der Merwe), 205. 

One-dimensional dislocations. II. Misfitting monolayers and oriented overgrowth (Frank 
& van der Merwe), 216. 

Paramagnetic resonance in the copper Tutton salts (Bleaney, Penrose & Plumpton), 406. 

Pasquill, F. Eddy diffusion of water vapour and heat near the ground, 116. 

Pathak, S. P. See Hilditch & Pathak. 

Penetration of magnetic field into superconductors. II. Measurements by the Casimir 
method (Laurmann & Shoenberg), 560.

Penrose, R. P. See Bleaney, Penrose & Plumpton. 

Plastic deformation of silver chloride. I. Internal stresses and the glide mechanism (Nye), 
190. 

Plumpton, B. I. See Bleaney, Penrose & Plumpton. 

Polar air-mass modification (Frost), 27.

Polymerization (Dixon-Lewis), 510. 

Rate of evaporation of droplets. II. The influence of changes of temperature and of the 
surrounding gas on the rate of evaporation of drops of di-n-butyl phthalate (Birks
& Bradley), 226. 

Rate of evaporation of droplets. III. Vapour pressures and rates of evaporation of 
straight-chain paraffin hydrocarbons (Bradley & Shellard), 239.

Riesz method and method of residues (Auluck & Kothari), 170.

Roderick, J. W. See Baker, Horne & Roderick.

Royal Greenwich Observatory (Jones), 141.

Shellard, A. D. See Bradley & Shellard.

Shoenberg, D. See Laurmann & Shoenberg.

Silver chloride, plastic deformation (Nye), 190.

Solid explosives, ignition (Jones), 523.

Solid explosives, initiation (Bowden & Gurton), 337.

Spontaneous emulsification of pure xylene in an aqueous solution through mere adsorption
of a detergent in the interface (Kaminski & McBain), 447.





Stanchions, behaviour (Baker, Horne & Roderick), 493. 

Studies in polymerization. V. The polymerization of vinyl acetate (Dixon-Lewis), 510. 
Superconductors, penetration of magnetic field (Laurmann & Shoenberg), 560. 

Szwarc, M. The dissociation energy of the C-N bond in benzylamine, 285. 

Szwarc, M. The dissociation energy of the N-N bond in hydrazine, 267. 

Temperley, H. N. V. A theory of the film phenomena of liquid helium II, 438.

Tetralin, autoxidation (Bamford & Dewar), 252.

Theory of plane plastic strain for anisotropic metals (Hill), 428.

Theory of the film phenomena of liquid helium II (Temperley), 438.

Thring, M. W. See Bennett, Brown & Thring.

Tidal streams, waves on (Barber), 81. 

Unified field theory in a curvature-free five-dimensional manifold (Bennett, Brown 
& Thring), 39. 

Wake behind cylinders, hot-wire investigation (Kovásznay), 174.

Water vapour and heat, eddy diffusion of (Pasquill), 116. 

Work of the National Institute for Medical Research (Harington), 293. 

Xylene, spontaneous emulsification (Kaminski & McBain), 447.

Yang, L. M. Kinetic theory of diffusion in gases and liquids. I. Diffusion and the
Brownian motion, 94. 

Yang, L. M. Kinetic theory of diffusion in gases and liquids. II. General kinetic theory 
of liquids mixtures, 471. 

Yoffe, A. Influence of entrapped gas on initiation of explosion in liquids and solids, 373. 


END OF THE ONE HUNDRED AND NINETY-EIGHTH VOLUME (SERIES A)











